Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis

This research paper aims to analyze and predict fraud patterns among failed companies in Malaysia. The approach involves utilizing textual analysis on the management discussion and analysis (MD&A) section within the annual reports. The dataset is subjected to text clustering to group companies b...

Full description

Bibliographic Details
Published in:Indonesian Journal of Electrical Engineering and Computer Science
Main Author: Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85185777743&doi=10.11591%2fijeecs.v33.i3.pp1620-1631&partnerID=40&md5=e96f76f1416b0488ac83da68ddddbfea
id 2-s2.0-85185777743
spelling 2-s2.0-85185777743
Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
2024
Indonesian Journal of Electrical Engineering and Computer Science
33
3
10.11591/ijeecs.v33.i3.pp1620-1631
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85185777743&doi=10.11591%2fijeecs.v33.i3.pp1620-1631&partnerID=40&md5=e96f76f1416b0488ac83da68ddddbfea
This research paper aims to analyze and predict fraud patterns among failed companies in Malaysia. The approach involves utilizing textual analysis on the management discussion and analysis (MD&A) section within the annual reports. The dataset is subjected to text clustering to group companies based on similar financial characteristics. This clustering process entails several steps, including data conversion, collation, and summarization into a structured format, followed by text pre-processing to cleanse the dataset. Notably, RapidMiner Studio software was utilized to extract data for the study. Subsequently, the documents are clustered using both the K-means and latent dirichlet allocation (LDA) methods. Upon examining a sample of 22 failed companies in the year 2020, the study reveals that financially distressed companies exhibit prominent financial negativity and utilize litigious financial terms within their MD&A sections. These linguistic traits are found to be closely associated with seven distinct characteristics of fraudulent firms. This preliminary findings provide compelling evidence that financial pressure may serve as a triggering factor for fraudulent activities within companies. © 2024 Institute of Advanced Engineering and Science. All rights reserved.
Institute of Advanced Engineering and Science
25024752
English
Article
All Open Access; Gold Open Access
author Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
spellingShingle Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
author_facet Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
author_sort Marzuki M.M.; Nasir S.A.M.; Zain S.F.M.; Mangsor N.S.M.N.
title Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
title_short Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
title_full Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
title_fullStr Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
title_full_unstemmed Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
title_sort Predicting likelihood of fraud among financial distressed firms in Malaysia using textual analysis
publishDate 2024
container_title Indonesian Journal of Electrical Engineering and Computer Science
container_volume 33
container_issue 3
doi_str_mv 10.11591/ijeecs.v33.i3.pp1620-1631
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85185777743&doi=10.11591%2fijeecs.v33.i3.pp1620-1631&partnerID=40&md5=e96f76f1416b0488ac83da68ddddbfea
description This research paper aims to analyze and predict fraud patterns among failed companies in Malaysia. The approach involves utilizing textual analysis on the management discussion and analysis (MD&A) section within the annual reports. The dataset is subjected to text clustering to group companies based on similar financial characteristics. This clustering process entails several steps, including data conversion, collation, and summarization into a structured format, followed by text pre-processing to cleanse the dataset. Notably, RapidMiner Studio software was utilized to extract data for the study. Subsequently, the documents are clustered using both the K-means and latent dirichlet allocation (LDA) methods. Upon examining a sample of 22 failed companies in the year 2020, the study reveals that financially distressed companies exhibit prominent financial negativity and utilize litigious financial terms within their MD&A sections. These linguistic traits are found to be closely associated with seven distinct characteristics of fraudulent firms. This preliminary findings provide compelling evidence that financial pressure may serve as a triggering factor for fraudulent activities within companies. © 2024 Institute of Advanced Engineering and Science. All rights reserved.
publisher Institute of Advanced Engineering and Science
issn 25024752
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809677882459422720