Multi-Class Imbalanced Data Classification: A Systematic Mapping Study


Bibliographic Details
Published in: Engineering, Technology and Applied Science Research
Main Author: Wang Y.; Rosli M.M.; Musa N.; Li F.
Format: Article
Language: English
Published: Dr D. Pylarinos 2024
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d
Description
Summary: Multi-class data classification is a significant and challenging research topic in contemporary machine learning, particularly for imbalanced data sets, so a thorough investigation of multi-class imbalanced data classification is increasingly pertinent. This paper presents an overview of multi-class imbalanced data classification, produced through a systematic mapping study that analyzes the current state of the field, with the primary goal of ascertaining the corpus of research undertaken in machine learning. To achieve this aim, 7,164 papers were assessed and 147 prominent ones were selected from five digital libraries; these were then categorized by technique, issue, and type of dataset. After a thorough review of these papers, a taxonomy of multi-class imbalanced data classification techniques is proposed. The results show that researchers widely employ algorithmic-level, ensemble, and oversampling strategies to address multi-class imbalance in medical datasets, primarily to mitigate the impact of challenging data factors. This research highlights an urgent need for more studies on multi-class imbalanced data classification. © by the authors.
ISSN: 2241-4487
DOI: 10.48084/etasr.7206