A review of dimension reduction techniques for classification on high-dimensional data

With the technological advancements, vast amounts of data are generated in various domains, leading to massive growth in data sizes, dimensions and complexity. Big data, often associated with high dimensionality, poses significant challenges to the classification process. Certain technique such as d...

Full description

Bibliographic Details
Published in:AIP Conference Proceedings
Main Author: Hasan S.N.S.; Jamil N.W.; Ahmat H.
Format: Conference paper
Language:English
Published: American Institute of Physics Inc. 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179778302&doi=10.1063%2f5.0177327&partnerID=40&md5=65b4dae8577f4d0e3ed9cb63834e4a98
id 2-s2.0-85179778302
spelling 2-s2.0-85179778302
Hasan S.N.S.; Jamil N.W.; Ahmat H.
A review of dimension reduction techniques for classification on high-dimensional data
2023
AIP Conference Proceedings
2896
1
10.1063/5.0177327
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179778302&doi=10.1063%2f5.0177327&partnerID=40&md5=65b4dae8577f4d0e3ed9cb63834e4a98
With the technological advancements, vast amounts of data are generated in various domains, leading to massive growth in data sizes, dimensions and complexity. Big data, often associated with high dimensionality, poses significant challenges to the classification process. Certain technique such as dimension reduction is required to enhance the accuracy of classification task whilst improving the prediction performance. Hence, this review aims at presenting the study of dimension reduction techniques for the classification of high-dimensional data. The Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) criteria are followed while conducting the systematic literature review. Publications from 2017 to 2021 on Scopus and Web of Science databases were searched with dimension reduction, classification and high-dimensional data as the keyword. From 129 articles, 29 studies were covered in this systematic review. In general, these studies report the performance of classification and the empirical comparisons of dimension reduction techniques applied to the high-dimensional dataset. Based on the articles reviewed, most of the proposed method significantly increases the accuracy rates and are able to improve the performance of high-dimensional data. Some of the new methods proposed resulted in higher accuracy when compared with the widely applied dimension reduction methods. Despite that, the overall evaluation showed that all studies presented the efficacy of the proposed dimension reduction approaches along with the improvement in the classification performance. However, there is no clear winner which dimensionality reduction technique performs best but reducing the number of features when analyzing the high-dimensional data is significant for the accuracy performance. © 2023 Author(s).
American Institute of Physics Inc.
0094243X
English
Conference paper

author Hasan S.N.S.; Jamil N.W.; Ahmat H.
spellingShingle Hasan S.N.S.; Jamil N.W.; Ahmat H.
A review of dimension reduction techniques for classification on high-dimensional data
author_facet Hasan S.N.S.; Jamil N.W.; Ahmat H.
author_sort Hasan S.N.S.; Jamil N.W.; Ahmat H.
title A review of dimension reduction techniques for classification on high-dimensional data
title_short A review of dimension reduction techniques for classification on high-dimensional data
title_full A review of dimension reduction techniques for classification on high-dimensional data
title_fullStr A review of dimension reduction techniques for classification on high-dimensional data
title_full_unstemmed A review of dimension reduction techniques for classification on high-dimensional data
title_sort A review of dimension reduction techniques for classification on high-dimensional data
publishDate 2023
container_title AIP Conference Proceedings
container_volume 2896
container_issue 1
doi_str_mv 10.1063/5.0177327
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179778302&doi=10.1063%2f5.0177327&partnerID=40&md5=65b4dae8577f4d0e3ed9cb63834e4a98
description With the technological advancements, vast amounts of data are generated in various domains, leading to massive growth in data sizes, dimensions and complexity. Big data, often associated with high dimensionality, poses significant challenges to the classification process. Certain technique such as dimension reduction is required to enhance the accuracy of classification task whilst improving the prediction performance. Hence, this review aims at presenting the study of dimension reduction techniques for the classification of high-dimensional data. The Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) criteria are followed while conducting the systematic literature review. Publications from 2017 to 2021 on Scopus and Web of Science databases were searched with dimension reduction, classification and high-dimensional data as the keyword. From 129 articles, 29 studies were covered in this systematic review. In general, these studies report the performance of classification and the empirical comparisons of dimension reduction techniques applied to the high-dimensional dataset. Based on the articles reviewed, most of the proposed method significantly increases the accuracy rates and are able to improve the performance of high-dimensional data. Some of the new methods proposed resulted in higher accuracy when compared with the widely applied dimension reduction methods. Despite that, the overall evaluation showed that all studies presented the efficacy of the proposed dimension reduction approaches along with the improvement in the classification performance. However, there is no clear winner which dimensionality reduction technique performs best but reducing the number of features when analyzing the high-dimensional data is significant for the accuracy performance. © 2023 Author(s).
publisher American Institute of Physics Inc.
issn 0094243X
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677579475484672