Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key che...
發表在: | JOURNAL OF ESSENTIAL OIL BEARING PLANTS |
---|---|
Main Authors: | , , , |
格式: | Article; Early Access |
語言: | English |
出版: |
TAYLOR & FRANCIS LTD
2025
|
主題: | |
在線閱讀: | https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001 |
author |
Yusoff Zakiah Mohd; Ismail Nurlaila; Sabri Noor Aida Syakira Ahmad |
---|---|
spellingShingle |
Yusoff Zakiah Mohd; Ismail Nurlaila; Sabri Noor Aida Syakira Ahmad Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil Plant Sciences |
author_facet |
Yusoff Zakiah Mohd; Ismail Nurlaila; Sabri Noor Aida Syakira Ahmad |
author_sort |
Yusoff |
spelling |
Yusoff, Zakiah Mohd; Ismail, Nurlaila; Sabri, Noor Aida Syakira Ahmad Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil JOURNAL OF ESSENTIAL OIL BEARING PLANTS English Article; Early Access This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key chemical compounds contributing to quality differentiation. The proposed model achieved 100% classification accuracy at k = 1 across all distance metrics with Mahalanobis distance consistently outperforming others as k increased. Cross-validation results demonstrated the lowest error rates for Mahalanobis followed by Correlation and Cosine, confirming its robustness in high-dimensional chemical datasets. These findings highlight the effectiveness of machine learning in agarwood oil grading and underscore the importance of selecting optimal distance metrics for improved classification accuracy. Future work could incorporate ensemble learning and advanced feature selection to further refine performance. TAYLOR & FRANCIS LTD 0972-060X 0976-5026 2025 10.1080/0972060X.2025.2469677 Plant Sciences Green Submitted WOS:001437517500001 https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001 |
title |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
title_short |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
title_full |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
title_fullStr |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
title_full_unstemmed |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
title_sort |
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil |
container_title |
JOURNAL OF ESSENTIAL OIL BEARING PLANTS |
language |
English |
format |
Article; Early Access |
description |
This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key chemical compounds contributing to quality differentiation. The proposed model achieved 100% classification accuracy at k = 1 across all distance metrics with Mahalanobis distance consistently outperforming others as k increased. Cross-validation results demonstrated the lowest error rates for Mahalanobis followed by Correlation and Cosine, confirming its robustness in high-dimensional chemical datasets. These findings highlight the effectiveness of machine learning in agarwood oil grading and underscore the importance of selecting optimal distance metrics for improved classification accuracy. Future work could incorporate ensemble learning and advanced feature selection to further refine performance. |
publisher |
TAYLOR & FRANCIS LTD |
issn |
0972-060X 0976-5026 |
publishDate |
2025 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1080/0972060X.2025.2469677 |
topic |
Plant Sciences |
topic_facet |
Plant Sciences |
accesstype |
Green Submitted |
id |
WOS:001437517500001 |
url |
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001 |
record_format |
wos |
collection |
Web of Science (WoS) |
_version_ |
1828987784416722944 |