Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the mult...
الحاوية / القاعدة: | Heliyon |
---|---|
المؤلف الرئيسي: | |
التنسيق: | مقال |
اللغة: | English |
منشور في: |
Elsevier Ltd
2023
|
الوصول للمادة أونلاين: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772 |
id |
Hazrin N.A.; Chong K.L.; Huang Y.F.; Ahmed A.N.; Ng J.L.; Koo C.H.; Tan K.W.; Sherif M.; El-shafie A. |
---|---|
spelling |
Hazrin N.A.; Chong K.L.; Huang Y.F.; Ahmed A.N.; Ng J.L.; Koo C.H.; Tan K.W.; Sherif M.; El-shafie A. 2-s2.0-85168853997 Predicting sea levels using ML algorithms in selected locations along coastal Malaysia 2023 Heliyon 9 9 10.1016/j.heliyon.2023.e19426 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772 In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML. © 2023 Elsevier Ltd 24058440 English Article All Open Access; Gold Open Access; Green Open Access |
author |
2-s2.0-85168853997 |
spellingShingle |
2-s2.0-85168853997 Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
author_facet |
2-s2.0-85168853997 |
author_sort |
2-s2.0-85168853997 |
title |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
title_short |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
title_full |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
title_fullStr |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
title_full_unstemmed |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
title_sort |
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia |
publishDate |
2023 |
container_title |
Heliyon |
container_volume |
9 |
container_issue |
9 |
doi_str_mv |
10.1016/j.heliyon.2023.e19426 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772 |
description |
In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML. © 2023 |
publisher |
Elsevier Ltd |
issn |
24058440 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access; Green Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1828987864996642816 |