Predicting sea levels using ML algorithms in selected locations along coastal Malaysia

In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the mult...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Heliyon
المؤلف الرئيسي: 2-s2.0-85168853997
التنسيق: مقال
اللغة:English
منشور في: Elsevier Ltd 2023
الوصول للمادة أونلاين:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772
id Hazrin N.A.; Chong K.L.; Huang Y.F.; Ahmed A.N.; Ng J.L.; Koo C.H.; Tan K.W.; Sherif M.; El-shafie A.
spelling Hazrin N.A.; Chong K.L.; Huang Y.F.; Ahmed A.N.; Ng J.L.; Koo C.H.; Tan K.W.; Sherif M.; El-shafie A.
2-s2.0-85168853997
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
2023
Heliyon
9
9
10.1016/j.heliyon.2023.e19426
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772
In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML. © 2023
Elsevier Ltd
24058440
English
Article
All Open Access; Gold Open Access; Green Open Access
author 2-s2.0-85168853997
spellingShingle 2-s2.0-85168853997
Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
author_facet 2-s2.0-85168853997
author_sort 2-s2.0-85168853997
title Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
title_short Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
title_full Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
title_fullStr Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
title_full_unstemmed Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
title_sort Predicting sea levels using ML algorithms in selected locations along coastal Malaysia
publishDate 2023
container_title Heliyon
container_volume 9
container_issue 9
doi_str_mv 10.1016/j.heliyon.2023.e19426
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168853997&doi=10.1016%2fj.heliyon.2023.e19426&partnerID=40&md5=8125708b70c895e264e6c132019e0772
description In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML. © 2023
publisher Elsevier Ltd
issn 24058440
language English
format Article
accesstype All Open Access; Gold Open Access; Green Open Access
record_format scopus
collection Scopus
_version_ 1828987864996642816