Explainable machine learning-based prediction of fuel consumption in ship main engines using operational data
Volume
76
Issue number
4
Article number
76405
Received
15 March 2025
Received in revised form
6 June 2025
Accepted
1 July 2025
Available online
1 August 2025
Authors
Anh Tuan Hoang1,2, Thi Anh Em Bui3, Xuan Phuong Nguyen4, Van Hung Bui5, Quang Chien Nguyen6,7, Thanh Hai Truong4, Nghia Chung8,*
1 Faculty of Engineering, Dong Nai Technology University, Bien Hoa City, Vietnam.
2 Graduate School of Energy and Environment, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul, 02841, South Korea
3 Institute of Engineering, HUTECH University, Ho Chi Minh City, Viet Nam.
4 PATET Research Group, Ho Chi Minh City University of Transport, Ho Chi Minh City, Vietnam.
5 University of Technology and Education, The University of Danang, Danang, Vietnam.
6 Institute of Research and Development, Duy Tan University, Da Nang, Vietnam.
7 School of Engineering & Technology, Duy Tan University, Da Nang, Vietnam.
8 Institute of Maritime, Ho Chi Minh City University of Transport, Ho Chi Minh City, Vietnam.
Corresponding author email
Abstract
A significant percentage of fuel consumption and emissions from transportation activities is related to maritime transportation. Hence, accurate prediction models for fuel consumption are quite important. Machine learning offers a data-driven approach to improving fuel consumption prediction, thereby promoting environmental sustainability, lowering operational costs, and enhancing financial viability. This work explores several machine learning approaches by employing statistical measures, including mean squared error (MSE), coefficient of determination (R²), and Kling-Gupta efficiency (KGE), to develop main engine fuel consumption (MEFC) prediction models. Hyperparameter optimization via grid search was conducted to improve the generalizability and robustness of the models. With the lowest test MSE (0.69), a robust testing R² (0.9867), and a high KGE (0.9681), the Random Forests proved to be the most appropriate model for MEFC modeling among all others. Extreme Gradient Boosting followed closely with competitive accuracy, with MSE values of 0.75 and a robust testing R² (0.9856). Using Shapley additive explanations and Local interpretable model-agnostic explanations, this study improves model interpretability even more and indicates that main engine speed and wind speed were revealed to be the most important factors controlling MEFC. Explainable artificial intelligence techniques offer transparency in decision-making, thereby helping marine operators maximize fuel economy. Employing reliable and interpretable predictive modeling, this study offers insightful information for sustainable shipping, hence lowering operating costs and emissions.
Keywords
Ship fuel consumption, Prediction model, Local interpretable model-agnostic explanations, Shapley additive explanations, Explainable artificial intelligence