Comparison of Supervised Learning Algorithms for Predicting Airline Passenger Satisfaction

Authors

  • Agil Irman Fadri Universitas Islam Negeri Sultan Syarif Kasim Riau, Indonesia
  • Abid Zahfran Universitas Islam Negeri Sultan Syarif Kasim Riau, Indonesia
  • Taylan Irak Sabanci University, Turkey
  • Naufal Helga Firjatullah Asia Pacific University, Malaysia
  • Jelita Ekaraya Herianto Taylors University, Malaysia

DOI:

https://doi.org/10.57152/ijatis.v2i1.1868

Keywords:

Decision Tree, K-Nearest Neighbors, Naïve Bayes, Random Forest, Support Vector Machine

Abstract

Service quality and airline passenger satisfaction are the main factors in business success in the modern aviation industry. This research compares the performance of supervised learning algorithms, namely K-Nearest Neighbor (K-NN), Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machine (SVM), to predict passenger satisfaction. The k-fold cross-validation method with k=20 was applied to ensure comprehensive model evaluation by dividing the data proportionally. Using a high value of ???? was chosen to optimize the stability of the model estimates, reduce the risk of overfitting, and produce more accurate evaluation metrics. The research results show that the Random Forest algorithm provides the highest accuracy of 95.78%, followed by Decision Tree (93.82%) and K-NN (91.85%). These results indicate that the Random Forest algorithm better classifies passenger satisfaction than other algorithms. This research confirms the potential of machine learning algorithms as a practical solution in data analysis to support strategic decision-making, especially for airlines that want to improve customer experience.

References

N. Vojtek and B. Smudja, “Improving The Passenger Feedback Process In Airline Industry,” vol. 9, no. 2, pp. 255–269, 2019, [Online]. Available: doi: http://dx.doi.org/10.7708/ijtte.2019.9(2).10

S. Soklaridis, A. M. Geske, and S. Kummer, “Journal of the Air Transport Research Society Key characteristics of perceived customer centricity in the passenger airline industry?: A systematic literature review,” J. Air Transp. Res. Soc., vol. 3, no. July, p. 100031, 2024, doi: 10.1016/j.jatrs.2024.100031.

M. B. Gorza?czany and F. Rudzi, “applied sciences Business Intelligence in Airline Passenger Satisfaction Study — A Fuzzy-Genetic Approach with Optimized,” 2021, [Online]. Available: https://doi.org/10.3390/app11115098

B. H. Hayadi, J. Kim, K. Hulliyah, and H. T. Sukmana, “Predicting Airline Passenger Satisfaction with Classification Algorithms,” vol. 4, no. 1, pp. 82–94, 2021, [Online]. Available: https://doi.org/10.47738/ijiis.v4i1.80

M. H. Setiono, “Komprasi Algoritma Decision Tree, Random Forest, SVM dan K-NN Dalam Klasifikasi Kepuasan Penumpang Maskapai Penerbangan,” vol. 17, no. 1, pp. 32–39, 2022.

P. Sun et al., “IBRO Neuroscience Reports Bipolar disorder?: Construction and analysis of a joint diagnostic model using random forest and feedforward neural networks,” vol. 17, no. December 2023, pp. 145–153, 2024.

S. Kang, “k -Nearest Neighbor Learning with Graph Neural Networks,” 2021, [Online]. Available: https://doi.org/10.3390/math9080830

S. Lee, C. Lee, K. G. I. Mun, and D. Kim, “Decision Tree Algorithm Considering Distances Between Classes,” vol. 10, no. July, 2022, [Online]. Available: 10.1109/ACCESS.2022.3187172

D. S. Suprapto and R. S. Oetama, “Jurnal Informatika Ekonomi Bisnis Analysis of Airline Passenger Satisfaction Using Decision Tree and Naïve Bayes Algorithms,” vol. 5, pp. 1493–1500, 2023, doi: 10.37034/infeb.v5i4.728.

I. T. Akinola, Y. Sun, I. G. Adebayo, and Z. Wang, “Daily peak demand forecasting using Pelican Algorithm optimised Support Vector Machine ( POA-SVM ),” Energy Reports, vol. 12, no. October, pp. 4438–4448, 2024, doi: 10.1016/j.egyr.2024.10.017.

A. S. Nugroho, A. B. Witarto, and D. Handoko, “Support Vector Machine,” 2003, [Online]. Available: https://asnugroho.net/papers/ikcsvm.pdf

A. C. Y. Hong, K. W. Khaw, X. Y. Chew, and W. C. Yeong, “Prediction of US airline passenger satisfaction using machine learning algorithms,” vol. 4, no. 1, pp. 7–22, 2023, [Online]. Available: https://doi.org/10.15282/daam.v4i1.9071

A. Handayanto, K. Latifa, N. D. Saputro, and R. R. Waliyansyah, “Analisis dan Penerapan Algoritma Support Vector Machine ( SVM ) dalam Data Mining untuk Menunjang Strategi Promosi ( Analysis and Application of Algorithm Support Vector Machine ( SVM ) in Data Mining to Support Promotional Strategies ),” vol. 7, no. November, pp. 71–79, 2019.

P. F. Pratama, D. Rahmadani, and R. S. Nahampun, “Random Forest Optimization Using Particle Swarm Optimization for Diabetes Classification,” vol. 1, no. July, pp. 41–46, 2023.

B. P. Christie, D. M. Tat, and T. Irwin, “Comparation of K-Nearest Neighboor ( K-NN ) and Naive Bayes Algorithm for the Classification of the Poor in Recipients of Social Assistance Comparation of K-Nearest Neighboor ( K-NN ) and Naive Bayes Algorithm for the Classification of the Poor in Recipients of Social Assistance”, doi: 10.1088/1742-6596/1641/1/012077.

A. Rahmah, N. Sepriyanti, and M. H. Zikri, “Implementation of Support Vector Machine and Random Forest for Heart Failure Disease Classification,” vol. 1, no. July, pp. 34–40, 2023.

Z. C. Dwynne, “Comparison Of Machine Learning Algorithms On Sentiment Analysis Of Elsagate Content,” 2024 Int. Conf. Smart Comput. IoT Mach. Learn., pp. 239–243, 2024, doi: 10.1109/SIML61815.2024.10578186.

S. M. Ganie and M. B. Malik, “Healthcare Analytics An ensemble Machine Learning approach for predicting Type-II diabetes mellitus based on lifestyle indicators,” Healthc. Anal., vol. 2, no. August, p. 100092, 2022, doi: 10.1016/j.health.2022.100092.

M. Bansal, A. Goyal, and A. Choudhary, “A comparative analysis of K-Nearest Neighbor , Genetic , Support Vector Machine , Decision Tree , and Long Short Term Memory algorithms in machine learning,” Decis. Anal. J., vol. 3, no. May, p. 100071, 2022, doi: 10.1016/j.dajour.2022.100071.

Z. C. Dwinnie et al., “Application of the Supervised Learning Algorithm for Classification of Pregnancy Risk Levels,” vol. 1, no. July, pp. 26–33, 2023.

W. Cherif, “ScienceDirect ScienceDirect Optimization of K-NN algorithm by clustering and reliability coefficients?: application to breast-cancer diagnosis,” Procedia Comput. Sci., vol. 127, pp. 293–299, 2018, doi: 10.1016/j.procs.2018.01.125.

A. I. Putri, N. A. Husna, N. M. Cia, and M. A. Arba, “Implementation of K-Nearest Neighbors , Naïve Bayes Classifier , Support Vector Machine and Decision Tree Algorithms for Obesity Risk Prediction,” vol. 2, no. July, pp. 26–33, 2024.

G. Pappalardo, S. Cafiso, A. Di Graziano, and A. Severino, “Decision Tree Method to Analyze the Performance of Lane Support Systems,” 2021, [Online]. Available: https://doi.org/10.3390/su13020846

A. F. Lubis, H. Z. Haq, I. Lestari, and M. Iltizam, “Classification of Diabetes Mellitus Sufferers Eating Patterns Using K-Nearest Neighbors , Naïve Bayes and Decission Tree,” vol. 2, no. July, pp. 44–51, 2024.

D. R. Prehanto, A. D. Indriyanti, and K. D. Nuryana, “Preprocessing Using Correlation Based Features Selection on Naive Bayes Classification Preprocessing Using Correlation Based Features Selection on Naive Bayes Classification”, doi: 10.1088/1757-899X/982/1/012012.

M. R. Romadhon, “A Comparison of Naive Bayes Methods , Logistic Regression and K-NN for Predicting Healing of Covid-19 Patients in Indonesia,” pp. 41–44, 2021.

M. Sheykhmousa, M. Mahdianpari, H. Ghanbari, F. Mohammadimanesh, P. Ghamisi, and S. Member, “Support Vector Machine Versus Random Forest for Remote Sensing Image Classification?: A Meta-Analysis and Systematic Review,” vol. 13, pp. 6308–6325, 2020.

C. Avci, M. Budak, N. Yagmur, and F. B. Balcik, “Comparison between random forest and support vector machine algorithms for LULC classification,” vol. 8, no. 1, pp. 1–10, 2023, doi: 10.26833/ijeg.987605.

D. M. Abdullah and A. M. Abdulazeez, “Machine Learning Applications based on SVM Classification?: A Review,” pp. 81–90, doi: 10.48161/Issn.2709-8206.

E. Y. Boateng, J. Otoo, and D. A. Abaye, “Basic Tenets of Classification Algorithms K -Nearest-Neighbor , Support Vector Machine , Random Forest and Neural Network?: A Review,” pp. 341–357, 2020, doi: 10.4236/jdaip.2020.84020.

Downloads

Published

2025-03-03