Implementation of Support Vector Machine and Random Forest for Heart Failure Disease Classification

Authors

  • Astriana Rahmah Universitas Islam Negeri Sultan Syarif Kasim Riau
  • Nurhafiza Sepriyanti Universitas Islam Negeri Sultan Syarif Kasim Riau
  • Muhammad Hafis Zikri Universitas Islam Negeri Sultan Syarif Kasim Riau
  • Isnani Ambarani Universitas Islam Negeri Sultan Syarif Kasim Riau
  • Muhammad Yusuf bin Shahar Universitas Sains Islam Malaysia

DOI:

https://doi.org/10.57152/predatecs.v1i1.816

Keywords:

Classification, Comparison, Heart Failure, Random Forest, Support Vector Machine

Abstract

Heart failure is a life-threatening disease and its management should be considered a global public health priority. The use of data mining in data processing operations to identify existing patterns and identify the information stored in them. In this study, researchers classify using two algorithms for comparison of algorithms, namely Random Forest (RF) and Support Vector Machine (SVM). The purpose of this study is to find patterns in finding the best accuracy for the 2 algorithms. The results of this study obtained an accuracy of 81.51%. with a Hold Out of 60 : 40% on the SVM algorithm, while an accuracy of 83.33 % with a Hold Out of 9 0 : 1 0% on the R F algorithm . From these results it can be seen that the highest accuracy value is obtained at RF making the best algorithm compared to the SVM algorithm.

References

AK Faieq and MM Mijwil, "Prediction of heart diseases using support vector machines and artificial neural networks," Indones. J.Electr. Eng. Comput. Sc., vol. 26, no. 1, pp. 374–380, 2022.

P. Ponikowski et al., “Heart failure: preventing disease and death worldwide,” ESC Hear. Files, vol. 1, no. 1, pp. 4–25, 2014.

A. Groenewegen, FH Rutten, A. Mosterd, and AW Hoes, “Epidemiology of heart failure,” Eur. J. Heart Fail., vol. 22, no. 8, pp. 1342–1356, 2020.

SP Shaji, "Prediction and diagnosis of heart disease patients using data mining technique," in 2019 international conference on communication and signal processing (ICCSP), 2019, pp. 848–852.

A Comparative Study on Heart Disease Prediction Using Data Mining Techniques and Feature Selection

Efficient Prediction of Stroke Patients Using Random Forest Algorithm in Comparison to Support Vector Machine

VF Rodriguez-Galiano, B. Ghimire, J. Rogan, M. Chica-Olmo, and JP Rigol-Sanchez, “An assessment of the effectiveness of a random forest classifier for land-cover classification,” ISPRS J. Photogramm . Remote Sens., vol. 67, pp. 93–104, 2012.

OL Mangasarian, “Data mining via support vector machines,” in IFIP Conference on system modeling and optimization, 2001, pp. 91–112.

S. Karamizadeh, SM Abdullah, M. Halimi, J. Shayan, and M. javad Rajabi, “Advantage and drawbacks of support vector machine functionality,” in 2014 international conference on computer, communications, and control technology (I4CT) , 2014, pp. 63–65.

A. Shmilovici, “Support vector machines,” in Data mining and knowledge discovery handbook, Springer, 2009, pp. 231–247.

J. Du, Y. Liu, Y. Yu, and W. Yan, “A prediction of precipitation data based on support vector machine and particle swarm optimization (PSO-SVM) algorithms,” Algorithms, vol. 10, no. 2, p. 57, 2017.

N. Horning, “Random Forests: An algorithm for image classification and generation of continuous field data sets,” in Proceedings of the International Conference on Geoinformatics for Spatial Infrastructure Development in Earth and Allied Sciences, Osaka, Japan, 2010, vol . 911, pp. 1–6.

SA Dick, R. Zaman, and S. Epelman, “Using high-dimensional approaches to probe monocytes and macrophages in cardiovascular disease,” Front. Immunol., vol. 10, p. 2146, 2019.

AP Lumi, VFF Joseph, and NCI Polii, "Cardiac Rehabilitation in Patients with Chronic Heart Failure," J. Biomedicine JBM, vol. 13, no. 3, pp. 309–316, 2021.

Performance Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods

P. Purwono, A. Wirasto, and K. Nisa, "Comparison of Machine Learning Algorithms for Classification of Drug Groups," SISFOTENIKA, vol. 11, no. 2, pp. 196–207, 2021.

M. Mayasari, D. Iskandar Mulyana, M. Betty Yel, and S. Higher Computer Science Cipta Karya Informatika Jl Raden, "Comparison of Classification of Rhizome Plant Types Using Principal Component Analysis, Support Vector Machine, K-Nearest Neighbor and Decision Tree,” J.Tek. inform. Kaputama, vol. 6, no. 2, 2022.

Garcia-Carretero, Rafael, et all. 2020. Use of a K-Nearest Neighbors Model to Predict The Development of Type 2 Diabetes Within 2 Years in An Obese, Hypertensive Population. International Federation for Medical and Biological Engineering.

D. A. Kristiyanti, “Analisis Sentimen Review Produk Kosmetik menggunakan Algoritma Support Vector Machine dan Particle Swarm Optimization sebagai Metode Seleksi Fitur,” SNIT 2015, vol. 1, no. 1, pp. 134–141, 2015.

L. Breiman, ''Random forests,'' Mach. Learn., vol. 45, no. 1, pp. 5–32, Oct. 2001, doi: 10.1023/A:1010933404324.

J. Peters, NEC Verhoest, R. Samson, P. Boeckx, and B. De Baets, ''Wetland vegetation distribution modeling for the identification of constraining environmental variables,'' Landscape Ecol., vol. 23, no. 9, pp. 1049–1065, Sp. 2008, doi: 10.1007/s10980-008-9261-4.

K. Schouten, F. Frasincar, and R. Dekker, “An information gain-driven feature study for aspect-based sentiment analysis,” in International Conference on Applications of Natural Language to Information Systems, 2016, pp. 48–59.

B. An and Y. Suh, "Identifying financial statement fraud with decision rules obtained from Modified Random Forest," Data Technol. Appl., vol. 54, no. 2, pp. 235–255, 2020.

A. Javeed, S. Zhou, L. Yongjian, I. Qasim, A. Noor, and R. Nour, "An intelligent learning system based on random search algorithm and optimized random forest model for improved heart disease detection," IEEE access, vol. 7, pp. 180235–180243, 2019.

F. Akbar, HW Saputra, AK Maulaya, MF Hidayat, and R. Rahmaddeni, "Implementation of the C4 Decision Tree Algorithm. 5 and Support Vector Regression for Prediction of Stroke: Implementation of Decision Tree Algorithm C4. 5 and Support Vector Regression for Stroke Disease Prediction,” MALCOM Indonesia. J. Mach. learn. Comput. Sc., vol. 2, no. 2, pp. 61–67, 2022.

N. Sepriyanti, R. S. Nahampun, M. H. Zikri, I. Ambarani, and A. Rahmadeyan, “Penerapan K-Means Clustering Untuk Mengelompokkan Tingkat Kemiskinan di Provinsi Riau: Implementation of K-Means Clustering to Group Poverty Levels in Riau Province,” in SENTIMAS: Seminar Nasional Penelitian dan Pengabdian Masyarakat, 2022, vol. 1, no. 1, pp. 59–65.

Downloads

Published

2023-07-24