Model Hybrid PSO, Feature Selection Correlation dan Logistic Regression untuk Deteksi Penyakit Jantung

Authors

  • Muhammad Wahyu Hidayatullah Universitas Muhammadiyah Kalimantan Timur
  • Taghfirul Azhima Yoga Siswa Universitas Muhammadiyah Kalimantan Timur
  • Wawan Joko Pranoto Universitas Muhammadiyah Kalimantan Timur

DOI:

https://doi.org/10.23969/infomatek.v28i1.43123

Keywords:

Jantung, Correlation, Machine Learning, Logistic Regression, Particle Swarm Optimization

Abstract

Penyakit jantung merupakan salah satu penyebab utama kematian baik di Indonesia maupun secara global sehingga diperlukan model deteksi dini yang akurat. Penelitian ini bertujuan meningkatkan kinerja Logistic Regression dengan regularisasi L2 melalui optimasi Particle Swarm Optimization (PSO) dan feature selection berbasis correlation. Metode yang digunakan meliputi pre-processing, standarisasi, seleksi fitur, serta evaluasi menggunakan K-10 Fold Cross Validation. Hasil pengujian menunjukkan bahwa Logistic Regression menghasilkan accuracy 82,47%, precision 80,31%, recall 88,56%, dan F1-score 84,10%. Setelah dioptimasi dengan PSO, performa meningkat menjadi accuracy 84,45%, precision 81,74%, recall 91,01%, dan F1-score 85,98%. Hasil tersebut menegaskan bahwa pendekatan hybrid yang diusulkan efektif dalam meningkatkan deteksi penyakit jantung.

Downloads

Download data is not yet available.

References

Adhitya, R. R., Witanti, W., & Yuniarti, R. (2023). Perbandingan metode CART dan Naïve Bayes untuk klasifikasi customer churn. INFOTECH Journal, 9(2), 307–318. https://doi.org/10.31949/infotech.v9i2.5641

Agarwal, N., Deepakshi, Harikiran, J., Lakshmi, Y. B., Kumar, A. P., Muniyandy, E., & Verma, A. (2024). Predictive modelling for heart disease diagnosis: A comparative study of classifiers EAI Endorsed Transactions on Pervasive Health and Technology, 10,1–11. https://doi.org/10.4108/eetpht.10.5518

Permana, A. A. (2024). Implementasi Metode K-Nearest Neighbors (KNN) untuk Klasifikasi Penyakit Jantung. G-Tech: Jurnal Teknologi Terapan, 8(1), 186–195.

Aspiah, R., Azhima, T., & Siswa, Y. (2022). Implementasi correlation based feature selection (CFS) untuk peningkatan akurasi algoritma C4.5 dalam prediksi performa akademik mahasiswa berbasis learning management system. Jurnal Ilmiah Betrik, 13(3), 199–207.

Bietrosula, A. B., Werdiningsih, I., & Wuriyanto, E. (2024). Classification of cardiovascular disease based on lifestyle using random forest and logistic regression methods. International Journal of Electrical and Electronic Engineering, 12(2). https://doi.org/10.52549/ijeei.v12i2.5388

Black, J. E., Kueper, J. K., & Williamson, T. S. (2023). An introduction to machine learning for classification and prediction. Family Practice, 40(1), 200–204. https://doi.org/10.1093/fampra/cmac104

Wibowo, A. C., Lestari, S. A., & Nurchim, N. (2024). Analisis penggunaan machine learning dalam klasifikasi penentuan penyakit jantung. Simtek: Jurnal Sistem Informasi dan Teknik Komputer, 9(2), 97–101. https://doi.org/10.51876/simtek.v9i2.395

Damari, A., Azhima, T., Siswa, Y., & Pranoto, W. J. (2025). Penerapan metode PSO-SMOTE pada algoritma Naive Bayes untuk mengatasi class imbalance data bencana tanah longsor. Jurnal Inovtek Polbeng, 10(1), 332–343.

Dinova, D. B., & Prasetiyo, B. (2024). Implementasi random forest dalam klasifikasi kanker paru-paru. JOINTER: Journal of Informatics Engineering, 5(1), 27–31.

Elshewey, A. M., Abed, A. H., Khafaga, D. S., Alhussan, A. A., Eid, M. M., & El-Kenawy, E. S. M. (2025). Enhancing heart disease classification using optimization algorithm and LSTM. Scientific Reports, 15(1), 1–21. https://doi.org/10.1038/s41598-024-83592-0

Fan, C., Chen, M., Wang, X., Wang, J., & Huang, B. (2021). A review on data preprocessing techniques toward efficient knowledge discovery. Frontiers in Energy Research, 9, 1–17. https://doi.org/10.3389/fenrg.2021.652801

Firnanda, P. A., Litaysa, S., Rahma, F., & Fauzi, F. (2025). Analisis perbandingan decision tree dan random forest dalam klasifikasi penjualan produk pada supermarket. Emerging Statistics and Data Science Journal, 3(1), 445–461.

Harris, J. K. (2021). Primer on binary logistic regression. Family Medicine and Community Health, 9, 1–7. https://doi.org/10.1136/fmch-2021-001290

Hicks, S. A., Strümke, I., Thambawita, V., Hammou, M., Riegler, M. A., Halvorsen, P., & Parasa, S. (2022). On evaluation metrics for medical applications of artificial intelligence. Scientific Reports, 12(1), 1–9. https://doi.org/10.1038/s41598-022-09954-8

Hidayat, R., Saputra, Y., Sujana, T., Husnah, M., & Saputra, H. T. (2024). Implementasi machine learning untuk prediksi penyakit jantung menggunakan algoritma support vector machine. BIOS: Jurnal Teknologi Informasi dan Rekayasa Komputer, 5(2), 161–168.

Jain, M., Saihjpal, V., Singh, N., & Singh, S. B. (2022). An overview of variants and advancements of PSO algorithm. Applied Sciences, 12(17), 1–21. https://doi.org/10.3390/app12178392

Jibril, A. U., Haruna, K., & Jiangsheng, Z. (2023). Feature selection and parameter optimization using PSO. Journal of Computer Science and Technology, 11(1), 21–47. https://doi.org/10.15640/jcsit.v11n1a3

Junifer, J., Tanjaya, H., & Kenichi, K. (2021). Mendeteksi penyakit jantung menggunakan logistic regression. Journal Information System Development, 6(2), 1–10.

Markoulidakis, I., Rallis, I., Georgoulas, I., Kopsiaftis, G., Doulamis, A., & Doulamis, N. (2021). Multiclass confusion matrix reduction method. Technologies, 9(4). https://doi.org/10.3390/technologies9040081

Oise, G. P., Oyedotun, S. A., Nwabuokei, O. C., Babalola, A. E., & Unuigbokhai, N. B. (2025). Enhanced prediction of coronary artery disease using logistic regression. Fudma Journal of Sciences, 9(3), 201–208. https://doi.org/10.33003/fjs-2025-0903-3263

Oon, W. Y., Tuti, D., Yee, L. S., & Susanti. (2022). Penerapan data mining untuk klasifikasi kelulusan mahasiswa. SATIN - Sains dan Teknologi Informasi, 8(2), 122–131. https://doi.org/10.33372/stn.v8i2.885

World Health Organization. (2025). Cardiovascular diseases. https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases

Ratnasari, R., Wahidin, J., Setiawan, A. E., & Bintoro, P. (2024). Machine learning untuk klasifikasi penyakit jantung. Aisyah Journal of Informatics and Electrical Engineering, 6(1), 145–150. https://doi.org/10.30604/jti.v6i1.272

Kementerian Kesehatan RI. (2024). Kenali gejala jantung sejak dini. https://kemkes.go.id/id/kenali-gejala-jantung-sejak-dini

Rojas, R., Seseña, H., Zuñiga, M., & Martínez, M. (2024). Performance comparison in optimization algorithms. Revista de Ciencias de la Salud, 1–8. https://doi.org/10.35429/johs.2024.11.30.1.8

Salsabila, C. A., Yulianto, F., & Siswa, T. A. Y. (2025). Implementasi metode Naive Bayes untuk klasifikasi kecelakaan lalu lintas. Jurnal Informatika dan Teknik Elektro Terapan, 13(1). https://doi.org/10.23960/jitet.v13i1.5890

Sani, N. S., Esa, M. I., & Musawi, B. A. (2023). Malware detection using deep learning and correlation-based feature selection. Symmetry, 15(3).

Saputra, D., Dharmawan, W. S., & Irmayani, W. (2022). Performance comparison of SVM and SVM-PSO algorithms. International Journal of Advances in Data and Information Systems, 3(2), 74–86. https://doi.org/10.25008/ijadis.v3i2.1243

Sathyanarayanan, S. (2024). Confusion matrix-based performance evaluation metrics. African Journal of Biomedical Research, 4023–4031. https://doi.org/10.53555/ajbr.v27i4s.4345

Sepharni, A., Hendrawan, I. E., & Rozikin, C. (2022). Klasifikasi penyakit jantung menggunakan algoritma C4.5. STRING, 7(2), 117. https://doi.org/10.30998/string.v7i2.12012

Setiani, H., Sunyoto, A., & Nasiri, A. (2022). Metode Naïve Bayes dan particle swarm optimization untuk klasifikasi penyakit jantung. Explore, 12(2), 6. https://doi.org/10.35200/explore.v12i2.566

Sonjaya, C., Masruriyah, A. F. N., Kusumaningrum, D. S., & Pratama, A. R. (2022). Performance comparison of classification algorithm. INTERNAL Journal, 5(2), 166–175. https://doi.org/10.32627/internal.v5i2.595

Sung, W. T., Isa, I. G. T., & Hsiao, S. J. (2023). Designing aquaculture monitoring system based on data fusion. Electronics, 12(9). https://doi.org/10.3390/electronics12092032

Taher, H. A., & Abdulazeez, A. M. (2023). Machine learning approaches for heart disease detection. International Journal of Research and Applied Technology, 3(2), 267–282.

Yi, J., Yu, P., Huang, T., & Xu, Z. (2024). Optimization of transformer heart disease prediction model. ICFTIC Proceedings, 1109–1113. https://doi.org/10.1109/ICFTIC64248.2024.10913096

Zabor, E. C., Reddy, C. A., Tendulkar, R. D., & Patil, S. (2022). Logistic regression in clinical studies. International Journal of Radiation Oncology Biology Physics, 112(2), 271–277. https://doi.org/10.1016/j.ijrobp.2021.08.007

Zhang, L., Geisler, T., Ray, H., & Xie, Y. (2022). Improving logistic regression on imbalanced data. Journal of Applied Statistics, 49(13), 3257–3277. https://doi.org/10.1080/02664763.2021.1939662

Zhu, Y., & Liu, R. (2021). An algorithmic view of L2 regularization. Journal of Machine Learning Research, 22, 1–62.

Downloads

Published

2026-04-23

How to Cite

Hidayatullah, M. W., Siswa, T. A. Y., & Pranoto, W. J. (2026). Model Hybrid PSO, Feature Selection Correlation dan Logistic Regression untuk Deteksi Penyakit Jantung. Infomatek, 28(1), 163–175. https://doi.org/10.23969/infomatek.v28i1.43123