##plugins.themes.bootstrap3.article.main##

Heart disease has become one of the alarming issues of death. It is accountable for fatty plaques in the arteries. If this fatal condition can be identified early, we can preserve many people’s arteries. Different types of supervised machine learning algorithms are applied in our research paper in order to predict heart disease existence in patient body. Besides this, we have focused on an efficient way to improve the performance of our applied classifiers. Imputing mean value technique is applied to handle null values present in our dataset. The features which are unnecessary are removed by using the info-gain feature selection technique. In order to calculate prediction accuracy, K-Nearest Neighbors (KNN), Naive Bayes and Random Forest are applied to the heart disease dataset. Accuracy, precision, recall, F1-score, and ROC are calculated which help us to compare the performance of the classification models. Handling null values on a particular column by imputing mean values of that column and our applied info-gain feature selection technique has aided us in improving the accuracy of our prediction models. Random Forest among all has given the best classification accuracy which is 95.63% with precision, recall, F1-score and ROC are 0.93, 0.92, 0.92 and 0.9, respectively.

References

  1. Yilmaz R, Yagin FH. Early detection of coronary heart disease based on machine learning methods. International Medical Journal. 2022 Jan 1; 4(1): 1–6. doi: 10.37990/medr.1011924.
     Google Scholar
  2. Pal M, Parija S. Prediction of heart diseases using random forest. Journal of Physics: Conference Series. 2021 Mar 15; 1817(1): 1–9. doi: 10.1088/1742-6596/1817/1/012009.
     Google Scholar
  3. Boukhatem C, Youssef HY, Nassif AB. Heart disease prediction using machine learning. IEEE Advances in Science and Engineering Technology International Conferences (ASET). 2022 Feb 21–24, Dubai, United Arab Emirates.
     Google Scholar
  4. Riyaz L, Butt MA, Zaman M, Ayob O. Heart disease prediction using machine learning techniques: a quantitative review. International Conference on Innovative Computing and Communications, pp. 81–94, vol. 1394, Singapore: Springer; 2022.
     Google Scholar
  5. Rahman MM, Rana MR, Alam MNA, Khan MSI, Uddin KMM. A web-based heart disease prediction system using machine learning algorithms. Network Biology. 2022 Jun 1; 12(2): 64–80.
     Google Scholar
  6. Riyaz L, Butt MA, Zaman M. Improving coronary heart disease prediction by outlier elimination. Applied Computer Science. 2022 Mar 28; 18(1): 70–88. doi: 10.35784/acs-2022-6.
     Google Scholar
  7. Jindal H, Agrawal S, Khera R, Jain R, Nagrath P. Heart disease prediction using machine learning algorithms. IOP conference series: materials science and engineering. 2021 Jan 18; 1022(1): 1–11. doi: 10.1088/1757-899X/1022/1/012072.
     Google Scholar
  8. Rajdhan A, Sai M, Agarwal A, Ravi D, Ghuli DP. Heart disease prediction using machine learning. International Journal of Research and Technology. 2020; 9(4): 659–662.
     Google Scholar
  9. Shah D, Patel S, Bharti SK. Heart disease prediction using machine learning Techniques. SN Computer Science. 2020 Oct 16; 1(6): 1–6. doi: https://doi.org/10.1007/s42979-020-00365-y.
     Google Scholar
  10. Sing A, Kumar R. Heart disease prediction using machine learning algorithms. IEEE international conference on electrical and electronics engineering (ICE3). 2020 Feb 14–15, pp. 452–457, Gorakhpur, India.
     Google Scholar
  11. Hasan SMM, Mamun MA, Uddin MP, Hossain MA. Comparative analysis of classification approaches for heart disease prediction. IEEE International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2). 2018 Feb 8–9, Rajshahi, Bangladesh.
     Google Scholar
  12. Nahiduzzaman M, Nayeem MJ, Ahmed MT, Zaman MSU. Prediction of heart disease using multi-layer perceptron neural network and support vector machine. IEEE 4th International conference on electrical information and communication technology (EICT). 2019 Dec 20-22, Khulna, Bangladesh.
     Google Scholar
  13. Nayeem MJ, Rana S, Alam F, Rahman MA. Prediction of hepatitis disease using k-nearest neighbors, naive bayes, support vector machine, multi-layer perceptron and random forest. IEEE International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD). 2021 Feb 27–28, pp. 280–284, Dhaka, Bangladesh.
     Google Scholar
  14. Kaggle.com. Kaggle Cardiovascular Disease Dataset. [Internet]. 2019 [updated 2019 Jan 20; cited 2022 Sep 01]; Available from: https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset.
     Google Scholar