Performance Analysis of Various Data mining Classification Algorithms on Diabetes Heart dataset

Authors

  • Gokilam GG Department of Computer Science and Engineering, PRIST University, Thanjavur, TamilNadu
  • Shanthi K Department of Computer Science and Engineering, PRIST University, Thanjavur, TamilNadu

Keywords:

Data mining, classification algorithms, Diabetes, Heart problem, WEKA Tool

Abstract

This Data mining techniques are applied in building software for fast and easy classification models. Early identification has high-risk modules also likely to have a high number of faults. Classification tree models are simple and effective as software quality prediction models while predictions of defects from such models can be used to achieve high software reliability. In this paper, the performance of some data mining classifier algorithms named J48, Random Forest, Random Tree, REP and Naïve Bayesian classifier (NBC) are evaluated based on 10 fold cross validation test. Diabetes is the most rapidly growing chronic disease of our time. People with diabetes are more likely to cause of new blindness, kidney disease, amputation and cardiovascular disease (heart disease and stoke). In this paper we take diabetes and heart datasets relate with their matching fields then apply the classification algorithm in diabetes heart dataset in WEKA (software tool) finding weather people affected by diabetes are getting chance to get heart disease or not, output are evaluated as Tested Negative (No Diabetes), Tested Normal(Not affected), Tested High(affected).

References

. Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, second edition, Morgan Kaufmann Publishers an imprint of Elsevier.

. V.Karthikeyani,”Comparative of Data Mining Classification Algorithm (CDMCA) in Diabetes Disease Prediction” International Journal of

Computer Applications (0975 – 8887) Volume 60–No.12, December 2012.

. M. Anbarasi “Enhanced Prediction of Heart Disease with Feature Subset Selection using Genetic Algorithm” International Journal of Engineering Science and Technology Vol. 2(10), 2010, 5370-5376.

. http://www.niddk.nih.gov/health-information/healthtopics/Diabetes/diabetes-heart-diseasestroke/Pages/index.aspx#connection.

. Jyoti Soni, Ujma Ansari, Dipesh Sharma, Sunita Soni “Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction International” Journal of Computer Applications (0975 – 8887) Volume 17– No.8, March 2011.

. Chau, M., Shin,D., “A Comparative study of Medical Data classification Methods Based on Decision Tree and Bagging algorithms”,Proceedings of IEEE International Conference on Dependable,Autonomic and Secure Computing "2009, pp.183-187.

. Gaganjot Kaur “Improved J48 Classification Algorithm for the Prediction of Diabetes” International Journal of Computer Applications (0975– 8887) Volume 98 – No.22, July 2014

. Random Forest by Leo Breiman and Adele Cutler: http://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm

. J.R. Quinlan, “Simplifying decision trees”, Internal Journal of Human Computer Studies,Vol.51, pp. 497- 491, 1999.

. N. Laves son and P. Davidson, “Multi-dimensional measures function for classifier performance”, 2nd. IEEE International conference on intelligent system, pp.508-513, 2004.

Downloads

Published

2024-02-26

How to Cite

Gokilam, G., & Shanthi, K. (2024). Performance Analysis of Various Data mining Classification Algorithms on Diabetes Heart dataset. COMPUSOFT: An International Journal of Advanced Computer Technology, 5(03), 2074–2079. Retrieved from https://ijact.in/index.php/j/article/view/363

Issue

Section

Original Research Article

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.