Performance Analysis of Various Data mining Classification Algorithms on Diabetes Heart dataset
Keywords:
Data mining, classification algorithms, Diabetes, Heart problem, WEKA ToolAbstract
This Data mining techniques are applied in building software for fast and easy classification models. Early identification has high-risk modules also likely to have a high number of faults. Classification tree models are simple and effective as software quality prediction models while predictions of defects from such models can be used to achieve high software reliability. In this paper, the performance of some data mining classifier algorithms named J48, Random Forest, Random Tree, REP and Naïve Bayesian classifier (NBC) are evaluated based on 10 fold cross validation test. Diabetes is the most rapidly growing chronic disease of our time. People with diabetes are more likely to cause of new blindness, kidney disease, amputation and cardiovascular disease (heart disease and stoke). In this paper we take diabetes and heart datasets relate with their matching fields then apply the classification algorithm in diabetes heart dataset in WEKA (software tool) finding weather people affected by diabetes are getting chance to get heart disease or not, output are evaluated as Tested Negative (No Diabetes), Tested Normal(Not affected), Tested High(affected).
References
. Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, second edition, Morgan Kaufmann Publishers an imprint of Elsevier.
. V.Karthikeyani,”Comparative of Data Mining Classification Algorithm (CDMCA) in Diabetes Disease Prediction” International Journal of
Computer Applications (0975 – 8887) Volume 60–No.12, December 2012.
. M. Anbarasi “Enhanced Prediction of Heart Disease with Feature Subset Selection using Genetic Algorithm” International Journal of Engineering Science and Technology Vol. 2(10), 2010, 5370-5376.
. Jyoti Soni, Ujma Ansari, Dipesh Sharma, Sunita Soni “Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction International” Journal of Computer Applications (0975 – 8887) Volume 17– No.8, March 2011.
. Chau, M., Shin,D., “A Comparative study of Medical Data classification Methods Based on Decision Tree and Bagging algorithms”,Proceedings of IEEE International Conference on Dependable,Autonomic and Secure Computing "2009, pp.183-187.
. Gaganjot Kaur “Improved J48 Classification Algorithm for the Prediction of Diabetes” International Journal of Computer Applications (0975– 8887) Volume 98 – No.22, July 2014
. Random Forest by Leo Breiman and Adele Cutler: http://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm
. J.R. Quinlan, “Simplifying decision trees”, Internal Journal of Human Computer Studies,Vol.51, pp. 497- 491, 1999.
. N. Laves son and P. Davidson, “Multi-dimensional measures function for classifier performance”, 2nd. IEEE International conference on intelligent system, pp.508-513, 2004.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.
©2023. COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY by COMPUSOFT PUBLICATION is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY. Permissions beyond the scope of this license may be available at Creative Commons Attribution 4.0 International Public License.