Comparative Study of Classification Algorithms in Chronic Kidney Disease
Pratibha Devishri S1, Ragin O R2, Anisha G S3
1Pratibha Devishri S PG Student, Master of Computer Application, Amrita Vishwa Vidyapeetham, Amrita School of Arts and Sciences, Brahmasthanam, Edappally North P.O. Kochi – 682 024, Kerala.
2Ragin O R PG Student, Master of Computer Application, Amrita Vishwa Vidyapeetham, Amrita School of Arts and Sciences, Brahmasthanam, Edappally North P.O. Kochi – 682 024, Kerala.
3Anisha G S, Faculty Associate, Dept. of CS & IT, Amrita Vishwa Vidyapeetham, Amrita School of Arts and Sciences, Edappally North P.O. Kochi-682024, Kerala. Qualification – MCA, M. Phil (CS). Area of Interest – Networks and Data Mining.
Manuscript received on 01 April 2019 | Revised Manuscript received on 05 May 2019 | Manuscript published on 30 May 2019 | PP: 180-184 | Volume-8 Issue-1, May 2019 | Retrieval Number: A3000058119/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Chronic Kidney Disease is a very dangerous health problem that has been spreading globally due to alterations in lifestyle such as food habits, changes in the atmosphere, etc. So it is essential to decide on any remedy to avoid and to predict the disease in early stage which helps to avoid wastage of life. We show that feature selection approach is well suited for chronic kidney disease prediction. Principal Component Analysis is one of the feature selection techniques that filters out less important attributes; it also picks attributes of importance from the dataset. We also compare different data classification approaches in terms of how accurately they predict chronic kidney disease. We examine Decision stump, Rep tree, IBK, K-star, SGD and SMO classifiers using performance measures like Kappa statistics, Receiver Operating Characteristic, Mean Absolute Error and Root mean squared Error using WEKA. Accuracy measures used to compare classifiers are Recall, F-measure and Precision by implementing on WEKA. WEKA-a software for data mining, that uses collection of algorithm for data mining. It is possible to apply these algorithms directly to the data or call them from java code. Results obtained show better accuracy measures for Decision stump and Rep tree where the mean absolute error were less with error rate of 0.010 and 0.012 respectively.
Index Terms: Chronic Kidney Disease, Principal Component Analysis, Decision Stump, Rep tree, IBK, K-Star, SGD, SMO Recall, F-measure, Precision
Scope of the Article: Classification