A Prediction of Pediatric Cardiomyopathy Disease Associated Genes using Machine Learning Algorithms
K. Jayanthi1, C. Mahesh2
1K. Jayanthi, Research Scholar, Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Avadi, Chennai (Tamil Nadu), India.
2C. Mahesh, Associate Professor, Department of Information Technology Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Avadi, Chennai (Tamil Nadu), India.
Manuscript received on 19 August 2019 | Revised Manuscript received on 10 September 2019 | Manuscript Published on 17 September 2019 | PP: 994-999 | Volume-8 Issue-2S8 August 2019 | Retrieval Number: B11900882S819/2019©BEIESP | DOI: 10.35940/ijrte.B1190.0882S819
Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Pediatric cardiomyopathy is considered as one of the heart diseases, which causes by abnormal disorder of the heart muscle. If pediatric cardiomyopathy remains untreated and unidentified at the early stages, it leads to heart failure. The global number of deaths and disability attributed to cardiomyopathy has steadily increased. Hence, machine learning approaches can solves the problem of identifying the critical problem by determining the pediatric cardiomyopathy disease associated genes from the collection of differentially expressed genes that are recognized by biological process of genes. The main objective of this study is to design a machine learning model which can predict the likelihood of pediatric cardiomyopathy in genes specified biological features with maximum of accuracy. Identified high throughput machine learning algorithms like Logistic Regression, Naive Bayes, Random Forest, and Support Vector Machine were used in this experiment to determine the genes which can be derived from internal database repository having biological process of genes specified. Experiments are conducted on Gene Expression Omnibus (GEO) datasets which sourced from cardiogenomics.org and Biohunter tool. The performance of these machine learning algorithms is evaluated on various measures like Accuracy, Precision, Recall, F-Measure, and Receiver Operating Characteristics (ROC). From the obtained results shows that Random Forest provides high accuracy 84.4% when compared to other four machine learning algorithms.
Keywords: Machine Learning Algorithms Prediction Characteristics Model.
Scope of the Article: Machine Learning