Statistical and Unsupervised MLs Analysis on Parkinson’s Disease Data Set Acquired from A.P. India
T. PanduRanga Vital1, P. Shiny2, S. E. Ashish3, T. Sai Kumar4
1T. PanduRanga Vital, Dept. of Computer Science and Engineering, Aditya Institute of Technology and Management (Autonomous), Tekkali-532 201, Andhra Pradesh, India
2P. Shiny, Dept. of Computer Science and Engineering, Aditya Institute of Technology and Management, Tekkali-532 201, Andhra Pradesh, India
3S. E. Ashish, Dept. of Computer Science and Engineering, Aditya Institute of Technology and Management, Tekkali-532 201, Andhra Pradesh, India
4T. Sai Kumar, Dept. of Computer Science and Engineering, Aditya Institute of Technology and Management, Tekkali-532 201, Andhra Pradesh, India

Manuscript received on 01 April 2019 | Revised Manuscript received on 07 May 2019 | Manuscript published on 30 May 2019 | PP: 372-380 | Volume-8 Issue-1, May 2019 | Retrieval Number: A3373058119/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In recent years, the voice analysis is the important work for identifying the neurological diseases like Parkinson’s disease (PD). PD is the subsequent general neurodegenerative disorder after Alzheimer’s lacking of dopamine in mid brain. In most people, symptoms appear at the age of 50 years or over. In this research, one thousand two hundred vowel-sounded (/.a/.e/.i/.o/.u) voice records are collected from A.P., India for analyzing people with PD from that of healthy people. The records constitute the data of 40 PD patients and 36 non-PD people who are having their age between 50 and 85. Those voice recordings are processed and relevant features or characters are extracted. Here, the data set contains features of both people with PD and healthy to distinguish performance. In this, we analyzed the PD dataset with statistical and unsupervised machine learning analysis. The efficient clustering k-means algorithm represents the Centroids of each attribute of the PD voice data set in two clusters (cluster 0, cluster 1). Another used unsupervised ML algorithm, hierarchal clustering clusters the data set in row wise (attribute wise) as well column wise (data wise) and analyze the projections of attributes and their rankings with using PCA (Principle component analysis). Parkinson’s disease (PD), Unsupervised Machine Learning, Voice, PCA
Index Terms: Parkinson’s Disease (PD), Unsupervised Machine Learning, Voice, PCA

Scope of the Article: Machine Learning