Learning of Concept Drift and Multi Class Imbalanced Dataset using Resampling Ensemble Methods
K. Vasantha Kokilam1, D. Ponmary Pushpa Latha2, D. Joseph Pushpa Raj3
1Mrs. K Vasantha Kokilam, Department of Information Technology at Karunya Institute of Technology and Sciences, Coimbatore (Tamil Nadu), India.
2Dr. D. Ponmary Pushpa Latha, M.C.A., MPhil., M.E., Ph. D Associate Professor, Department of Information Technology, Karunya Institute of Technology and Sciences, Coimbatore (Tamil Nadu) India.
3D. Joseph Pushpa Raj, M.E. Degree in Computer Science and Engineering and Currently, he is Working in Francis Xavier Engineering, College, Coimbatore (Tamil Nadu) India.
Manuscript received on 20 April 2019 | Revised Manuscript received on 24 May 2019 | Manuscript published on 30 May 2019 | PP: 1332-1340 | Volume-8 Issue-1, May 2019 | Retrieval Number: A3267058119/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In modern days, very often usage of mobile phones paves way for advanced technologies which includes Internet-of-Things (IoT), wearable technology and big data. As the technology grows, huge volume of data with its complexities also increases rapidly. Flooding of data leads to combat in terms of online class imbalance problem and concept drift. Class imbalance problem is one of the issues in which number of class labels is not balanced and also majority classes are given more importance than the minority class. This type of situations leads to none accurate classification of data. Spam filtering, Fault detection in Engineering industry, Disease diagnosis are few applications where multiclass imbalance with concept drift makes prediction challenging. In this paper, a novel approach of Concept Drift Detector and Resampling Ensemble (CDRE) algorithm was proposed to overcome the problem of concept drift in multi-class. Misclassification occurs sometimes due to imbalance ratio and data distribution. Detailed analysis was done based on different levels of imbalance ratio and data distribution. There is decline in accuracy when multi-class problem suffers from concept drift also. When compared to normal multi-class imbalance problem, class imbalance problem with concept drift is analyzed. Concept Drift Detector and Resampling Ensemble (CDRE) algorithm was implemented to deal multi-class problem with concept drift. CDRE algorithm shows better results in recall, precision, F-measure on an average 85% when compared with algorithm without optimization.
Keywords: Concept Drift, Imbalance Ratio, Multi-Class, Data Distribution, Bagging.
Scope of the Article: E-Learning