Data Pre-Processing on Web Server Access Logs of University for User Interaction Patterns
Chaitra H.K1, Suneetha K.R2

1Chaitra H K, Assistant Professor, Department of CSE, SJB Institute of Technology, Bangalore, India.
2Dr. Suneetha K R, Associate Professor, Department of CSE, Bangalore Institute of Technology, Bangalore, India.

Manuscript received on May 25, 2020. | Revised Manuscript received on June 29, 2020. | Manuscript published on July 30, 2020. | PP: 213-220 | Volume-9 Issue-2, July 2020. | Retrieval Number: B3385079220/2020©BEIESP | DOI: 10.35940/ijrte.B3385.079220
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In the current digital Era, websites are developed and organized into multifaceted in nature. It is essential to distinguish user sessions/intent and browsing behavior from logs in order to recommend appropriate content for the web designers and administrators. This paper focuses on data preprocessing of the weblogs received from Kannada University Hampi, Vidyaranya Karnataka state are cleaned viably by applying various pre-processing methodologies. The work identifies the superior quality of data to discover user interactions, user sessions, the specific web pages, and the regularly visited Uri’s, most visited pages, most time spent on pages and incorrect webpages served to users. These pre-processed webserver access log files will be utilized to discover patterns, fine grained analysis and study. This paper also focuses on challenges of log file analysis. 
Keywords: Data cleaning, User Analysis, Log files, Data Preprocessing.