Monitoring High Throughput Distributed System using Statistical Data Analysis
Divya Jain1, Swarnalatha P.2

1Divya Jain,“School of “Computer Science and” Engineering”“Vellore Institute of Technology”“Vellore, India.
2” Prof. Swarnalatha P.,“School of “Computer Science and” Engineering”“Vellore Institute of Technology”“Vellore, India”.
Manuscript received on March 15, 2020. | Revised Manuscript received on March 24, 2020. | Manuscript published on March 30, 2020. | PP: 4590-4590 | Volume-8 Issue-6, March 2020. | Retrieval Number: F9810038620/2020©BEIESP | DOI: 10.35940/ijrte.F9810.038620

Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: “Monitoring high throughput distributed system by using a statistical analysis of the “historical time series” of an Instrumentation Data”. “The Pipeline has been made to process the information which can be otherwise called data pipeline, is a lot of information handling components associated in arrangement, where yield of one component is the contribution of the next one”. Several codes are giving different visualization for statistical analysis of data. “Network and Cloud Data Centers” generate a lot of data every second; this data can be gathered as period arrangement information. A time-series is a grouping taken at progressive similarly dispersed focuses in time that implies at a particular time interval to a particular time, the estimations of explicit information that was taken is known as information of a time-series. “This time-series information can be gathered utilizing framework measurements like CPU, Memory, and Disk utilization”. The TICK and ELK Stack is abbreviation for a foundation of open source instruments worked “to make collection, storage, graphing, and alerting” on time arrangement data incredibly easy. As an information collector, using Telegraf, “for storing and analyzing” information and the time-series database Influx DB and Elasticsearch. For plotting and visualizing used Grafana and Kibana. Watchman is utilized for alert refinement and once system metrics usage exceeds the specified threshold, the alert is generated and sends it to the Telegram.
Keywords: ELK, TICK, Watchman, Monitoring, Grafana.
Scope of the Article: High Performance Computing.