Exploratory Data Analysis on Diamond Dataset using R
K. Durga Charan1, K. Sandhya Rani2, Somula Ramasubbareddy3, K. Govinda4, E. Swetha5
1K. Durga Charan, Dhanekula Institute of Engineering, Technology, Vijayawada (Andhra Pradesh), India.
2K. Sandhya Rani, Dhanekula Institute of Engineering & Technology, Vijayawada (Andhra Pradesh), India.
3Somula Ramasubbareddy, VNRVJIET, Hyderabad (Telangana), India.
4K. Govinda, VIT University, Vellore (Tamil Nadu), India.
5E. Swetha, SV College of Engineering, Tirupati (Andhra Pradesh), India.
Manuscript received on 27 March 2019 | Revised Manuscript received on 06 April 2019 | Manuscript Published on 27 April 2019 | PP: 698-701 | Volume-7 Issue-6S2 April 2019 | Retrieval Number: F11170476S219/2019©BEIESP
Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The socioeconomic and political history of the diamond industry is fascinating. Understanding diamonds are important because each diamond is unique in its way. Even an expert cannot incorporate as much information about price as a picture of the entire market without analysing the characteristics of the diamonds. Hadley’s ggplot2 ships with a data set that records the carat size and the price of more than 50 thousand diamonds, from Diamond Search Engine collected in 2008. In this paper we perform an exploratory data analysis using R on the diamond dataset to understand the diamond market trends, quality and price by analysing factors for Market Research. The difficulties that may arise with the analysis include, improper dataset, Presence of Outliers, Faulty Data etc. Also since the dataset dates back to 2008, the estimates may not be used with today’s market trends.
Keywords: Diamond, Exploratory Data analysis, R, Market Research.
Scope of the Article: Data Analytic