Predilection Decoded: Web Based Spam Detection and Review Analysis for Online Portals
Anam Jawaid1, Saima Dev2, Radhika Sharma3, Veena G.S4
1Anam Jawaid, Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, India.
2Saima Dev, Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, India.
3Radhika Sharma, Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, India.
4Dr. Veena G.S, Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, India.

Manuscript received on 06 April 2019 | Revised Manuscript received on 12 May 2019 | Manuscript published on 30 May 2019 | PP: 2773-2778 | Volume-8 Issue-1, May 2019 | Retrieval Number: A1433058119/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: A vast majority of people depend on pre-existing information available on social media to aid them in their decisions. The most common being: Reviews on various products available in the market. With internet services being provided to any and every human being, there are certain drawbacks with such as leaving negative or disingenuous reviews about various products and services offered on internet platforms varying in interests. The classification and determination of such spammers along with the spam content is quite growing topic for analysis and more deep research. A substantial quantity of researches have been carried out regarding this topic, however, the methodologies that have been presented are of high complexity and do not have an easy to use interface for the same. In this research paper, we put forth a simple yet highly effective framework that uses basic algorithms of cosine similarity and sentiment analysis, to implement a web-based model for spam and fake review detection. We segregate the comments as fake, meta-fake and genuine reviews. Sentiment Analysis, Negative Ratio Checking and Cosine Similarity are used for detection of fake reviews and spam content along with other examinations. Incorporating changes based on customer feedback is one of the most important activities carried out by product designers. Spam detection and fake review identification can help an organization analyze, improve and enhance their product based on the suggestions in the real classified reviews given by the customers. If this information is made public by the organization, people can decide whether to buy the product or not based on the real reviews that have been identified by the system.
Keywords: Spam Detection, Dataflow Diagrams, Datasets, Cosine Similarity, Negative Ratio Text, Bar graph, Pie Chart, Meta Fake Review Table, Database, Review Analysis, Bias Detection, Spam Detection, Java Server Page, Web Interface, User Interface

Scope of the Article: Predictive Analysis