Big Data on Content Credibility of Social Networking Sites and Instant Messaging Applications
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5
Abstract
The quantity of structured and unstructured data from social networking platforms and mobile instant messaging applications is massive and is produced at an exponential rate, yet there is no mechanism to verify the content's truthfulness and trustworthiness. In this paper we propose a theory of how Big Data technology can be employed to validate the credibility of this widely diffused data. Technologies like Hadoop can be used to analytically process the fast-paced incoming data and measure the reliability of the content. To verify its integrity, the processed data must be checked against trusted sources. These sources must be accessible and should hold trustworthy data that can assist in measuring the authenticity of content from various sources. While privacy concerns are often dismissed when data is scraped from public-facing platforms such as Facebook, it is prudent for these sites to validate the data posted on them. Posting false rumors reduces the extent to which social networking acts as an effective method of spreading true information. In this paper we provide a brief exploration of Big Data, Hadoop, and the influence of unsolicited messages propagated from social networking websites and instant messaging applications.
Authors and Affiliations
Anurag Kumar, Monica Mishra, Rinkita Mittal
Performance Evaluation of Different Data Mining Classification Algorithm and Predictive Analysis
Data mining is the process of discovering knowledge by analyzing large volumes of data from various perspectives and summarizing it into useful information; data mining has become an essential component in various...
Exponential software reliability using SPRT: MLE
In classical hypothesis testing, volumes of data must be collected before conclusions are drawn, which may take more time. Sequential analysis from statistical science, however, could be adopted in order to ...
Intelligent Fault Identification System for Transmission Lines Using Artificial Neural Network
Transmission and distribution lines are vital links between generating units and consumers. They are exposed to the atmosphere, so the chance of a fault occurring in a transmission line is very high, which has to be...
Version Control in Open Source Software
Open source software is software whose source code is freely available to anyone. Open source software can be redistributed to other users, who can use it according to their own needs. Version C...
Enhanced cAntMinerPB Algorithm for Induction of Classification Rules using Ant Colony Approach
Mining classification rules from data is a key task of data mining and has received great attention in recent years. Rule induction is a method used in data mining where the desired output is a set of R...