Multi-Class Tweet Categorization Using Map Reduce Paradigm

Journal Title: INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY - Year 2014, Vol 9, Issue 2

Abstract

Twitter is one of the most popular micro-blogging website in today's globalized world. Twitter messages can be mined to gain valuable information. Although Twitter provides a list of most popular topics people tweet about known as Trending Topics in real time, it is often hard to understand what these trending topics are about. Therefore, various efforts are being made to classify these topics into general categories with high accuracy for better information retrieval. We propose the use of one of the classification algorithm called Naïve Bayes for the categorization of tweets which has been discussed in this paper. It then proposes how the Map – Reduce paradigm can be applied to existing Naïve Bayes algorithm to handle large number of tweets.

Authors and Affiliations

Mohit Tare , Indrajit Gohokar , Jayant Sable , Devendra Paratwar , Rakhi Wajgi

Keywords

Related Articles

Microblogging Service to Report about Earthquake

Data Mining is the extraction of hidden predictive information from large Database set. The huge amount of data is a key resource to be processed and analyzed for knowledge extraction. Volcanic action Reporting system is...

BPR: Evaluation of Existing Methodologies and Limitations

Many of known organizations had their business processes changed and reengineered in order to achieve their objectives, meet their customer’s expectations and attain competitive advantage. Thus, they were willing to adop...

To Improve Data Security by Using Secure Data Transmission

The “Secure data transmission” is a software solution which provides security during transmission .Present day security is the main issue that the third person attacks on the data. To provide security Tiny Encsryption Al...

Analysis of Job Scheduling Algorithms in Cloud Computing

Cloud computing is flourishing day by day and it will continue in developing phase until computers and internet era is in existence. While dealing with cloud computing, a number of issues are confronted like heavy load o...

Risk Assessment in Online Banking System

With the development of information technology and the popular use of the information network system, the security of the information system becomes particularly important. To ensure the security of the information syste...

Download PDF file
  • EP ID EP157683
  • DOI -
  • Views 126
  • Downloads 0

How To Cite

Mohit Tare, Indrajit Gohokar, Jayant Sable, Devendra Paratwar, Rakhi Wajgi (2014). Multi-Class Tweet Categorization Using Map Reduce Paradigm. INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY, 9(2), 78-81. https://europub.co.uk/articles/-A-157683