POS Tagging of Hindi-English Code Mixed Text from Social Media

Abstract

Language is way of expressing ideas and feelings using movement, symbol and sounds; particular style of speaking and writing. Language is divided into two, spoken language and written language. Spoken language is a form of communication in which words derived from a large vocabulary (usually at 10.000) together with a diverse variety of names are uttered through or with the mouth, while written language is the representation of a language by means of a writing system. Hundreds of millions people in the world routinely use two or more languages in their daily lives (multilingual). Social media is the social interaction among people in which they treat, share information and ideas in virtual communities and networks. One of social media features that are updated any time by users is status. Through status, the user can inform all activity, news, opinions, exchange ideas, business, and so on. In addition, they also are able to comment or respond to the latest status of their fellow social media users.The user of the social media sometimes mixes and uses several languages to update their status or comment to their friends’ status, for example when they chat with other people at facebook or wechat. Information retrieval deals with the issues of storing and retrieving information from all types of resources inlcuding social media which is very tough with regard to tokenizing and text processing.

Authors and Affiliations

Ajita Singh, Amit Kanskar, Angad Singh

Keywords

Related Articles

Study of pH and electrical Conductivity of Soil in Deulgaon Raja Taluka, Maharashtra

The aim of this paper is to study the pH and electrical conductivity of soil and find out its nutrient value in region of Deulgaon raja taluka Maharashtra. Soil pH is important as it affects the growth of plants. pH of...

A Study on Comparative Analysis in Retail Industry with Special Reference to Kalessuwari Refinery Pvt.Ltd

The project has been undertaken with a view to study the comparative analysis in retail industry. This study is intended to help Kalessuwari to achieve competitive advantages among other competitors. Major domestic play...

Comparative Analysis for Least Mean Square and Normalized LMS for Speech Enhancement Application

Adaptive Signal Processing (ASP) is an active research area. Adaptive Filter based speech enhancement technique is now a day’s getting very popular due to wide range of applications like mobile communication, hearing ai...

Iot Based Smart Parking System

In modern days concepts of smart cities have gained grater popularity. Problems like limited car parking services and road safety are being addressed by IoT. In this thesis an IoT based cloud integrated smart parking sy...

Pre-Treatment Process for Sorghum Biomass for Preparation of Bio-Ethanol

BIO-MASS Biomass is the term used for the biological material from living or recently living organisms such as wood, waste materials, gases and alcohol fuels. It is commonly plant matter that is specifically grown in o...

Download PDF file
  • EP ID EP22742
  • DOI -
  • Views 273
  • Downloads 4

How To Cite

Ajita Singh, Amit Kanskar, Angad Singh (2016). POS Tagging of Hindi-English Code Mixed Text from Social Media. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 4(10), -. https://europub.co.uk/articles/-A-22742