POS Tagging of Hindi-English Code Mixed Text from Social Media

Abstract

Language is a way of expressing ideas and feelings through movement, symbols, and sounds, in a particular style of speaking and writing. It takes two forms: spoken and written. Spoken language is a form of communication in which words drawn from a large vocabulary (usually around 10,000), together with a diverse variety of names, are uttered with the mouth, while written language is the representation of a language by means of a writing system. Hundreds of millions of people in the world are multilingual and routinely use two or more languages in their daily lives. Social media is the social interaction among people in which they create and share information and ideas in virtual communities and networks. One social media feature that users update at any time is the status. Through a status, users can share activities, news, and opinions, exchange ideas, conduct business, and so on. They can also comment on or respond to the latest statuses of their fellow users. Social media users sometimes mix several languages when updating their status or commenting on their friends' statuses, for example when they chat with other people on Facebook or WeChat. Information retrieval deals with storing and retrieving information from all types of resources, including social media, where tokenizing and text processing are very difficult.
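To illustrate why tokenizing code-mixed social media text is difficult, the following is a minimal sketch in Python. It is not the authors' method: the token patterns, the function names `tokenize` and `script_of`, and the sample status are all assumptions made for illustration. The sketch splits a Hindi-English status into tokens and labels each token by script only.

```python
import re

# Hypothetical token patterns (an assumption, not taken from the paper).
# Order matters: earlier alternatives win at a given position.
TOKEN_PATTERNS = [
    r"https?://\S+",                   # URLs
    r"[@#]\w+",                        # mentions and hashtags
    r"[:;=][-']?[()DPp]",              # a few simple emoticons such as :) or ;-D
    r"[\u0900-\u0963\u0966-\u097F]+",  # Devanagari runs (danda marks excluded)
    r"[A-Za-z]+(?:'[A-Za-z]+)?",       # Latin-script words, optional apostrophe
    r"\d+(?:[.,]\d+)*",                # numbers
    r"[^\w\s]",                        # any other single symbol
]
TOKEN_RE = re.compile("|".join(TOKEN_PATTERNS))


def tokenize(text):
    """Split a social media status into a list of tokens."""
    return TOKEN_RE.findall(text)


def script_of(token):
    """Label a token by writing system only, not by language."""
    if any("\u0900" <= ch <= "\u097F" for ch in token):
        return "DEVANAGARI"
    if token.isascii() and token.isalpha():
        return "LATIN"
    return "OTHER"


if __name__ == "__main__":
    # Invented sample status mixing romanized Hindi, Devanagari, and English.
    status = "Aaj ka match बहुत अच्छा था :) #INDvsAUS @rahul kya bolta hai?"
    for tok in tokenize(status):
        print(tok, script_of(tok))
```

In this sample, "Aaj", "ka", and "kya" are romanized Hindi, yet a script check labels them LATIN just like the English word "match". Script alone therefore cannot separate the two languages, which is one reason tokenization and downstream processing such as POS tagging are hard for this kind of text.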

Authors and Affiliations

Ajita Singh, Amit Kanskar, Angad Singh

How To Cite

Ajita Singh, Amit Kanskar, Angad Singh (2016). POS Tagging of Hindi-English Code Mixed Text from Social Media. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 4(10), -. https://europub.co.uk/articles/-A-22742