An Efficient Method for Noisy Annotation Data Modeling

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5

Abstract

Abstract : Probabilistic topic models are used for analyzing and extracting content-related annotations from noisy annotated discrete data like WebPages on WWW and these WebPages are stored using social bookmarking services with the help of social bookmarking services, reason behind this process most of time users can attach annotations freely, some annotations do not describe the semantics of the content, therefore they are noisy, simply they are not content related. The extraction of content-related annotations can be used as a prepossessing step in machine learning. Prepossessing step in machine learning is like text classification and image recognition, and can improve information retrieval performance. The proposed model is a generative model for content and annotations, where annotations are assumed to be originated either from topics that generated the content or from a general distribution unrelated to the content. We demonstrate the effectiveness of the proposed method with the help of synthetic data and real social annotation data for text and images

Authors and Affiliations

Sushama Shinde , Shyam Gupta

Keywords

Related Articles

Migrate and Map: A Framework to Access Data from Mysql, Mongodb or Hbase Using Mysql Queries

Abstract: Due to ever-increasing amount of data, scalability factor of the databases becomes a major constraint. Moreover, traditional relational databases fix the user’s perspective to view data in tabular format. Paral...

 Investigation and Analysis of SNR Estimation in OFDM system

 Estimation of signal to noise ratio (SNR) of received signal and to transmit the signal effectively for the modern communication system. The performance of existing non-data-aided (NDA) SNR estimation methods &nb...

 Enhancement of Cryptographic Security using Stopping Sets

 In this paper, we have used channel codes in cryptographic secrecy. Main idea of this paper is to develop the notion of combined security due to cryptography and channel coding. Thus, it is providing a more complet...

 Twitter Sentiment Classification on Sanders Data using HybridApproach

Abstract : Sentiment analysis is very perplexing and massive issue in the field of social data mining. Twitter isone of the mostly used social media where people discuss on various issues in a dense way. The tweets about...

Direction-Length Code (DLC) To Represent Binary Objects

 Abstract: More and more images have been generated in digital form around the world. Efficient way of description and classification of objects is a well needed application to identify the objects present in images...

Download PDF file
  • EP ID EP152688
  • DOI 10.9790/0661-16512024
  • Views 190
  • Downloads 0

How To Cite

Sushama Shinde, Shyam Gupta (2014).  An Efficient Method for Noisy Annotation Data Modeling. IOSR Journals (IOSR Journal of Computer Engineering), 16(5), 20-24. https://europub.co.uk/articles/-A-152688