An Efficient Method for Noisy Annotation Data Modeling
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5
Abstract
Probabilistic topic models are used to analyze and extract content-related annotations from noisy annotated discrete data, such as web pages stored in social bookmarking services. Because users of these services can attach annotations freely, some annotations do not describe the semantics of the content; such annotations are noisy, that is, not content-related. Extracting content-related annotations can serve as a preprocessing step in machine learning tasks such as text classification and image recognition, and can improve information retrieval performance. The proposed model is a generative model for content and annotations, in which each annotation is assumed to originate either from the topics that generated the content or from a general distribution unrelated to the content. We demonstrate the effectiveness of the proposed method on synthetic data and on real social annotation data for text and images.
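The generative assumption described in the abstract can be illustrated with a toy sampler. This is only a minimal sketch under stated assumptions, not the authors' model: the function name `sample_document`, the parameter `relevance_prob`, the flat topic proportions, the rule that a content-related annotation reuses the topic of a randomly chosen word, and all example distributions are hypothetical and chosen for illustration.

```python
import random

# Hypothetical toy distributions (not taken from the paper).
TOPIC_WORD = [{"goal": 0.5, "match": 0.5}, {"cpu": 0.5, "kernel": 0.5}]
TOPIC_ANNOT = [{"sports": 1.0}, {"computing": 1.0}]
GENERAL_ANNOT = {"toread": 0.5, "later": 0.5}  # content-unrelated tags


def draw(dist, rng):
    """Draw one key from a {item: probability} dictionary."""
    items = list(dist)
    return rng.choices(items, weights=[dist[i] for i in items])[0]


def sample_document(n_words, n_annotations, topic_word, topic_annot,
                    general_annot, relevance_prob, rng):
    """Sample words and annotations for one document (requires n_words >= 1)."""
    n_topics = len(topic_word)
    # Assumption: topic proportions come from a flat Dirichlet,
    # sampled here as normalized Gamma(1, 1) draws.
    raw = [rng.gammavariate(1.0, 1.0) for _ in range(n_topics)]
    total = sum(raw)
    theta = [r / total for r in raw]

    word_topics, words = [], []
    for _ in range(n_words):
        z = rng.choices(range(n_topics), weights=theta)[0]
        word_topics.append(z)
        words.append(draw(topic_word[z], rng))

    annotations = []
    for _ in range(n_annotations):
        if rng.random() < relevance_prob:
            # Content-related: reuse a topic that generated the content
            # (assumption: the topic of a uniformly chosen word).
            z = rng.choice(word_topics)
            annotations.append((draw(topic_annot[z], rng), True))
        else:
            # Noisy: drawn from the general, content-unrelated distribution.
            annotations.append((draw(general_annot, rng), False))
    return words, annotations


if __name__ == "__main__":
    rng = random.Random(0)
    print(sample_document(6, 4, TOPIC_WORD, TOPIC_ANNOT, GENERAL_ANNOT, 0.7, rng))
```

Each annotation is returned with a flag marking whether it was drawn as content-related. In real data this flag is unobserved, and inferring it is the extraction task the abstract describes.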
Authors and Affiliations
Sushama Shinde, Shyam Gupta