Using Unlabeled Data to Improve Inductive Models by Incorporating Transductive Models
Journal Title: International Journal of Advanced Research in Artificial Intelligence(IJARAI) - Year 2014, Vol 3, Issue 2
Abstract
This paper shows how to use labeled and unlabeled data to improve inductive models with the help of transductivemodels.We proposed a solution for the self-training scenario. Self- training is an effective semi-supervised wrapper method which can generalize any type of supervised inductive model to the semi-supervised settings. it iteratively refines a inductive model by bootstrap from unlabeled data. Standard self-training uses the classifier model(trained on labeled examples) to label and select candidates from the unlabeled training set, which may be problematic since the initial classifier may not be able to provide highly confident predictions as labeled training data is always rare. As a result, it could always suffer from introducing too much wrongly labeled candidates to the labeled training set, which may severely degrades performance. To tackle this problem, we propose a novel self-training style algorithm which incorporate a graph-based transductive model in the self-labeling process. Unlike standard self-training, our algorithm utilizes labeled and unlabeled data as a whole to label and select unlabeled examples for training set augmentation. A robust transductive model based on graph markov random walk is proposed, which exploits manifold assumption to output reliable predictions on unlabeled data using noisy labeled examples. The proposed algorithm can greatly minimize the risk of performance degradation due to accumulated noise in the training set. Experiments show that the proposed algorithm can effectively utilize unlabeled data to improve classification performance.
Authors and Affiliations
ShengJun Cheng, Jiafeng Liu, XiangLong Tang
Web-based Expert Decision Support System for Tourism Destination Management in Nigeria
The use of Information Technologies have played and currently playing prominent roles in many organizations, such as business, education, commerce. The tourism industry has witnessed the use and application of vari...
Category Decomposition Method for Un-Mixing of Mixels Acquired with Spaceborne Based Visible and Near Infrared Radiometers by Means of Maximum Entropy Method with Parameter Estimation Based on Simulated Annealing
Category decomposition method for un-mixing of mixels (Mixed Pixels) which is acquired with spaceborne based visible to near infrared radiometers by means of Maximum Entropy Method (MEM) with parameter estimation b...
An Intelligent Location Management approaches in GSM Mobile Network
Location management refers to the problem of updating and searching the current location of mobile nodes in a wireless network. To make it efficient, the sum of update costs of location database must be minimized. Previo...
Bi-Directional Reflectance Distribution Function: BRDF Effect on Un-mixing, Category Decomposition of the Mixed Pixel (MIXEL) of Remote Sensing Satellite Imagery Data
Method for unmixing, category decomposition of the mixed pixel (MIXEL) of remote sensing satellite imagery data taking into account the effect due to Bi-Directional Reflectance Distribution Function: BRDF is proposed. Al...
Comparative study between the proposed shape independent clustering method and the conventional methods (K-means and the other)
Cluster analysis aims at identifying groups of similar objects and, therefore helps to discover distribution of patterns and interesting correlations in the data sets. In this paper, we propose to provide a consist...