Ensemble based novel class identification for Class Imbalance under sampled Data
Journal Title: International Journal for Research in Applied Science and Engineering Technology (IJRASET) - Year 2016, Vol 4, Issue 11
Abstract
The Classification of data is represented as research challenge in handling the class imbalance problem. Classification problems are represented by highly unbalanced data sets, in which, the number of samples from one class is much smaller than from another. This is known as class imbalance problem and is often reported as an obstacle for constructing a model that can successfully discriminate the minority samples from the majority samples. Under sampling is a popular method in dealing with class-imbalance problems, which uses only a subset of the majority class and thus is very efficient. The main deficiency is that many minority class examples are ignored. Ensemble based under sampling method is proposed for the class imbalance problem. The class imbalance problem is defined in terms of which the ratio of the majority and minority class cardinalities is inversed. The main idea is to severely under sample the majority class thus creating a large number of distinct training sets using normalized information gain. For each training set we then find a decision boundary which separates the minority class from the majority class using the classifier c5.0. By combining the multiple designs through fusion, we construct a composite boundary between the majority class and the minority class using entropy calculation. Experimental results show that both proposed method class-imbalance learning method out performs state of arts approaches higher in terms of precision, recall and f-measure for disproportionate class sample size for different boundaries.
Authors and Affiliations
Sulfyth. M, Mrs. D. Priyadarshini
Generation High Voltage: A Technique for Laboratory Educational Works
In this paper a method is discussed to generate high voltage DC up to 110kV using Cockroft-Walton Voltage Multiplier for study and research at educational laboratory. As High Voltage DC (HVDC) transmission is becoming m...
College Monitoring System
Now-a-days, education is playing very significant role in the society. Admissions are increasing day-by-day so there by a ratio of establishment of new colleges are also increasing. But the actual challenge is starting...
Anaerobic digestion of Municipal Solid biodegradable wastes for methane production:A Review
The untreated and undisposed municipal solid waste generated through different sources is a major concern of the world now-a-days. There are millions of tonnes of municipal solid waste produced every year and the amount...
Implementation of an efficient low complexity method for wireless CE using a BEM for the wireless channel taps.
The matrix representation of the signal model of MIMO-OFDM systems, which clearly describes the relation of signals in frequency domain and time domain and expressing operations like adding CP and removing CP as matrix...
An overview of Multiplicative data perturbation for privacy preserving Data mining
Privacy is an important issue when one wants to make use of data that involves individuals’ sensitive information. Research on protecting the privacy of individuals and the confidentiality of data has received contribut...