Conservative Noise Filters

Abstract

Noisy training data have a huge negative impact on machine learning algorithms. Noise-filtering algorithms have been proposed to eliminate such noisy instances. In this work, we empirically show that the most popular noise-filtering algorithms have a large False Positive (FP) error rate. In other words, these noise filters mistakenly identify genuine instances as outliers and eliminate them. Therefore, we propose more conservative outlier identification criteria that improve the FP error rate and, thus, the performance of the noise filters. With the new filter, an instance is eliminated if and only if it is misclassified by a mutual decision of Naïve Bayesian (NB) classifier and the original filtering criteria being used. The number of genuine instances that are incorrectly eliminated is reduced as a result, thereby improving the classification accuracy.

Authors and Affiliations

Mona M. Jamjoom, Khalil El Hindi

Keywords

Related Articles

Studying Applicability Feasibility of OFDM in Upcoming 5G Network

Orthogonal frequency-division multiplexing (OFDM) is one of unbeatable multiplexing technique till date. However with increasing version of next generation mobile standards like 5G, the applicability of OFDM is quite que...

Regression-Based Feature Selection on Large Scale Human Activity Recognition

In this paper, we present an approach for regression-based feature selection in human activity recognition. Due to high dimensional features in human activity recognition, the model may have over-fitting and can’t learn...

Localisation of Numerical Date Field in an Indian Handwritten Document

This paper describes a method to localise all those areas which may constitute the date field in an Indian handwritten document. Spatial patterns of the date field are studied from various handwritten documents and an al...

TOWARDS A SEAMLESS FUTURE GENERATION NETWORK FOR HIGH SPEED WIRELESS COMMUNICATIONS

The MIMO technology towards achieving future generation broadband networks design criteria is presented. Typical next generation scenarios are investigated. The MIMO technology is integrated with the OFDM technology for...

A Proposed Textual Graph Based Model for Arabic Multi-document Summarization

Text summarization task is still an active area of research in natural language preprocessing. Several methods that have been proposed in the literature to solve this task have presented mixed success. However, such meth...

Download PDF file
  • EP ID EP101638
  • DOI 10.14569/IJACSA.2016.070548
  • Views 70
  • Downloads 0

How To Cite

Mona M. Jamjoom, Khalil El Hindi (2016). Conservative Noise Filters. International Journal of Advanced Computer Science & Applications, 7(5), 354-360. https://europub.co.uk/articles/-A-101638