Pre-Processing Approach for Discrimination Prevention in Data Mining
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 3, Issue 4
Abstract
Data mining is an important technology for extracting useful knowledge hidden in large collections of data. In data mining, discrimination is a very important issue when considering the legal and ethical aspects of data mining. It is more than observable that the majority people do not want to be discriminated because of their gender, nationality, religion, age and so on. Especially when these type of attributes are used for decision making purpose such as giving them a job, loan. Insurance etc.. Discrimination can be either direct or indirect. Direct discrimination occurs when decisions are made based on sensitive attributes. Indirect discrimination occurs when decisions are made based on non-sensitive attributes which are strongly correlated with biased sensitive ones. So we introduce an antidiscrimination techniques which including discrimination discovery and prevention. In the discrimination prevention method, we introduce a group of pre-processing discrimination prevention methods and specify the different features of each approach and how these approaches deal with direct or indirect discrimination. We discuss how to clean training data sets and outsourced data sets in such a way that direct and/or indirect discriminatory decision rules are converted to nondiscriminatory classification rules. Some metrics are used to evaluate the performance of those approaches is also given.
Authors and Affiliations
Mr. Pravin D. Kaware
Pre-Processing Approach for Discrimination Prevention in Data Mining
Data mining is an important technology for extracting useful knowledge hidden in large collections of data. In data mining, discrimination is a very important issue when considering the legal and ethical aspects o...
IMPLEMENTATION OF EFFICIENT ALGORITHMS FOR LOAD BALANCING MODELING WEB-BASED CLOUD APPLICATIONS
Load balancing is one of the central issues in cloud computing. Since millions of users are accessing the cloud every moment, the concept of load balancing has an important impact on the performance of cloud computing....
SIMULATION AND HARDWARE IMPLEMENTATION OF INTERLEAVED TECHNIQUE BASED DC-DC BUCK-BOOST CONVERTER
DC-DC converters has wide range of applications in renewable energy systems, hybrid systems, electric vehicles, fuel cells, and industries. Interleaved topology of dc-dc converter has capabilities of correctin...
An Implementation for Conserving Privacy based on Encryption Process to Secured Cloud Computing Environment
This paper describes a study on the existing methods, techniques and proposed implementation approach for cloud computing. Cloud computing is a style of computing in which dynamically scalable and often virtualize...
Association Thermodynamic Parameters (Conductometrically) for Nano Cobalt Sulfate in Mixed EtOH–H2O Solvents at Different Temperatures
The molar conductance for nano cobalt sulfate (CoSO4) in different percentages of ethanol (EtOH) and water were measured at 298.15, 303.15, 308.15 and 313.15K. From the molar conductance for nano CoSO4, the solvat...