Reconciling Schema Matching Networks Through Crowdsourcing

Journal Title: EAI Endorsed Transactions on Collaborative Computing - Year 2015, Vol 1, Issue 2

Abstract

for data integration purposes. Although several automatic schema matching tools have been developed, their results are often incomplete or erroneous. To obtain a correct set of correspondences, usually human effort is required to validate the generated correspondences. This validation process is often costly, as it is performed by highly skilled experts. Our paper analyzes how to leverage crowdsourcing techniques to validate the generated correspondences by a large group of non-experts. In our work we assume that one needs to establish attribute correspondences not only between two schemas but in a network. We also assume that the matching is realized in a pairwise fashion, in the presence of consistency expectations about the network of attribute correspondences. We demonstrate that formulating these expectations in the form of integrity constraints can improve the process of reconciliation. As in the case of crowdsourcing the user’s input is unreliable, we need specific aggregation techniques to obtain good quality. We demonstrate that consistency constraints can not only improve the quality of aggregated answers, but they also enable us to more reliably estimate the quality answers of individual workers and detect spammers. Moreover, these constraints also enable to minimize the necessary human effort needed, for the same expected quality of results.

Authors and Affiliations

Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer

Keywords

Related Articles

An Alert System on the Presence of Myriapods in Peanut Farms in Senegal

In Senegal, agriculture remains one of the most important sectors of the economy and the culture of peanut is one of the pillars in this domain. Unfortunately, the expansion of this culture is constantly hampered by atta...

Reinforcement Learning with Internal Reward for Multi-Agent Cooperation: A Theoretical Approach

This paper focuses on a multi-agent cooperation which is generally difficult to be achieved without sufficient information of other agents, and proposes the reinforcement learning method that introduces an internal rewar...

A method to determine the transient capacitance of the bifacial solar cell considering the cylindrica grain and the dynamic junction velocity (Sf)

In this paper, we present a new techninic based on the dynamic junc velocity (Sf) conconce ept for the evaluation of the transient diffusion capacitance of the bbiifacial solar cell considering cylindrical model of th he...

A Highly Concurrent Replicated Data Structure EAI Endorsed Transactions

Well defined concurrent replicated data structure is very important to design collaborative editing system, particularly, certain properties like out-of-order execution of concurrent operations and data convergence. In t...

Lighting controls and energy savings potential in tropical zone

Reducing global energy consumption is a challenge to limit the rise in average earth temperature. The use of lighting controls in the building leads to energy savings. The objective of this study is to evaluate the energ...

Download PDF file
  • EP ID EP45684
  • DOI http://dx.doi.org/10.4108/cc.1.2.e2
  • Views 460
  • Downloads 0

How To Cite

Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer (2015). Reconciling Schema Matching Networks Through Crowdsourcing. EAI Endorsed Transactions on Collaborative Computing, 1(2), -. https://europub.co.uk/articles/-A-45684