Reconciling Schema Matching Networks Through Crowdsourcing
Journal Title: EAI Endorsed Transactions on Collaborative Computing - Year 2015, Vol 1, Issue 2
Abstract
for data integration purposes. Although several automatic schema matching tools have been developed, their results are often incomplete or erroneous. Obtaining a correct set of correspondences usually requires human effort to validate the generated correspondences. This validation is often costly, as it is performed by highly skilled experts. Our paper analyzes how to leverage crowdsourcing techniques so that a large group of non-experts can validate the generated correspondences. In our work we assume that attribute correspondences must be established not only between two schemas but across a network of schemas. We also assume that the matching is realized in a pairwise fashion, in the presence of consistency expectations about the network of attribute correspondences. We demonstrate that formulating these expectations as integrity constraints can improve the reconciliation process. Because user input in crowdsourcing is unreliable, specific aggregation techniques are needed to obtain good-quality results. We demonstrate that consistency constraints not only improve the quality of aggregated answers but also enable us to estimate the answer quality of individual workers more reliably and to detect spammers. Moreover, these constraints make it possible to minimize the human effort needed for the same expected quality of results.
Authors and Affiliations
Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer
A Scheme for Collaboratively Processing Nearest Neighbor Queries in Oblivious Storage
Security concerns are a substantial impediment to the wider deployment of cloud storage. There are two main concerns on the confidentiality of outsourced data: i) protecting the data, and ii) protecting the access patter...
Experience with Teaching an Online Course in Mobile App Development for Senegalese Students and Professionals
Online education has been described as a means to transform access to education and increase opportunities at a global scale. In this paper, we describe our experience with teaching an eight-week online course in mobile...
Automated Dimension Determination for NMF-based Incremental Collaborative Filtering
The nonnegative matrix factorization (NMF) based collaborative filtering techniques have achieved great success in product recommendations. It is well known that in NMF, the dimensions of the factor matrices have t...
Analysis of Differential Synchronisation’s Energy Consumption on Mobile Devices
Synchronisation algorithms are central to collaborative editing software. As collaboration is increasingly mediated by mobile devices, the energy efficiency of such algorithms is of interest to a wide community of applicatio...
The Richness of Open-ended Play - Rules, feedback and adaptation mechanisms in intelligent play environments
How can we design intelligent play environments for open-ended play that support richness in play? Rich play can be described as ongoing play that changes over time in character, form and nature. This paper elaborates on...