Framework for Managing Uncertain Distributed Categorical Data

Abstract

In recent years, data has become uncertain due to the flourishing advanced technologies that participate continuously and increasingly in producing large amounts of incomplete data. Often, many modern applications where uncertainty occurs are distributed in nature, e.g., distributed sensor networks, information extraction, data integration, social network, etc. Consequently, even though the data uncertainty has been studied in the past for centralized behavior, it is still a challenging issue to manage uncertainty over the data in situ. In this paper, we propose a framework to managing uncertain categorical data over distributed environments that is built upon a hierarchical indexing technique based on inverted index, and a distributed algorithm to efficiently process queries on uncertain data in distributed environment. Leveraging this indexing technique, we address two kinds of queries on the distributed uncertain databases 1) a distributed probabilistic thresholds query, where its answers satisfy the probabilistic threshold requirement; and 2) a distributed top-k-queries, optimizing, the transfer of the tuples from the distributed sources to the coordinator site and the time treatment. Extensive experiments are conducted to verify the effectiveness and efficiency of the proposed method in terms of communication costs and response time.

Authors and Affiliations

Adel Benaissa, Mustapha Yahmi, Yassine Jamil

Keywords

Related Articles

Analyzing Data Reusability of Raytrace Application in Splash2 Benchmark

When designing a chip multiprocessors, we use Splash2 to estimate its performance. This benchmark contains eleven applications. The performance when running them is similar, except Raytrace. We analyse it to clarity why...

Analyzing Virtual Machine Live Migration in Application Data Context

Virtualization plays a very vital role in the big cloud federation. Live and Real-time virtual machine migration is always a challenging task in virtualized environment, different approaches, techniques and models have a...

Sperm Motility Algorithm for Solving Fractional Programming Problems under Uncertainty

This paper investigated solving Fractional Programming Problems under Uncertainty (FPPU) using Sperm Motility Algorithm. Sperm Motility Algorithm (SMA) is a novel metaheuristic algorithm inspired by fertilization process...

Hypercube Graph Decomposition for Boolean Simplification: An Optimization of Business Process Verification

This paper deals with the optimization of busi-ness processes (BP) verification by simplifying their equivalent algebraic expressions. Actual approaches of business processes verification use formal methods such as autom...

Intruder Attacks on Wireless Sensor Networks: A Soft Decision and Prevention Mechanism

Because of the wide-ranging of applications in a variety of fields, such as medicine, environmental studies, robotics, warfare and security, and so forth, the research on wireless sensor networks (WSNs) has attracted muc...

Download PDF file
  • EP ID EP262291
  • DOI 10.14569/IJACSA.2017.081047
  • Views 96
  • Downloads 0

How To Cite

Adel Benaissa, Mustapha Yahmi, Yassine Jamil (2017). Framework for Managing Uncertain Distributed Categorical Data. International Journal of Advanced Computer Science & Applications, 8(10), 359-368. https://europub.co.uk/articles/-A-262291