Selective Wander Join: Fast Progressive Visualizations for Data Joins

Journal Title: Informatics - Year 2019, Vol 6, Issue 1

Abstract

Progressive visualization offers a great deal of promise for big data visualization; however, current progressive visualization systems do not allow for continuous interaction. What if users want to see more confident results on a subset of the visualization? This can happen when users are in exploratory analysis mode but want to ask some directed questions of the data as well. In a progressive visualization system, the online aggregation algorithm determines the database sampling rate and resulting convergence rate, not the user. In this paper, we extend a recent method in online aggregation, called Wander Join, that is optimized for queries that join tables, one of the most computationally expensive operations. This extension leverages importance sampling to enable user-driven sampling when data joins are in the query. We applied user interaction techniques that allow the user to view and adjust the convergence rate, providing more transparency and control over the online aggregation process. By leveraging importance sampling, our extension of Wander Join also allows for stratified sampling of groups when there is data distribution skew. We also improve the convergence rate of filtering queries, but with additional overhead costs not needed in the original Wander Join algorithm.
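The idea of steering a join sample with importance weights can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_index`, `walk_estimate_sum`), the two-table setup, and the `weights` parameter are assumptions made for illustration. Each random walk picks a row from the left table with probability proportional to a user-supplied weight, follows an index to a uniformly chosen matching row in the right table, and divides the sampled value by the probability of that path so the SUM estimate stays unbiased as long as every weight is positive.

```python
import random
from collections import defaultdict


def build_index(rows, key):
    """Hypothetical helper: group rows by join key for constant-time lookup."""
    index = defaultdict(list)
    for row in rows:
        index[row[key]].append(row)
    return index


def walk_estimate_sum(left, right_index, key, value, weights, n_walks, rng):
    """Estimate SUM(value) over left JOIN right using weighted random walks.

    weights[i] > 0 is the (assumed) user-controlled importance of left[i];
    raising it makes walks start from that row more often.
    """
    total_weight = sum(weights)
    acc = 0.0
    for _ in range(n_walks):
        # Step 1: pick a starting row with probability weights[i] / total_weight.
        i = rng.choices(range(len(left)), weights=weights, k=1)[0]
        matches = right_index.get(left[i][key], [])
        if not matches:
            continue  # a failed walk contributes 0 to the estimate
        # Step 2: pick one matching row uniformly at random.
        r = rng.choice(matches)
        # Probability of this exact path; dividing by it corrects the bias
        # introduced by non-uniform sampling.
        p = (weights[i] / total_weight) * (1.0 / len(matches))
        acc += r[value] / p
    return acc / n_walks


if __name__ == "__main__":
    left = [{"k": "a"}, {"k": "b"}]
    right = [{"k": "a", "v": 10.0}, {"k": "b", "v": 30.0}]
    index = build_index(right, "k")
    rng = random.Random(0)
    print(walk_estimate_sum(left, index, "k", "v", [1.0, 1.0], 1000, rng))
    print(walk_estimate_sum(left, index, "k", "v", [1.0, 3.0], 1000, rng))
```

In this toy setup, uniform weights give a noisy estimate of the true sum, while weights that follow the data more closely reduce the variance. That is the general reason importance sampling can speed up convergence on the part of the result a user cares about.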

Authors and Affiliations

Marianne Procopio, Carlos Scheidegger, Eugene Wu and Remco Chang

  • EP ID EP44162
  • DOI https://doi.org/10.3390/informatics6010014

How To Cite

Marianne Procopio, Carlos Scheidegger, Eugene Wu and Remco Chang (2019). Selective Wander Join: Fast Progressive Visualizations for Data Joins. Informatics, 6(1), -. https://europub.co.uk/articles/-A-44162