Selective Wander Join: Fast Progressive Visualizations for Data Joins

Journal Title: Informatics - Year 2019, Vol 6, Issue 1

Abstract

Progressive visualization offers a great deal of promise for big data visualization; however, current progressive visualization systems do not allow for continuous interaction. What if users want to see more confident results on a subset of the visualization? This can happen when users are in exploratory analysis mode but want to ask some directed questions of the data as well. In a progressive visualization system, the online aggregation algorithm determines the database sampling rate and resulting convergence rate, not the user. In this paper, we extend a recent method in online aggregation, called Wander Join, that is optimized for queries that join tables, one of the most computationally expensive operations. This extension leverages importance sampling to enable user-driven sampling when data joins are in the query. We applied user interaction techniques that allow the user to view and adjust the convergence rate, providing more transparency and control over the online aggregation process. By leveraging importance sampling, our extension of Wander Join also allows for stratified sampling of groups when there is data distribution skew. We also improve the convergence rate of filtering queries, but with additional overhead costs not needed in the original Wander Join algorithm.
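The abstract does not spell out the sampling procedure, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes a two-table join, a hypothetical function name `selective_walk_sum`, and user-supplied per-group weights that bias which rows start a random walk. Each walk's contribution is divided by its sampling probability (a Horvitz-Thompson style correction), so the per-group SUM estimates stay unbiased while the favored groups converge faster.

```python
import random
from collections import defaultdict

def selective_walk_sum(left, right, group_weight, n_walks, seed=0):
    """Estimate SUM(right.value) per left.group over left JOIN right ON key.

    left:  list of (key, group) rows      (hypothetical schema)
    right: list of (key, value) rows      (hypothetical schema)
    group_weight: positive sampling weight per group; groups not listed get 1.0
    """
    rng = random.Random(seed)

    # Index the right table by join key so a walk can step to a matching row.
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)

    # Importance-sampling distribution over left rows (weights must be > 0).
    weights = [group_weight.get(group, 1.0) for _, group in left]
    total_weight = sum(weights)

    sums = defaultdict(float)
    for _ in range(n_walks):
        # Step 1: pick a starting row, biased toward the groups the user favors.
        i = rng.choices(range(len(left)), weights=weights)[0]
        key, group = left[i]
        p_row = weights[i] / total_weight

        # Step 2: pick one matching right row uniformly; a failed walk adds 0.
        matches = index.get(key, [])
        if not matches:
            continue
        value = rng.choice(matches)

        # Walk probability is p_row * (1 / len(matches)); divide it out.
        sums[group] += value * len(matches) / p_row

    # Average over all walks, including failed ones, to keep the estimate unbiased.
    return {group: s / n_walks for group, s in sums.items()}
```

Raising one group's weight, for example `selective_walk_sum(left, right, {"a": 3.0}, 10_000)`, directs more walks to that group without changing the quantity being estimated, which is the kind of user-driven control over convergence that the abstract describes.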

Authors and Affiliations

Marianne Procopio, Carlos Scheidegger, Eugene Wu and Remco Chang

  • EP ID EP44162
  • DOI https://doi.org/10.3390/informatics6010014

How To Cite

Marianne Procopio, Carlos Scheidegger, Eugene Wu and Remco Chang (2019). Selective Wander Join: Fast Progressive Visualizations for Data Joins. Informatics, 6(1). https://europub.co.uk/articles/-A-44162