IEEE VIS 2024 Content: De-cluttering Scatterplots with Integral Images

De-cluttering Scatterplots with Integral Images

Hennes Rave -

Vladimir Molchanov -

Lars Linsen -

Room: Bayshore I

2024-10-17T13:30:00ZGMT-0600Change your timezone on the schedule page
2024-10-17T13:30:00Z
Exemplar figure, described by caption below
UMAP embedding of the MNIST dataset with color-coded classes after four iterations of our algorithm (top left), with grid lines (top right), with density background texture (bottom left), and with contour lines (bottom right).
Fast forward
Abstract

Scatterplots provide a visual representation of bivariate data (or 2D embeddings of multivariate data) that allows for effective analyses of data dependencies, clusters, trends, and outliers. Unfortunately, classical scatterplots suffer from scalability issues, since growing data sizes eventually lead to overplotting and visual clutter on a screen with a fixed resolution, which hinders the data analysis process. We propose an algorithm that compensates for irregular sample distributions by a smooth transformation of the scatterplot's visual domain. Our algorithm evaluates the scatterplot's density distribution to compute a regularization mapping based on integral images of the rasterized density function. The mapping preserves the samples' neighborhood relations. Few regularization iterations suffice to achieve a nearly uniform sample distribution that efficiently uses the available screen space. We further propose approaches to visually convey the transformation that was applied to the scatterplot and compare them in a user study. We present a novel parallel algorithm for fast GPU-based integral-image computation, which allows for integrating our de-cluttering approach into interactive visual data analysis systems.