
Largest Triangle Sampling for Visualizing Time Series in
Database
LEI RUI, Tsinghua University, China
XIANGDONG HUANG, Tsinghua University, China
SHAOXU SONG
∗
, Tsinghua University, China
CHEN WANG, Tsinghua University, China
JIANMIN WANG, Tsinghua University, China
ZHAO CAO, Huawei Technologies Co., Ltd, China
In time series visualization, sampling is used to reduce the number of points while retaining the visual features
of the raw time series. Area-based Largest Triangle Sampling (LTS) excels at preserving perceptually critical
points. However, the heuristic solution to LTS by sequentially sampling points with the locally largest triangle
area (a.k.a. Largest-Triangle-Three-Buckets, LTTB) suers from suboptimal solution and query ineciency.
We address the shortcomings by contributing a novel Iterative Largest Triangle Sampling (ILTS) algorithm
with convex hull acceleration. It renes the sampling results iteratively, capturing a broader perspective
by integrating more points in each iteration. Remarkably, we prove that the largest triangle can always be
found in the precomputed convex hulls, making the iterative sampling still ecient. Experiments demonstrate
increased visual quality over state-of-the-art baselines and signicant speedups over the brute force approach.
CCS Concepts: • Information systems → Database query processing.
Additional Key Words and Phrases: time series visualization, database query processing
ACM Reference Format:
Lei Rui, Xiangdong Huang, Shaoxu Song, Chen Wang, Jianmin Wang, and Zhao Cao. 2025. Largest Trian-
gle Sampling for Visualizing Time Series in Database. Proc. ACM Manag. Data 3, 1 (SIGMOD), Article 49
(February 2025), 26 pages. https://doi.org/10.1145/3709699
1 Introduction
Time series visualization, often represented by line charts [
7
], is extensively used. A contemporary
visualization system consists of a back-end database server and a front-end visualization client.
However, a naïve technique where the client queries the raw time series from the database fails to
meet the rapid response time requirements, due to the large amounts of data transferred. Therefore,
it is necessary to reduce the number of data points in the query result set, which can be accomplished
by replacing raw data queries with sampling queries [
27
]. An ideal sampling query should meet
two requirements: visual quality and query eciency.
∗
Shaoxu Song (https://sxsong.github.io/) is the corresponding author.
Authors’ Contact Information: Lei Rui, Tsinghua University, Beijing, China, rl18@mails.tsinghua.edu.cn; Xiangdong Huang,
Tsinghua University, Beijing, China, huangxdong@tsinghua.edu.cn; Shaoxu Song, Tsinghua University, Beijing, China,
sxsong@tsinghua.edu.cn; Chen Wang, Tsinghua University, Beijing, China, wang_chen@tsinghua.edu.cn; Jianmin Wang,
Tsinghua University, Beijing, China, jimwang@tsinghua.edu.cn; Zhao Cao, Huawei Technologies Co., Ltd, Beijing, China,
caozhao1@huawei.com.
This work is licensed under a Creative Commons Attribution International 4.0 License.
© 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
ACM 2836-6573/2025/2-ART49
https://doi.org/10.1145/3709699
Proc. ACM Manag. Data, Vol. 3, No. 1 (SIGMOD), Article 49. Publication date: February 2025.
相关文档
评论