A linked bar chart is the augmentation of a traditional bar chart where each bar is partitioned into blocks and pairs of blocks are linked using orthogonal lines that pass over intermediate bars. The order of the blocks readily influences the legibility of the links. We study the algorithmic problem of minimizing the vertical length of these links, for a fixed bar order. The main challenge lies with ``dependent'' links, whose vertical link length cannot be optimized independently per bar. We show that, if the dependent links form a forest, the problem can be solved in $O(nm)$ time, for n bars and m links. If the dependent links between non-adjacent bars form a forest, the problem admits an $O(n^4m)$-time algorithm. Finally, we show that the general case is fixed-parameter tractable in the maximum number of links that are connected to one bar.
Linked bar charts are an enhanced version of traditional bar charts where each bar is subdivided into multiple blocks, with connections between blocks represented by orthogonal lines that traverse intermediate bars. The ordering of blocks directly affects the readability of connection lines. This paper investigates the algorithmic problem of minimizing the vertical length of connection lines under fixed bar ordering. The primary challenge involves "dependent links," whose vertical lengths cannot be optimized independently. The research demonstrates that: when dependent links form a forest structure, the problem is solvable in O(nm) time (n bars, m links); when dependent links between non-adjacent bars form a forest, it is solvable in O(n⁴m) time; in the general case, the problem is fixed-parameter tractable (FPT) with respect to the maximum degree of bars.
Traditional bar charts can only represent single-category data. However, in many practical scenarios, certain numerical values are not uniquely attributed to a single category but are related to multiple categories. Examples include:
Shared quantities: Communication volume between accounts appearing in statistics for both accounts
Pairwise uncertainty: Voters in election polls hesitating between two parties; border factory contributions to pollution in both countries
Visualization of cross-category data is an important requirement in data analysis. Linked bar charts represent such relationships by adding connection lines between bars, but the stacking order of blocks directly affects the vertical length of connection lines, thereby impacting chart readability and aesthetics.
This paper is the first to systematically investigate the algorithmic problem of minimizing vertical link length in linked bar charts, providing theoretical foundations and efficient algorithms for this novel visualization method.
Problem Formalization: First defines the optimization problem of minimizing vertical link length in linked bar charts, introducing a classification system of "independent links" and "dependent links"
Forest Structure Algorithm: Proves that when the dependent link subgraph forms a forest, the problem can be solved in O(nm) time via dynamic programming (Theorem 3)
Non-Adjacent Forest Algorithm: Proves that when non-adjacent dependent links form a forest, the problem can be solved in O(n⁴m) time (Theorem 6)
FPT Algorithm: Proves that in the general case, the problem is fixed-parameter tractable with respect to the maximum degree Δ of bars, with time complexity O(Δ^(3δ)·δ·n) (Theorem 8)
Complexity Lower Bounds: Proves that when vertices can have multiple unlinked blocks with arbitrary ordering, the problem is strongly NP-hard (Theorem 12)
The paper divides links into two major categories:
Independent Links (IL): Vertical length can be independently optimized by placing blocks at a fixed target t with cost |t - y| + |t - y'|. Three cases:
Some intermediate bar is higher than the highest possible position of some endpoint → target is the height of the highest intermediate bar H
Possible position intervals of two endpoints do not overlap internally → target is the lowest position of the higher interval
Some endpoint position is fixed (corresponding sequence is empty) → target is that fixed position
Dependent Links (DL): Vertical length depends on the relative positions of two blocks, cannot be assigned a static target. Further subdivided into:
Link Classification System: Independent/dependent link classification enables problem decomposition; independent links optimize locally, dependent links require global consideration
Hierarchical Dynamic Programming:
Algorithm 1: Exploits tree structure decomposition
Algorithm 2: Introduces parameterized states for ADL handling
Algorithm 3: Utilizes tree decomposition for general case
Outerplanar Graph Property Exploitation: Outerplanarity of dependent link subgraph guarantees treewidth ≤ 2, providing theoretical foundation for FPT algorithm
The paper proves problem hardness through reduction:
Theorem 12 (Strong NP-Hardness): When vertices can have multiple unlinked blocks with arbitrary ordering, minimizing vertical link length is strongly NP-hard.
Proof Method: Reduction from 3-Partition problem
Construction: 2k-1 bars, B₀ has n unlinked blocks (corresponding to 3-Partition instance)
Key property: Only B₀'s unlinked block ordering is adjustable
Equivalence: Zero vertical length scheme exists ⟺ 3-Partition solution exists
1 Bernhart & Kainen (1979): The book thickness of a graph - Foundational theory of single-page book embedding
6 Cygan et al. (2015): Parameterized algorithms - Standard textbook for FPT algorithms
11 Frederickson & Hambrusch (1988): Planar linear arrangements of outerplanar graphs - Polynomial algorithms for outerplanar graph optimization
17 van Beusekom et al. (2024): Visualizing set sizes with dependent and independent uncertainties - Original proposal of linked bar charts
Overall Assessment: This is a high-quality theoretical computational geometry paper that systematically investigates the algorithmic problem of linked bar chart optimization. The paper is theoretically rigorous, features clever algorithm design, and provides complete complexity analysis. Its main value lies in establishing solid theoretical foundations for this novel visualization method. Shortcomings include missing experimental validation and incomplete complexity characterization for the general case. Future integration of real data evaluation and heuristic algorithm design would significantly enhance its practical value.