We present an algorithm that given any invertible symmetric diagonally dominant M-matrix (SDDM), i.e., a principal submatrix of a graph Laplacian, $\boldsymbol{\mathit{L}}$ and a nonnegative vector $\boldsymbol{\mathit{b}}$, computes an entrywise approximation to the solution of $\boldsymbol{\mathit{L}} \boldsymbol{\mathit{x}} = \boldsymbol{\mathit{b}}$ in $\tilde{O}(m n^{o(1)})$ time with high probability, where $m$ is the number of nonzero entries and $n$ is the dimension of the system.
Paper ID : 2511.16570Title : Entrywise Approximate Solutions for SDDM Systems in Almost-Linear TimeAuthors : Angelo Farfan (MIT), Mehrdad Ghadiri (MIT), Junzhao Yang (CMU)Categories : cs.DS (Data Structures and Algorithms), cs.NA (Numerical Analysis), math.NA (Numerical Analysis)Submission Date : November 20, 2025 to arXivPaper Link : https://arxiv.org/abs/2511.16570 This paper presents an algorithm that, for any invertible symmetric diagonally dominant M-matrix (SDDM, i.e., a principal submatrix of a graph Laplacian) L L L and non-negative vector b b b , computes entrywise approximate solutions to L x = b Lx = b Lx = b with high probability in time O ~ ( m ⋅ 2 O ( log n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log(U) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 )) , where m m m is the number of nonzeros and n n n is the system dimension. This is the first algorithm to solve SDDM systems in almost-linear time while providing entrywise approximation guarantees.
Core Problem : Solve the linear system L x = b Lx = b Lx = b where L L L is an SDDM matrix, requiring multiplicative entrywise approximation guarantees for each component of the solution vector x x x :
e − ϵ ( L − 1 b ) i ≤ x ~ i ≤ e ϵ ( L − 1 b ) i , ∀ i ∈ [ n ] e^{-\epsilon}(L^{-1}b)_i \leq \tilde{x}_i \leq e^{\epsilon}(L^{-1}b)_i, \quad \forall i \in [n] e − ϵ ( L − 1 b ) i ≤ x ~ i ≤ e ϵ ( L − 1 b ) i , ∀ i ∈ [ n ] Problem Importance :Graph Laplacian Systems have widespread applications in computer science, closely related to graph theory and random walksEntrywise Approximation is crucial when solution components span exponential rangesApplication scenarios include computing stationary distributions of Markov chains, flow problems in interior point methods, etc. Limitations of Existing Methods :Norm Error Bounds : Works like Spielman-Teng ST14 provide near-linear time algorithms but only guarantee energy norm or Euclidean norm error:
∥ x ^ − L † b ∥ L ≤ ϵ ∥ L † b ∥ L \|\hat{x} - L^\dagger b\|_L \leq \epsilon \|L^\dagger b\|_L ∥ x ^ − L † b ∥ L ≤ ϵ ∥ L † b ∥ L Exponential Precision Requirements : Since solution components may differ exponentially, norm errors cannot recover small components unless ϵ \epsilon ϵ is exponentially small (ϵ = O ( 1 / 2 n ) \epsilon = O(1/2^n) ϵ = O ( 1/ 2 n ) )Time Complexity Bottleneck : Requires O ( log ( 1 / ϵ ) ) O(\log(1/\epsilon)) O ( log ( 1/ ϵ )) iterations, with numerical precision needing log ( κ / ϵ ) \log(\kappa/\epsilon) log ( κ / ϵ ) bits, resulting in O ( m n 2 ) O(mn^2) O ( m n 2 ) bit operation complexityExisting Entrywise Algorithms : GNY26 proposes O ~ ( m n log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m\sqrt{n}\log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m n log 2 ( U ϵ − 1 δ − 1 )) time algorithm, but still superlinearResearch Motivation :Can SDDM systems be solved in almost-linear time while providing entrywise approximation guarantees? This paper provides an affirmative answer, reducing time complexity from O ( m n ) O(m\sqrt{n}) O ( m n ) to O ( m ⋅ n o ( 1 ) ) O(m \cdot n^{o(1)}) O ( m ⋅ n o ( 1 ) ) First Almost-Linear Time Entrywise Solver : Proposes an algorithm computing entrywise approximate solutions to SDDM systems in O ~ ( m ⋅ 2 O ( log n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log(U) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 )) bit operationsProbabilistic Distance Low-Diameter Cover :Defines probabilistic distance based on matrix inverse entries: D L ( i , j ) = − log n U ( ( L − 1 ) i j ) + 2 D_L(i,j) = -\log_{nU}((L^{-1})_{ij}) + 2 D L ( i , j ) = − log n U (( L − 1 ) ij ) + 2 Constructs ( r i n , r o u t , α ) (r_{in}, r_{out}, \alpha) ( r in , r o u t , α ) -covers with r i n = 2 O ( log n ) r_{in} = 2^{O(\sqrt{\log n})} r in = 2 O ( l o g n ) , r o u t = 2 Ω ( log n ) r_{out} = 2^{\Omega(\sqrt{\log n})} r o u t = 2 Ω ( l o g n ) , α = 2 O ( log n ) \alpha = 2^{O(\sqrt{\log n})} α = 2 O ( l o g n ) Proves existence and provides almost-linear time construction algorithm Improved Threshold Decay Framework :Extends the threshold decay framework of GNY26 Uses low-diameter covers for prediction, adaptively determining solution component scales Maintains active set size at n 1 + o ( 1 ) n^{1+o(1)} n 1 + o ( 1 ) through boundary-expanded sets Theoretical Guarantees : For ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n (non-exponentially small), the algorithm outputs with probability at least 1 − δ 1-\delta 1 − δ a solution satisfying x ~ ≈ ϵ L − 1 b \tilde{x} \approx_\epsilon L^{-1}b x ~ ≈ ϵ L − 1 b Input :
L ∈ Z n × n L \in \mathbb{Z}^{n \times n} L ∈ Z n × n : Invertible SDDM matrix with integer entries in [ − U , U ] [-U, U] [ − U , U ] , m m m nonzerosb ∈ Z n b \in \mathbb{Z}^n b ∈ Z n : Non-negative vector with integer entries in [ 0 , U ] [0, U] [ 0 , U ] ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n : Precision parameterδ ∈ ( 0 , 1 ) \delta \in (0,1) δ ∈ ( 0 , 1 ) : Failure probabilityOutput :
x ~ \tilde{x} x ~ : Satisfying entrywise approximation e − ϵ ( L − 1 b ) i ≤ x ~ i ≤ e ϵ ( L − 1 b ) i e^{-\epsilon}(L^{-1}b)_i \leq \tilde{x}_i \leq e^{\epsilon}(L^{-1}b)_i e − ϵ ( L − 1 b ) i ≤ x ~ i ≤ e ϵ ( L − 1 b ) i for all i ∈ [ n ] i \in [n] i ∈ [ n ] Elements represented with O ( log ( n U / ϵ ) ) O(\log(nU/\epsilon)) O ( log ( n U / ϵ )) bit floating point numbers Constraints :
Time Complexity: O ~ ( m ⋅ 2 O ( log n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log(U) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 )) bit operations Success Probability: At least 1 − δ 1-\delta 1 − δ Definition (Definition 2.1):
D L ( i , j ) : = − log n U ( ( L − 1 ) i j ) + 2 D_L(i,j) := -\log_{nU}((L^{-1})_{ij}) + 2 D L ( i , j ) := − log n U (( L − 1 ) ij ) + 2
Key Properties (Claim 2.2):
Symmetry : D L ( i , j ) = D L ( j , i ) D_L(i,j) = D_L(j,i) D L ( i , j ) = D L ( j , i ) Triangle Inequality : D L ( i , k ) ≤ D L ( i , j ) + D L ( j , k ) D_L(i,k) \leq D_L(i,j) + D_L(j,k) D L ( i , k ) ≤ D L ( i , j ) + D L ( j , k ) Small Adjacent Distance : If L i j ≠ 0 L_{ij} \neq 0 L ij = 0 , then D L ( i , j ) ≤ 4 D_L(i,j) \leq 4 D L ( i , j ) ≤ 4 Monotonicity (Lemma 2.3): For principal submatrix L S , S L_{S,S} L S , S , D L ( i , j ) ≤ D L S , S ( i , j ) D_L(i,j) \leq D_{L_{S,S}}(i,j) D L ( i , j ) ≤ D L S , S ( i , j ) Physical Interpretation : Through Lemma 1.4, ( L − 1 ) i j (L^{-1})_{ij} ( L − 1 ) ij relates to random walk escape probability:
P ( i , j , n + 1 ) = ( L − 1 ) i j L j j ( L − 1 ) j j P(i,j,n+1) = \frac{(L^{-1})_{ij}}{L_{jj}(L^{-1})_{jj}} P ( i , j , n + 1 ) = L jj ( L − 1 ) jj ( L − 1 ) ij
The probabilistic distance characterizes the logarithmic scale of random walk arrival probability from i i i to j j j .
Definition (Definition 2.4): A collection C = { ( V 1 , W 1 ) , … , ( V k , W k ) } \mathcal{C} = \{(V_1, W_1), \ldots, (V_k, W_k)\} C = {( V 1 , W 1 ) , … , ( V k , W k )} is an ( r i n , r o u t , α ) (r_{in}, r_{out}, \alpha) ( r in , r o u t , α ) -cover if:
Containment : V i ⊆ W i ⊆ [ n ] V_i \subseteq W_i \subseteq [n] V i ⊆ W i ⊆ [ n ] (inner ball V i V_i V i , outer ball W i W_i W i )Full Coverage : Every vertex is in at least one inner ballBounded Overlap : Every vertex is in at most α \alpha α outer ballsSmall Diameter : For any u , v ∈ W i u,v \in W_i u , v ∈ W i , D L ( u , v ) ≤ r i n D_L(u,v) \leq r_{in} D L ( u , v ) ≤ r in Large Gap : For any u ∈ V i , v ∈ [ n ] ∖ W i u \in V_i, v \in [n] \setminus W_i u ∈ V i , v ∈ [ n ] ∖ W i , D L ( u , v ) > r o u t D_L(u,v) > r_{out} D L ( u , v ) > r o u t Construction Algorithm (LowDiamConstruct, Figure 1):
Parameter Settings :
ℓ = ⌈ log n ⌉ + 3 \ell = \lceil\sqrt{\log n}\rceil + 3 ℓ = ⌈ log n ⌉ + 3 Distance thresholds: d i = 4 ℓ 2 i − 1 d_i = \frac{4\ell}{2^{i-1}} d i = 2 i − 1 4 ℓ , i ∈ [ ℓ ] i \in [\ell] i ∈ [ ℓ ] Sampling probabilities: p j = min { 1 n ⋅ 2 ℓ ( j − 1 ) , 1 } p_j = \min\{\frac{1}{n} \cdot 2^{\ell(j-1)}, 1\} p j = min { n 1 ⋅ 2 ℓ ( j − 1 ) , 1 } , j ∈ [ ℓ ] j \in [\ell] j ∈ [ ℓ ] Iteration count: B = 6 ⋅ 16 ℓ ⋅ ⌈ log ( n / δ ) ⌉ B = 6 \cdot 16^\ell \cdot \lceil\log(n/\delta)\rceil B = 6 ⋅ 1 6 ℓ ⋅ ⌈ log ( n / δ )⌉ Algorithm Flow :
For each pair (d_i, p_j), repeat B times:
1. Independently sample set S ⊆ [n] with probability p_j
2. Solve Lx = 1_S, obtaining approximate solution y with norm error M^{-2d_i}
3. Let T = {k: y_k ≥ M^{-d_i/2}}, construct induced subgraph G_T
4. For each v ∈ S, if its connected component C_v is disjoint from other S vertices:
- Inner ball: V = {k ∈ C_v: y_k ≥ M^{-d_i/4} - M^{-2d_i}}
- Outer ball: W = {k ∈ C_v: y_k ≥ M^{-(d_i/2)+2}}
- Add (V,W) to cover C
Key Idea :
For each vertex u u u , there exists appropriate ( d i , p j ) (d_i, p_j) ( d i , p j ) such that:
S S S contains exactly one vertex near u u u with moderate probabilityThat vertex's connected component is separated from other S S S vertices Inner/outer ball pairs are recovered from large solution elements Complexity (Theorem 2.1):
Time: m ⋅ 2 O ( log n ) log ( U ) log ( δ − 1 ) log ( U δ − 1 ) m \cdot 2^{O(\sqrt{\log n})} \log(U) \log(\delta^{-1}) \log(U\delta^{-1}) m ⋅ 2 O ( l o g n ) log ( U ) log ( δ − 1 ) log ( U δ − 1 ) bit operations Parameters: r i n = 2 2 ℓ + 1 r_{in} = 2^{2\ell+1} r in = 2 2 ℓ + 1 , r o u t = 2 ℓ − 2 r_{out} = 2^{\ell-2} r o u t = 2 ℓ − 2 , α = 6 ℓ 2 ⋅ 16 ℓ ⋅ ⌈ log ( n / δ ) ⌉ \alpha = 6\ell^2 \cdot 16^\ell \cdot \lceil\log(n/\delta)\rceil α = 6 ℓ 2 ⋅ 1 6 ℓ ⋅ ⌈ log ( n / δ )⌉ Basic Framework (ThresholdDecay, Figure 2):
Maintains partition of three disjoint sets [ n ] = P ( t ) ∪ Q ( t ) ∪ R ( t ) [n] = P^{(t)} \cup Q^{(t)} \cup R^{(t)} [ n ] = P ( t ) ∪ Q ( t ) ∪ R ( t ) :
P ( t ) P^{(t)} P ( t ) : Solved set (entrywise approximation computed)Q ( t ) Q^{(t)} Q ( t ) : Inactive set (deemed sufficiently small to ignore)R ( t ) R^{(t)} R ( t ) : Active set (current linear system participants)Iteration Process :
Initialize: S^(0) = [n], b̂^(0) = b, x̃^(0) = 0
For t = 0 to T:
1. Solve subsystem: L_{S^(t),S^(t)} x = b̂^(t), obtain norm approximate x̂^(t)
2. Compute threshold: θ_t = (smallest power of 2 > 1/(4(nU)^2) ||b̂^(t)||_1)
3. Extract large elements: F^(t) = {i: x̂^(t)_i ≥ θ_t}
4. Update solved set: S^(t+1) = S^(t) \ F^(t)
5. Update right-hand side: b̂^(t+1) ≈_{ε/(8T)} b_{S^(t+1)} - L_{S^(t+1),[n]\S^(t+1)} x̃^(t+1)
Key Properties (Lemma 3.1):
Threshold decay: ∣ ∣ b ^ ( t ) ∣ ∣ 1 ≤ ( n U ) − t ∣ ∣ b ∣ ∣ 1 ||b̂^{(t)}||_1 \leq (nU)^{-t} ||b||_1 ∣∣ b ^ ( t ) ∣ ∣ 1 ≤ ( n U ) − t ∣∣ b ∣ ∣ 1 Entrywise precision: x ~ ( t ) ≈ ϵ t / ( 4 T ) x [ n ] ∖ S ( t ) \tilde{x}^{(t)} \approx_{\epsilon t/(4T)} x_{[n]\setminus S^{(t)}} x ~ ( t ) ≈ ϵ t / ( 4 T ) x [ n ] ∖ S ( t ) Small element guarantee: For i ∈ S ( t + 1 ) i \in S^{(t+1)} i ∈ S ( t + 1 ) , x i < ( n U ) − 2 ∣ ∣ b ^ ( t ) ∣ ∣ 1 x_i < (nU)^{-2} ||b̂^{(t)}||_1 x i < ( n U ) − 2 ∣∣ b ^ ( t ) ∣ ∣ 1 Paper's Innovation: Using Low-Diameter Covers for Prediction
Boundary-Expanded Set (Definition 3.4):
Exp C ( I ) : = { u ∈ [ n ] : ∃ ( V i , W i ) ∈ C , u ∈ V i ∧ ∣ W i ∩ I ∣ > 0 } \text{Exp}_{\mathcal{C}}(I) := \{u \in [n]: \exists (V_i, W_i) \in \mathcal{C}, u \in V_i \land |W_i \cap I| > 0\} Exp C ( I ) := { u ∈ [ n ] : ∃ ( V i , W i ) ∈ C , u ∈ V i ∧ ∣ W i ∩ I ∣ > 0 }
Equivalent definition:
Exp C ( I ) = ⋃ ( V i , W i ) ∈ C : ∣ W i ∩ I ∣ > 0 V i \text{Exp}_{\mathcal{C}}(I) = \bigcup_{(V_i,W_i) \in \mathcal{C}: |W_i \cap I| > 0} V_i Exp C ( I ) = ⋃ ( V i , W i ) ∈ C : ∣ W i ∩ I ∣ > 0 V i
Partial Solver (PartialSolve, Figure 3):
At iteration t t t :
I ( t ) = { u ∈ S ( t ) : b ^ u ( t ) > 0 } I^{(t)} = \{u \in S^{(t)}: \hat{b}^{(t)}_u > 0\} I ( t ) = { u ∈ S ( t ) : b ^ u ( t ) > 0 } (support of right-hand side vector)H ( t ) = Exp C ( I ( t ) ) ∩ S ( t ) H^{(t)} = \text{Exp}_{\mathcal{C}}(I^{(t)}) \cap S^{(t)} H ( t ) = Exp C ( I ( t ) ) ∩ S ( t ) (boundary-expanded active set)Only solve on H ( t ) H^{(t)} H ( t ) :
L H ( t ) , H ( t ) x = b ^ H ( t ) ( t ) L_{H^{(t)},H^{(t)}} x = \hat{b}^{(t)}_{H^{(t)}} L H ( t ) , H ( t ) x = b ^ H ( t ) ( t )
Correctness Guarantee (Lemma 3.5):
For u ∈ S ( t ) ∖ H ( t ) u \in S^{(t)} \setminus H^{(t)} u ∈ S ( t ) ∖ H ( t ) , D L S ( t ) , S ( t ) ( u , v ) > r o u t D_{L_{S^{(t)},S^{(t)}}}(u, v) > r_{out} D L S ( t ) , S ( t ) ( u , v ) > r o u t for all v ∈ I ( t ) v \in I^{(t)} v ∈ I ( t ) Applying Corollary 3.3, error from ignoring S ( t ) ∖ H ( t ) S^{(t)} \setminus H^{(t)} S ( t ) ∖ H ( t ) is ( n U ) − r o u t + 5 ∣ ∣ b ^ ( t ) ∣ ∣ 2 (nU)^{-r_{out}+5} ||\hat{b}^{(t)}||_2 ( n U ) − r o u t + 5 ∣∣ b ^ ( t ) ∣ ∣ 2 Setting r o u t ≥ 5 + log n U ( 2 / ϵ L ) r_{out} \geq 5 + \log_{nU}(2/\epsilon_L) r o u t ≥ 5 + log n U ( 2/ ϵ L ) ensures error ≤ ϵ L ∣ ∣ b ^ ( t ) ∣ ∣ 2 \leq \epsilon_L ||\hat{b}^{(t)}||_2 ≤ ϵ L ∣∣ b ^ ( t ) ∣ ∣ 2 SDDMSolve Algorithm (Figure 4):
Input: L, b, ε, δ
Output: x̃ ≈_ε L^{-1}b
1. Construct low-diameter cover:
C ← LowDiamConstruct(L, δ/2)
2. Run improved threshold decay:
Initialize: S^(0) = [n], b̂^(0) = b, I^(0), H^(0)
For t = 0 to n:
a. Partial solve:
x̂^(t) ← PartialSolve(L, C, S^(t), b̂^(t), ε_L, δ/(2n), H^(t), I^(t))
b-d. Extract large elements, update solved set (only on H^(t))
e. Efficiently maintain vectors:
- Use incremental update v̂^(t+1) ← v̂^(t)_{S^(t+1)} - L_{S^(t+1),F^(t)} x̂^(t)_{F^(t)}
- Implicitly represent b̂^(t+1) := b_{S^(t+1)} + v̂^(t+1)
f. Maintain I^(t) and H^(t):
- I^(t) through tracking nonzeros of b̂^(t)
- H^(t) through counter maintenance of boundary-expanded set
3. Return x̃
Introduction of Probabilistic Distance :Connects random walk theory with matrix inverse Satisfies triangle inequality, suitable for metric space construction Monotonicity ensures subsystem distances don't decrease Random Sampling Cover Construction :Simultaneously constructs multiple balls by solving random right-hand sides 1 S 1_S 1 S Exponential distance thresholds and sampling probabilities ensure separation Repeated sampling boosts success probability to 1 − δ 1-\delta 1 − δ Boundary-Expanded Prediction :Uses cover properties to predict large element locations Avoids explicit distance computation for all pairs Maintains total active set size at n 1 + o ( 1 ) n^{1+o(1)} n 1 + o ( 1 ) Efficient Data Structure Maintenance :Incremental right-hand side updates (O(m+n) total updates) Counter-based boundary-expanded set maintenance Segment tree for norm maintenance (avoids numerical cancellation) Core Lemma (Lemma 3.6):
∑ t = 0 T 1 [ u ∈ H ( t ) ] ≤ r i n = 2 O ( log n ) \sum_{t=0}^T \mathbb{1}[u \in H^{(t)}] \leq r_{in} = 2^{O(\sqrt{\log n})} ∑ t = 0 T 1 [ u ∈ H ( t ) ] ≤ r in = 2 O ( l o g n )
Proof Sketch :
When u u u joins H ( t ) H^{(t)} H ( t ) , there exists v ∈ I ( t ) v \in I^{(t)} v ∈ I ( t ) with D L ( u , v ) ≤ r i n D_L(u,v) \leq r_{in} D L ( u , v ) ≤ r in By Lemma 3.8, x u ≥ x v ⋅ ( n U ) − r i n ≥ ( n U ) − r i n − 3 ∣ ∣ b ^ ( t − 1 ) ∣ ∣ 1 x_u \geq x_v \cdot (nU)^{-r_{in}} \geq (nU)^{-r_{in}-3} ||\hat{b}^{(t-1)}||_1 x u ≥ x v ⋅ ( n U ) − r in ≥ ( n U ) − r in − 3 ∣∣ b ^ ( t − 1 ) ∣ ∣ 1 If u u u survives more than r i n + 1 r_{in}+1 r in + 1 iterations, then x u < ( n U ) − 3 − r i n ∣ ∣ b ^ ( t − 1 ) ∣ ∣ 1 x_u < (nU)^{-3-r_{in}} ||\hat{b}^{(t-1)}||_1 x u < ( n U ) − 3 − r in ∣∣ b ^ ( t − 1 ) ∣ ∣ 1 (contradiction) Therefore each vertex appears in active sets for at most r i n r_{in} r in iterations Corollary :
∑ t = 0 T ∣ H ( t ) ∣ ≤ n ⋅ r i n = n ⋅ 2 O ( log n ) \sum_{t=0}^T |H^{(t)}| \leq n \cdot r_{in} = n \cdot 2^{O(\sqrt{\log n})} ∑ t = 0 T ∣ H ( t ) ∣ ≤ n ⋅ r in = n ⋅ 2 O ( l o g n )
Complexity of Each Step :
Low-Diameter Cover Construction (Step 1):Iterations: ℓ 2 B = 2 O ( log n ) log ( n / δ ) \ell^2 B = 2^{O(\sqrt{\log n})} \log(n/\delta) ℓ 2 B = 2 O ( l o g n ) log ( n / δ ) Per solve: O ~ ( m log ( M 4 ℓ ) log ( U M 4 ℓ δ − 1 ℓ 2 B ) ) \tilde{O}(m \log(M^{4\ell}) \log(UM^{4\ell}\delta^{-1}\ell^2B)) O ~ ( m log ( M 4 ℓ ) log ( U M 4 ℓ δ − 1 ℓ 2 B )) Total: m ⋅ 2 O ( log n ) log ( U ) log ( δ − 1 ) log ( U δ − 1 ) m \cdot 2^{O(\sqrt{\log n})} \log(U) \log(\delta^{-1}) \log(U\delta^{-1}) m ⋅ 2 O ( l o g n ) log ( U ) log ( δ − 1 ) log ( U δ − 1 ) Partial Solves (Step 2a):Iteration t t t solves on H ( t ) H^{(t)} H ( t ) with sparsity nnz ( L H ( t ) , H ( t ) ) \text{nnz}(L_{H^{(t)},H^{(t)}}) nnz ( L H ( t ) , H ( t ) ) Per-iteration complexity: O ~ ( nnz ( L H ( t ) , H ( t ) ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(\text{nnz}(L_{H^{(t)},H^{(t)}}) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( nnz ( L H ( t ) , H ( t ) ) log 2 ( U ϵ − 1 δ − 1 )) Total sparsity:
∑ t = 0 T nnz ( L H ( t ) , H ( t ) ) ≤ ∑ u ∈ [ n ] deg ( u ) ⋅ ∑ t = 0 T 1 [ u ∈ H ( t ) ] ≤ m ⋅ 2 O ( log n ) \sum_{t=0}^T \text{nnz}(L_{H^{(t)},H^{(t)}}) \leq \sum_{u \in [n]} \deg(u) \cdot \sum_{t=0}^T \mathbb{1}[u \in H^{(t)}] \leq m \cdot 2^{O(\sqrt{\log n})} ∑ t = 0 T nnz ( L H ( t ) , H ( t ) ) ≤ ∑ u ∈ [ n ] deg ( u ) ⋅ ∑ t = 0 T 1 [ u ∈ H ( t ) ] ≤ m ⋅ 2 O ( l o g n ) Total: O ~ ( m ⋅ 2 O ( log n ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log 2 ( U ϵ − 1 δ − 1 )) Extract Large Elements (Steps 2b-d):Only iterate on H ( t ) H^{(t)} H ( t ) , total ∑ t ∣ H ( t ) ∣ = n ⋅ 2 O ( log n ) \sum_t |H^{(t)}| = n \cdot 2^{O(\sqrt{\log n})} ∑ t ∣ H ( t ) ∣ = n ⋅ 2 O ( l o g n ) Complexity: O ~ ( n ⋅ 2 O ( log n ) ) \tilde{O}(n \cdot 2^{O(\sqrt{\log n})}) O ~ ( n ⋅ 2 O ( l o g n ) ) Vector and Set Maintenance (Step 2e-f):Vector updates: O(m+n) total updates (Claim 3.9) Norm maintenance: O ( ( n + m ) log ( n U / ϵ ) log n ) O((n+m)\log(nU/\epsilon)\log n) O (( n + m ) log ( n U / ϵ ) log n ) (Claim 3.10) I ( t ) I^{(t)} I ( t ) maintenance: O(m+n)H ( t ) H^{(t)} H ( t ) maintenance: n ⋅ 2 O ( log n ) n \cdot 2^{O(\sqrt{\log n})} n ⋅ 2 O ( l o g n ) (Claim 3.12)Total Time Complexity (Theorem 1.1):
O ~ ( m ⋅ 2 O ( log n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log(U) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ))
Main Theorem (Theorem 1.1): For ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n , the algorithm outputs with probability at least 1 − δ 1-\delta 1 − δ a solution x ~ ≈ ϵ L − 1 b \tilde{x} \approx_\epsilon L^{-1}b x ~ ≈ ϵ L − 1 b .
Proof Elements :
Cover Correctness (Theorem 2.1):Failure probability ≤ δ / 2 \leq \delta/2 ≤ δ /2 Parameters satisfy r o u t = 2 Ω ( log n ) ≥ 9 + log n U ( 1 / ϵ ) r_{out} = 2^{\Omega(\sqrt{\log n})} \geq 9 + \log_{nU}(1/\epsilon) r o u t = 2 Ω ( l o g n ) ≥ 9 + log n U ( 1/ ϵ ) Partial Solve Correctness (Lemma 3.5):Each iteration failure probability ≤ δ / ( 2 n ) \leq \delta/(2n) ≤ δ / ( 2 n ) Total failure probability over n n n iterations ≤ δ / 2 \leq \delta/2 ≤ δ /2 (union bound) Threshold Decay Correctness (Theorem 3.1):After T = n T=n T = n iterations, x ~ ≈ ϵ L − 1 b \tilde{x} \approx_\epsilon L^{-1}b x ~ ≈ ϵ L − 1 b Union Bound : Total failure probability ≤ δ / 2 + δ / 2 = δ \leq \delta/2 + \delta/2 = \delta ≤ δ /2 + δ /2 = δ
Note : This is a purely theoretical work with no experimental section. All results are from theoretical analysis and proofs.
Spielman-Teng ST14 : First near-linear time algorithm, energy norm error O ( m log n poly ( log ( U ϵ − 1 ) ) ) O(m \log n \text{poly}(\log(U\epsilon^{-1}))) O ( m log n poly ( log ( U ϵ − 1 ))) Simplified Algorithms : KOSZ13, LS13, KS16 simplify algorithm designImproved Polylog Factors : KMST10, KMP11, LS13, PS14, KMP14, CKM+14, JS21, KS16 Parallel Algorithms : BGK+11, PS14, KLP+16 polylog depth, near-linear workLimitations : All these algorithms only provide norm error bounds, cannot recover small components (unless ϵ \epsilon ϵ is exponentially small)
Early Work (1980-1990s) :
Higham Hig90 : Fast matrix multiplication (e.g., Strassen) cannot achieve entrywise approximationNumerical Stability : Hey87, GTH85, HP84 empirical studies show Gaussian elimination more stable for SDDM matricesTheoretical Analysis : O'C93, O'C96 analyze entrywise error in stationary distribution computation and LU factorizationTime Complexity : All algorithms require cubic time O ( n 3 ) O(n^3) O ( n 3 ) Modern Work :
GNY26 :
SDDM system solving: O ~ ( m n log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m\sqrt{n}\log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m n log 2 ( U ϵ − 1 δ − 1 )) SDDM matrix inversion: O ~ ( m n log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(mn\log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( mn log 2 ( U ϵ − 1 δ − 1 )) Uses threshold decay framework and adjacent vertex prediction Paper's Improvement :
Reduces from O ( m n ) O(m\sqrt{n}) O ( m n ) to O ( m ⋅ n o ( 1 ) ) O(m \cdot n^{o(1)}) O ( m ⋅ n o ( 1 ) ) Key: Low-diameter covers enable more precise global prediction Traditional Methods :
Coh98, BGK+11 : Used for shortest paths and parallel SDDM solversDifferences :
Traditional: Single cluster/ball, small diameter This paper: Inner-outer ball pairs, large inner-outer gap Distance Definition :
Traditional: Graph distance (weighted/unweighted) This paper: Probabilistic distance D L ( i , j ) = − log n U ( ( L − 1 ) i j ) + 2 D_L(i,j) = -\log_{nU}((L^{-1})_{ij}) + 2 D L ( i , j ) = − log n U (( L − 1 ) ij ) + 2 Access Restriction : This paper indirectly queries distances by solving random right-hand sidesGKS23 : Superior practical performance of near-linear time Laplacian solversPotential Applications : Scenarios with large edge weight variations (e.g., flow problems in interior point methods)Theoretical Breakthrough : First algorithm solving SDDM systems in almost-linear time O ~ ( m ⋅ n o ( 1 ) ) \tilde{O}(m \cdot n^{o(1)}) O ~ ( m ⋅ n o ( 1 ) ) with entrywise approximation guaranteesTechnical Contributions :Probabilistic distance metric space Existence and almost-linear time construction of low-diameter covers Improved threshold decay framework with boundary-expanded prediction Complexity Bounds :Time: O ~ ( m ⋅ 2 O ( log n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 ) ) \tilde{O}(m \cdot 2^{O(\sqrt{\log n})} \log(U) \log^2(U\epsilon^{-1}\delta^{-1})) O ~ ( m ⋅ 2 O ( l o g n ) log ( U ) log 2 ( U ϵ − 1 δ − 1 )) bit operations Precision: ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n (non-exponentially small) Success Probability: ≥ 1 − δ \geq 1-\delta ≥ 1 − δ Quasi-Linear Factor : 2 O ( log n ) = n o ( 1 ) 2^{O(\sqrt{\log n})} = n^{o(1)} 2 O ( l o g n ) = n o ( 1 ) while subpolynomial, still non-constantPrecision Requirement : ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n , while non-exponentially small, still has lower boundInput Representation :Current algorithm targets fixed-point input Floating-point input may involve shortest path problem reduction, more challenging Constant Factors : Constants in 2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) not optimized, potentially large in practiceTheoretical Work : No experimental verification or practical performance assessmentTruly Near-Linear Time : Can we achieve O ( m ⋅ polylog ( n ) ) O(m \cdot \text{polylog}(n)) O ( m ⋅ polylog ( n )) ?Requires improving low-diameter cover parameters r i n , α = O ( polylog ( n ) ) r_{in}, \alpha = O(\text{polylog}(n)) r in , α = O ( polylog ( n )) Or designing new frameworks not relying on covers Floating-Point Input : Study algorithms for floating-point represented inputNeed to avoid shortest path problem hardness May require new techniques and assumptions Practical Algorithms :Optimize constant factors Implementation and experimental evaluation Integration with existing solvers (e.g., GKS23 ) Extension to RDDL Matrices : Currently focuses on SDDM (symmetric), can we generalize to general RDDL?Parallel Algorithms : Design polylog-depth parallel versionsApplication Exploration : Applications in interior point methods, Markov chains, and other practical problemsSignificant Theoretical Impact :First algorithm solving entrywise approximation in almost-linear time Breaks through GNY26 's O ( m n ) O(m\sqrt{n}) O ( m n ) bottleneck Reduces problem complexity from n 1.5 n^{1.5} n 1.5 to n 1 + o ( 1 ) n^{1+o(1)} n 1 + o ( 1 ) Strong Technical Innovation :Probabilistic Distance : Cleverly connects random walks and matrix inverse, satisfies triangle inequalityLow-Diameter Cover : Elegant inner-outer ball design, balances coverage and separationRandom Construction : Queries distances indirectly via solving random right-hand sides, avoids O ( n 2 ) O(n^2) O ( n 2 ) explicit computationsBoundary-Expanded Prediction : Prediction technique controls total active set size at n 1 + o ( 1 ) n^{1+o(1)} n 1 + o ( 1 ) Rigorous Theoretical Analysis :Complete correctness proofs Detailed time complexity analysis Precise probabilistic analysis (high probability guarantees) Careful bit complexity analysis (accounts for numerical precision) Clear Presentation :Technical overview section provides intuitive explanation of core ideas Well-organized definitions and lemmas Clear algorithm pseudocode Logical proof structure Generality :Applies to all invertible SDDM matrices Flexible parameter settings (precision ϵ \epsilon ϵ , failure probability δ \delta δ ) Unknown Practical Performance :Purely theoretical work, no experimental verification Constants in 2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) potentially large May underperform O ( m n ) O(m\sqrt{n}) O ( m n ) algorithm for moderate-sized n n n in practice Quasi-Linear Factor :2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) while n o ( 1 ) n^{o(1)} n o ( 1 ) , still significant growthFor n = 2 20 n=2^{20} n = 2 20 , log n ≈ 4.5 \sqrt{\log n} \approx 4.5 log n ≈ 4.5 , 2 4.5 ≈ 22.6 2^{4.5} \approx 22.6 2 4.5 ≈ 22.6 Still gap from truly near-linear O ( polylog ( n ) ) O(\text{polylog}(n)) O ( polylog ( n )) Precision Limitation :ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n while non-exponentially small, still has lower boundMay not apply for extremely small ϵ \epsilon ϵ (e.g., ϵ = 1 / poly ( n ) \epsilon = 1/\text{poly}(n) ϵ = 1/ poly ( n ) ) Input Restrictions :Only handles fixed-point representation Floating-point input problem unresolved Technical Complexity :Algorithm implementation complex (nested loops, data structure maintenance) Low-diameter cover construction requires solving O ( ℓ 2 B ) = 2 O ( log n ) log ( n / δ ) O(\ell^2 B) = 2^{O(\sqrt{\log n})} \log(n/\delta) O ( ℓ 2 B ) = 2 O ( l o g n ) log ( n / δ ) linear systems Engineering implementation challenging Open Theoretical Questions :Does O ( m ⋅ polylog ( n ) ) O(m \cdot \text{polylog}(n)) O ( m ⋅ polylog ( n )) algorithm exist? What are the lower bounds? Academic Contribution :Important impact on algorithmic theory Advances SDDM system solving theory Introduces new technical tools (probabilistic distance, low-diameter covers) Follow-up Research :May inspire further improvements (e.g., optimizing 2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) factor) Techniques potentially applicable to other matrix classes (RDDL, M-matrices) Low-diameter covers may find applications in other problems Practical Value :Short-term may underperform existing algorithms Long-term potential through optimization and implementation Potential advantage for very large-scale problems (n n n extremely large) Theoretical Completeness :Essentially answers "is almost-linear time entrywise approximation possible?" Provides new benchmark for the field Large-Scale Sparse Systems :Very large n n n (e.g., n > 10 6 n > 10^6 n > 1 0 6 ) m ≈ O ( n ) m \approx O(n) m ≈ O ( n ) (sparse graphs)2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) factor relatively smallLarge Edge Weight Variation :Edge weights span multiple orders of magnitude Solution components differ exponentially Norm error bounds insufficient High Precision Requirements :Need accurate approximation of all components ϵ \epsilon ϵ not too small (ϵ > ( n U ) − 2 log n \epsilon > (nU)^{-2\sqrt{\log n}} ϵ > ( n U ) − 2 l o g n )Theoretical Research :Use as subroutine in other algorithms Theoretical complexity analysis Inapplicable Scenarios :Small-scale problems (n < 10 4 n < 10^4 n < 1 0 4 ): O ( m n ) O(m\sqrt{n}) O ( m n ) or cubic algorithms may be faster Only need norm error: use classical algorithms like ST14 Need extremely small ϵ \epsilon ϵ : may exceed precision range Floating-point input: current algorithm doesn't support ST14 Spielman & Teng (2014): Near-linear time SDDM solver, norm errorGNY26 Ghadiri, Nguyen & Yang (2026): O ( m n ) O(m\sqrt{n}) O ( m n ) entrywise approximation algorithm, threshold decay frameworkKOSZ13 Kelner et al. (2013): Combinatorial algorithm simplificationKS16 Kyng & Sachdeva (2016): Approximate Gaussian eliminationBGK+11 Blelloch et al. (2011): Parallel solver, low-diameter decompositionHig90 Higham (1990): Fast matrix multiplication and entrywise approximationO'C93, O'C96 O'Cinneide (1993, 1996): Markov chains, entrywise error in LU factorizationStr69 Strassen (1969): Fast matrix multiplicationGKS23 Gao, Kyng & Spielman (2023): Practical performance of Laplacian solversThis paper achieves important theoretical breakthrough, realizing the first almost-linear time entrywise approximation algorithm for SDDM systems. Core innovations lie in introducing probabilistic distance and low-diameter covers, using clever random sampling construction and boundary-expanded prediction to control total active set size at n 1 + o ( 1 ) n^{1+o(1)} n 1 + o ( 1 ) . Despite the quasi-linear factor 2 O ( log n ) 2^{O(\sqrt{\log n})} 2 O ( l o g n ) and lack of experimental verification, this work holds significant algorithmic-theoretic importance and provides new technical tools and research directions for future work. For large-scale sparse systems and applications with dramatically varying edge weights, this algorithm has potential practical value.