Constant Degree Networks for Almost-Everywhere Reliable Transmission
Bafna, Minzer
In the almost-everywhere reliable message transmission problem, introduced by [Dwork, Pippenger, Peleg, Upfal'86], the goal is to design a sparse communication network $G$ that supports efficient, fault-tolerant protocols for interactions between all node pairs. By fault-tolerant, we mean that that even if an adversary corrupts a small fraction of vertices in $G$, then all but a small fraction of vertices can still communicate perfectly via the constructed protocols. Being successful to do so allows one to simulate, on a sparse graph, any fault-tolerant distributed computing task and secure multi-party computation protocols built for a complete network, with only minimal overhead in efficiency. Previous works on this problem achieved either constant-degree networks tolerating $o(1)$ faults, constant-degree networks tolerating a constant fraction of faults via inefficient protocols (exponential work complexity), or poly-logarithmic degree networks tolerating a constant fraction of faults.
We show a construction of constant-degree networks with efficient protocols (i.e., with polylogarithmic work complexity) that can tolerate a constant fraction of adversarial faults, thus solving the main open problem of Dwork et al.. Our main contribution is a composition technique for communication networks, based on graph products. Our technique combines two networks tolerant to adversarial edge-faults to construct a network with a smaller degree while maintaining efficiency and fault-tolerance. We apply this composition result multiple times, using the polylogarithmic-degree edge-fault tolerant networks constructed in a recent work of [Bafna, Minzer, Vyas'24] (that are based on high-dimensional expanders) with itself, and then with the constant-degree networks (albeit with inefficient protocols) of [Upfal'92].
academic
Constant Degree Networks for Almost-Everywhere Reliable Transmission
This paper investigates the "almost-everywhere reliable message transmission" problem originally proposed by Dwork et al. in 1986. The objective is to design a sparse communication network G that supports efficient, fault-tolerant protocol interactions between node pairs. Even when an adversary corrupts a small fraction of vertices in G, all but a negligible fraction of vertices can communicate perfectly through the constructed protocol. Successfully solving this problem enables the simulation of any fault-tolerant distributed computing task and secure multi-party computation protocol designed for complete networks on sparse graphs with minimal efficiency overhead.
Previous research either achieved constant-degree networks tolerating o(1) faults, or achieved constant-degree networks tolerating constant-fraction faults through inefficient protocols (exponential work complexity), or achieved polylogarithmic-degree networks tolerating constant-fraction faults. This paper demonstrates a constant-degree network construction with an efficient protocol (polylogarithmic work complexity) that tolerates constant-fraction adversarial faults, thereby resolving the main open problem posed by Dwork et al.
Practical demands of distributed computing: Computational tasks in modern large-scale networks typically require distributed execution across multiple machines, including Byzantine consensus, collective coin-flipping, and secure multi-party computation tasks such as poker.
Unrealistic nature of complete connectivity assumptions: Most fault-tolerant protocols assume every machine can communicate directly with all other machines, which is impractical in modern large-scale sparse networks.
Challenges of sparse networks: In sparse networks, if node degree d is much smaller than the number of faulty nodes t, some honest nodes may become isolated because all their neighbors are corrupted.
Theoretical significance: Resolves a fundamental problem in distributed computing theory, bridging theoretical models and practical network constraints
Practical value: Provides theoretical foundations for large-scale distributed systems, particularly in blockchain and distributed storage domains
Security guarantees: Maintains network communication capability in adversarial environments, which is crucial for network security
Resolves the main open problem: Constructs the first network simultaneously achieving constant degree, polylogarithmic work complexity, and constant fault tolerance rate
Proposes combinatorial techniques: Communication network combinatorial techniques based on graph products that reduce network degree while maintaining efficiency and fault tolerance
Establishes theoretical framework: Provides theoretical foundations for network combination under edge failure models
Achieves parameter optimization: Attains ideal targets across all critical parameters (degree, work complexity, fault tolerance rate)
Theorem 1.1: There exists a constant D such that for all ε > 0 and sufficiently large n, there exists a D-regular graph G with Θ(n) vertices and protocol collection R = {R(u,v)} satisfying:
Work complexity: polylog n
Round complexity: Õ(log n)
Fault tolerance: Under ε-fraction edge faults, at most poly(ε)-fraction vertices are doomed to fail
Lemma 1.2 (Permutation Model): There exists a constant D such that for all permutations π, graph G admits a routing protocol with (ε³², O(ε))-edge fault tolerance.
LSV05a,b: Algebraic constructions of high-dimensional expanders
Through innovative graph product combination techniques, this paper successfully resolves an important open problem in distributed computing with significant theoretical breakthrough. While there remains room for improvement in practical applicability, it establishes a solid theoretical foundation for future research and applications.