Rademacher Meets Colors: More Expressivity, but at What Cost ?
Carrasco, Netto, Martirosyan et al.
The expressive power of graph neural networks (GNNs) is typically understood through their correspondence with graph isomorphism tests such as the Weisfeiler-Leman (WL) hierarchy. While more expressive GNNs can distinguish a richer set of graphs, they are also observed to suffer from higher generalization error. This work provides a theoretical explanation for this trade-off by linking expressivity and generalization through the lens of coloring algorithms. Specifically, we show that the number of equivalence classes induced by WL colorings directly bounds the GNNs Rademacher complexity -- a key data-dependent measure of generalization. Our analysis reveals that greater expressivity leads to higher complexity and thus weaker generalization guarantees. Furthermore, we prove that the Rademacher complexity is stable under perturbations in the color counts across different samples, ensuring robustness to sampling variability across datasets. Importantly, our framework is not restricted to message-passing GNNs or 1-WL, but extends to arbitrary GNN architectures and expressivity measures that partition graphs into equivalence classes. These results unify the study of expressivity and generalization in GNNs, providing a principled understanding of why increasing expressive power often comes at the cost of generalization.
academic
Rademacher Meets Colors: More Expressivity, but at What Cost?
The expressive power of Graph Neural Networks (GNNs) is typically understood through its correspondence with graph isomorphism tests, such as the Weisfeiler-Leman hierarchy. While more expressive GNNs can distinguish richer sets of graphs, they also exhibit higher generalization error. This work connects expressivity with generalization capability through the lens of coloring algorithms, providing a theoretical explanation for this trade-off. Specifically, the authors prove that the number of equivalence classes induced by WL coloring directly bounds the Rademacher complexity of GNNs—a key data-dependent generalization measure. The analysis reveals that stronger expressivity leads to higher complexity, resulting in weaker generalization guarantees. Furthermore, the authors establish the stability of Rademacher complexity under color count perturbations across different samples. Importantly, the framework extends beyond message-passing GNNs or 1-WL to arbitrary GNN architectures and expressivity measures that partition graphs into equivalence classes.
This research addresses a fundamental theoretical question in the GNN field: the trade-off between expressive power and generalization capability. While empirical observations suggest that more expressive GNNs often exhibit worse generalization performance, rigorous theoretical explanations are lacking.
Missing Theoretical Foundation: Existing research primarily focuses on analyzing GNN expressivity, but lacks sufficient theoretical understanding of its relationship with generalization capability
Practical Guidance Value: Understanding this trade-off is crucial for designing GNN architectures that possess sufficient expressivity while maintaining good generalization
Need for Unified Framework: A unified theoretical framework is needed to explain the generalization behavior of different GNN architectures
Morris et al.'s VC Dimension Analysis: Applicable only to specific activation functions and bounded graphs, depending on parameter count rather than structural properties
Garg et al.'s Rademacher Complexity: While providing tighter bounds, it does not explore connections with WL coloring distributions
Lack of Generality: Existing analyses are mostly limited to specific GNN architectures or 1-WL tests
Establishing Expressivity-Generalization Theoretical Connection: First direct linkage between GNN expressivity and Rademacher complexity through coloring algorithms
Providing Precise Complexity Bounds: Proves that the Rademacher complexity upper bound is p/m, where p is the number of equivalence classes
Establishing Stability Guarantees: Establishes Lipschitz continuity of Rademacher complexity under color count perturbations
Designing Universal Framework: Extends to arbitrary GNN architectures and corresponding coloring algorithms, not limited to message-passing GNNs or 1-WL
Improving Dudley Integral Bounds: Provides tighter covering number bounds utilizing p-dimensional structure
Coloring algorithms partition sample S into p disjoint sets I1,…,Ip, where each Ij contains all graphs with the same color cj. This partition imposes structural constraints on the function class: any function implementable by the architecture must remain constant on equivalence classes.
Proposition 3.1 (Core Bound):
For function class F, if for each f∈F, graphs with identical 1-WL colors have identical outputs, then the empirical Rademacher complexity bound is:
RS(F)≤msupΘL(Θ)p
where L(Θ)=∑i=1mf(Gi;Θ)2 is the ℓ2 norm of function outputs.
Corollary 3.2 (Bounded Output Case):
When f:G→[−1,1]:
Proposition 3.4 (Stability Guarantee):
For two samples S and S′ of size m, if the count difference for each color cj across the two samples is at most ϵj:
∣RS(F)−RS′(F)∣≤m∑cj∈GCϵj
This ensures robustness of the bound under sampling variability.
The framework extends to arbitrary (A,T) pairs, where A is a GNN architecture and T is a coloring algorithm bounding its expressivity. If T⊑S (expressivity of T does not exceed S), then pT≤pS, meaning more expressive architectures have larger Rademacher complexity bounds.
This is primarily a theoretical work, with all proposed bounds verified through mathematical proofs. The authors provide visualization examples in Figure 1, demonstrating how function classes of different expressivity induce different sample partitions.
This paper cites 28 relevant references, covering important works in core areas including GNN expressivity, generalization theory, and Rademacher complexity, providing solid foundation for theoretical analysis.
Summary: Through the lens of coloring algorithms, this paper establishes for the first time a quantitative theoretical connection between GNN expressivity and generalization capability, providing important theoretical tools for understanding and designing GNNs. Despite some limitations, its theoretical contributions hold significant value and are expected to advance GNN theory research.