There is no known polynomial-time algorithm for graph isomorphism testing, but elementary combinatorial "refinement" algorithms seem to be very efficient in practice. Some philosophical justification is provided by a classical theorem of Babai, Erdős and Selkow: an extremely simple polynomial-time combinatorial algorithm (variously known as "naïve refinement", "naïve vertex classification", "colour refinement" or the "1-dimensional Weisfeiler-Leman algorithm") yields a so-called canonical labelling scheme for "almost all graphs". More precisely, for a typical outcome of a random graph $G(n,1/2)$, this simple combinatorial algorithm assigns labels to vertices in a way that easily permits isomorphism-testing against any other graph.
We improve the Babai-Erdős-Selkow theorem in two directions. First, we consider randomly perturbed graphs, in accordance with the smoothed analysis philosophy of Spielman and Teng: for any graph $G$, naïve refinement becomes effective after a tiny random perturbation to $G$ (specifically, the addition and removal of $O(n\log n)$ random edges). Actually, with a twist on naïve refinement, we show that $O(n)$ random additions and removals suffice. These results significantly improve on previous work of Gaudio-Rácz-Sridhar, and are in certain senses best-possible.
Second, we complete a long line of research on canonical labelling of random graphs: for any $p$ (possibly depending on $n$), we prove that a random graph $G(n,p)$ can typically be canonically labelled in polynomial time. This is most interesting in the extremely sparse regime where $p$ has order of magnitude $c/n$; denser regimes were previously handled by Bollobás, Czajka-Pandurangan, and Linial-Mosheiff. Our proof also provides a description of the automorphism group of a typical outcome of $G(n,p_n)$ (slightly correcting a prediction of Linial-Mosheiff).
The graph isomorphism testing problem lacks a known polynomial-time algorithm, yet basic combinatorial "refinement" algorithms perform remarkably efficiently in practice. The classical theorem of Babai, Erdős, and Selkow provides a philosophical explanation: an extremely simple polynomial-time combinatorial algorithm (called "naive refinement," "naive vertex classification," "color refinement," or "1-dimensional Weisfeiler-Leman algorithm") provides a canonical labeling scheme for "almost all graphs."
This paper improves the Babai-Erdős-Selkow theorem in two directions: first, by considering randomly perturbed graphs according to the smoothed analysis framework of Spielman and Teng; second, by completing a long-standing research line on canonical labelings of random graphs.
Importance of the graph isomorphism problem: Graph isomorphism testing is a central problem in computational complexity theory; it lies in NP but is not known to be either in P or NP-complete
Gap between practice and theory: Although no polynomial-time algorithm is known for graph isomorphism testing in the worst case, the polynomial-time color refinement algorithm performs excellently in practice
Limitations of the Babai-Erdős-Selkow theorem: The classical theorem applies only to random graphs G(n,1/2), and color refinement can fail on structured graphs (for example, it assigns every vertex of a regular graph the same color)
Smoothed analysis results: Proving that after O(n log n) random edge perturbations to an arbitrary graph G₀, the color refinement algorithm almost always succeeds
Improved perturbation bounds: Through a modified algorithm, reducing the required perturbations to O(n) random edges
Complete theory for sparse random graphs: Providing polynomial-time canonical labeling schemes for random graphs G(n,p) of arbitrary density p
Characterization of automorphism groups: Describing the automorphism group of a typical outcome of G(n,p), slightly correcting a prediction of Linial-Mosheiff
Given two n-vertex graphs G₁ and G₂, the graph isomorphism problem asks whether there exists a bijection between their vertex sets that preserves adjacency and non-adjacency. A canonical labeling scheme assigns each graph a standard labeled form such that two graphs are isomorphic if and only if their canonical forms are identical, so isomorphism testing reduces to comparing canonical forms.
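As an illustration of how color refinement can yield a canonical form, here is a minimal Python sketch. It is not the paper's construction: the function names `colour_refinement` and `canonical_form`, the adjacency-dictionary representation, and the example graphs are assumptions made for this example. The sketch repeatedly recolors each vertex by its current color together with the sorted multiset of its neighbors' colors; if the stable coloring gives every vertex a distinct color, the colors serve as labels and the sorted edge list is a canonical form.

```python
def colour_refinement(adj):
    """Return the stable colouring of a graph given as {vertex: set_of_neighbours}.

    Colours are integers that depend only on the isomorphism type of the
    graph, not on the names of the vertices.
    """
    colour = {v: 0 for v in adj}
    while True:
        # Signature: own colour plus the sorted multiset of neighbour colours.
        sig = {v: (colour[v], tuple(sorted(colour[u] for u in adj[v]))) for v in adj}
        # Rename signatures by their rank, so the names are label-independent.
        names = {s: i for i, s in enumerate(sorted(set(sig.values())))}
        new = {v: names[sig[v]] for v in adj}
        # Classes only ever split, so an unchanged count means stability.
        if len(set(new.values())) == len(set(colour.values())):
            return new
        colour = new


def canonical_form(adj):
    """Sorted edge list in colour labels, or None if some colour class has size > 1."""
    colour = colour_refinement(adj)
    if len(set(colour.values())) < len(adj):
        return None  # refinement alone does not distinguish all vertices
    return sorted((colour[u], colour[v])
                  for u in adj for v in adj[u] if colour[u] < colour[v])


# An asymmetric tree: the path 0-1-2-3-4-5 with an extra leaf 6 attached to 2.
tree = {0: {1}, 1: {0, 2}, 2: {1, 3, 6}, 3: {2, 4}, 4: {3, 5}, 5: {4}, 6: {2}}
# The same tree with its vertices renamed by v -> (3v + 2) mod 7.
renamed = {(3 * v + 2) % 7: {(3 * u + 2) % 7 for u in nb} for v, nb in tree.items()}
print(canonical_form(tree) == canonical_form(renamed))  # True

# A 6-cycle is regular, so every vertex keeps the same colour.
cycle6 = {i: {(i - 1) % 6, (i + 1) % 6} for i in range(6)}
print(canonical_form(cycle6))  # None
```

The 6-cycle shows the failure mode on regular graphs: the refinement never splits the single initial color class, so this sketch returns no canonical form.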
For constant ε > 0 and p ≥ (1+ε)log n/n, for any graph G₀ and random graph G_rand ~ G(n,p), the color refinement algorithm almost always distinguishes all vertices of G₀△G_rand.
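The statement above can be explored empirically with a small Python sketch. The helper names `colour_classes`, `gnp`, `perturb` and `cycle`, the choice of a cycle as G₀, and the parameters n = 300 and ε = 1 are all assumptions made for illustration, not taken from the paper. Graphs are stored as sets of two-element `frozenset` edges, so the symmetric difference G₀△G_rand is simply Python's set operator `^`. The theorem is asymptotic, so at small n a single run may well leave some vertices undistinguished.

```python
import math
import random
from itertools import combinations


def colour_classes(n, edges):
    """Number of colour classes in the stable colouring of a graph on range(n)."""
    adj = {v: set() for v in range(n)}
    for e in edges:
        u, v = tuple(e)
        adj[u].add(v)
        adj[v].add(u)
    colour = {v: 0 for v in adj}
    count = 1
    while True:
        # Signature: own colour plus the sorted multiset of neighbour colours.
        sig = {v: (colour[v], tuple(sorted(colour[u] for u in adj[v]))) for v in adj}
        names = {s: i for i, s in enumerate(sorted(set(sig.values())))}
        if len(names) == count:  # no class was split: the colouring is stable
            return count
        colour = {v: names[sig[v]] for v in adj}
        count = len(names)


def gnp(n, p, rng):
    """Sample G(n,p): each of the n(n-1)/2 possible edges appears independently."""
    return {frozenset(e) for e in combinations(range(n), 2) if rng.random() < p}


def perturb(g0, g_rand):
    """Symmetric difference of edge sets: edges in exactly one of the two graphs."""
    return g0 ^ g_rand


def cycle(n):
    """Edge set of the n-cycle (n >= 3), a 2-regular graph."""
    return {frozenset((i, (i + 1) % n)) for i in range(n)}


if __name__ == "__main__":
    n, eps = 300, 1.0
    p = (1 + eps) * math.log(n) / n
    rng = random.Random(0)
    g0 = cycle(n)
    print(colour_classes(n, g0))  # 1: refinement cannot split a regular graph
    # Number of classes after perturbation; n classes means all vertices distinguished.
    print(colour_classes(n, perturb(g0, gnp(n, p, rng))))
```

Comparing the second printed value with n across several seeds and values of ε gives a rough feel for how quickly the perturbation breaks the symmetry of G₀.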
There exist a graph class H and a polynomial-time canonical labeling algorithm such that for p ≥ 100/n, for any graph G₀ and G_rand ~ G(n,p), almost always G₀△G_rand ∈ H.
This is the core technical result, proving that in appropriately randomly perturbed graphs, when S^{≤i}({u,v}) is sufficiently large, vertices u and v are almost always distinguished by color refinement.
The time complexity of each algorithm component is analyzed in detail, establishing that the overall procedure runs in polynomial time.
This paper contributes to the theory of the graph isomorphism problem in two ways: it helps explain why simple refinement algorithms are effective in practice, and it completes the theory of canonical labeling for random graphs G(n,p) at every density p. The techniques are technically involved, but they offer a new perspective on this classical problem.