We present a new probabilistic proof of Otter's asymptotic formula for the number of unlabelled trees with a given number of vertices. We additionally prove a new approximation result, showing that the total variation distance between random Pólya trees and random unlabelled trees tends to zero when the number of vertices tends to infinity. In order to demonstrate that our approach is not restricted to trees we extend our results to tree-like classes of graphs.
Probabilistic enumeration and equivalence of nonisomorphic trees Paper ID : 2305.16453Title : Probabilistic enumeration and equivalence of nonisomorphic treesAuthor : Benedikt Stufler (Vienna University of Technology)Classification : math.CO (Combinatorics), math.PR (Probability Theory)Publication Date : May 2023, revised October 2025Paper Link : https://arxiv.org/abs/2305.16453 This paper presents a novel probabilistic proof of Otter's asymptotic formula for the number of unlabeled trees with a given number of vertices. Furthermore, it establishes a new approximation result demonstrating that the total variation distance between random Pólya trees and random unlabeled trees converges to zero as the number of vertices approaches infinity. To demonstrate that the approach extends beyond trees, the author generalizes the results to tree-like graph classes.
Classical tree enumeration : Cayley's theorem provides the counting formula for labeled trees u n = n n − 2 u_n = n^{n-2} u n = n n − 2 , but enumeration of unlabeled trees is considerably more complexSignificance of Otter's formula : Otter's 1948 asymptotic formula for the number of unlabeled trees is a classical result in combinatoricsAbsence of probabilistic approaches : Existing proofs of Otter's formula rely primarily on generating functions and analytic combinatorics, lacking a probabilistic perspectiveMethodological innovation : Providing the first probabilistic proof of Otter's formula, enriching the methodology of combinatorial enumerationEquivalence problem : Resolving the fundamental question of the relationship between random Pólya trees and random unlabeled treesResult transfer : Providing stronger theoretical foundations for transferring properties from rooted trees to unrooted treesGeneralization potential : Demonstrating the universality of the proof method and extending it to more general graph classesProvides a new probabilistic proof of Otter's asymptotic formula : Avoiding the traditional dissymmetry equation approachProves asymptotic equivalence of random trees : lim n → ∞ d T V ( F ( A n ) , F n ) = 0 \lim_{n→∞} d_{TV}(F(A_n), F_n) = 0 lim n → ∞ d T V ( F ( A n ) , F n ) = 0 Establishes stronger approximation theory : Directly proving equivalence rather than requiring additional small tree approximations as in previous workExtends to degree-restricted trees and tree-like graph classes : Demonstrating the universality of the methodProvides precise convergence rates : For 1 / 2 < α < 1 1/2 < α < 1 1/2 < α < 1 , establishes exponential convergence ratesThe author transforms the tree enumeration problem into an orbit counting problem under the action of symmetric groups through symmetry analysis.
Symmetry set : S y m ( U ) [ V ] = { ( T , σ ) ∣ T ∈ U [ V ] , σ ∈ S [ V ] , σ . T = T } Sym(U)[V] = \{(T,σ) | T ∈ U[V], σ ∈ S[V], σ.T = T\} S y m ( U ) [ V ] = {( T , σ ) ∣ T ∈ U [ V ] , σ ∈ S [ V ] , σ . T = T } Fixed point classification : S y m k ( U ) [ V ] Sym_k(U)[V] S y m k ( U ) [ V ] denotes symmetries with exactly k k k fixed pointsGenerating function relations : Establishing connections between exponential generating functionsDecomposing the generating function of unlabeled trees as:
F ( z ) = S y m 0 ( U ) ( z ) + U ( H ( z ) ) F(z) = Sym_0(U)(z) + U(H(z)) F ( z ) = S y m 0 ( U ) ( z ) + U ( H ( z ))
where:
U ( z ) = ∑ n ≥ 1 n n − 2 n ! z n U(z) = \sum_{n≥1} \frac{n^{n-2}}{n!}z^n U ( z ) = ∑ n ≥ 1 n ! n n − 2 z n is the exponential generating function of labeled treesH ( z ) = z exp ( ∑ i ≥ 2 A ( z i ) / i ) H(z) = z\exp(\sum_{i≥2} A(z^i)/i) H ( z ) = z exp ( ∑ i ≥ 2 A ( z i ) / i ) corresponds to symmetries fixing only the rootIntroducing independent random variables X 1 , X 2 , . . . X_1, X_2, ... X 1 , X 2 , ... , whose probability generating function is:
E [ z X ] = ρ z exp ( 1 + ∑ i ≥ 2 A ( ( ρ z ) i ) / i ) E[z^X] = ρz\exp(1 + \sum_{i≥2} A((ρz)^i)/i) E [ z X ] = ρ z exp ( 1 + ∑ i ≥ 2 A (( ρ z ) i ) / i )
and an independent random variable N N N satisfying E [ z N ] = 2 U ( z / e ) E[z^N] = 2U(z/e) E [ z N ] = 2 U ( z / e ) .
Utilizing moderate deviation inequalities for random walks and Stirling's formula to obtain:
[ z n ] F ( z ) ∼ 1 2 π E [ X ] 3 / 2 n − 5 / 2 ρ − n [z^n]F(z) ∼ \frac{1}{\sqrt{2π}}E[X]^{3/2}n^{-5/2}ρ^{-n} [ z n ] F ( z ) ∼ 2 π 1 E [ X ] 3/2 n − 5/2 ρ − n
Fixed point control : Proving that the probability of fixed point numbers deviating from the expected value n / E [ X ] n/E[X] n / E [ X ] by more than n α n^α n α is exponentially smallConditional comparison : Comparing probabilities in rooted and unrooted cases when fixed point numbers lie within reasonable rangesSecond-order analysis : Utilizing second-order asymptotic expansions of Pólya trees to control error termsThis is primarily a theoretical work, with experimental components including:
Constant computation : Verifying c A ≈ 0.439924 c_A ≈ 0.439924 c A ≈ 0.439924 , ρ ≈ 0.338321 ρ ≈ 0.338321 ρ ≈ 0.338321 Convergence rate verification : Validating exponential convergence rates through explicit calculationsExtension verification : Verifying theoretical results in degree-restricted casesModerate deviation inequalities : Controlling tail probabilities of random walksGenerating function analysis : Establishing connections between combinatorial structures and probability distributionsSingularity analysis : Deriving asymptotic behavior from singularity properties of generating functionsf n ∼ c F n − 5 / 2 ρ − n f_n ∼ c_F n^{-5/2}ρ^{-n} f n ∼ c F n − 5/2 ρ − n
where c F = 2 π c A 3 c_F = 2πc_A^3 c F = 2 π c A 3 , c A = 1 2 π E [ X ] 1 / 2 c_A = \frac{1}{\sqrt{2π}}E[X]^{1/2} c A = 2 π 1 E [ X ] 1/2 .
lim n → ∞ d T V ( F ( A n ) , F n ) = 0 \lim_{n→∞} d_{TV}(F(A_n), F_n) = 0 lim n → ∞ d T V ( F ( A n ) , F n ) = 0
More precisely, for 1 / 2 < α < 1 1/2 < α < 1 1/2 < α < 1 , there exist constants c , C > 0 c, C > 0 c , C > 0 such that:
∣ P ( F ( A n ) ∈ E ) − P ( F n ∈ E ) ∣ ≤ exp ( − c n 2 α − 1 ) + P ( F ( A n ) ∈ E ) C n α − 1 |P(F(A_n) ∈ E) - P(F_n ∈ E)| ≤ \exp(-cn^{2α-1}) + P(F(A_n) ∈ E)Cn^{α-1} ∣ P ( F ( A n ) ∈ E ) − P ( F n ∈ E ) ∣ ≤ exp ( − c n 2 α − 1 ) + P ( F ( A n ) ∈ E ) C n α − 1
Degree-restricted trees : For degree set Ω Ω Ω , lim n → ∞ d T V ( F ( A ~ n Ω ) , F n Ω ) = 0 \lim_{n→∞} d_{TV}(F(\tilde{A}_n^Ω), F_n^Ω) = 0 lim n → ∞ d T V ( F ( A ~ n Ω ) , F n Ω ) = 0 Subcritical graph classes : For graph classes C C C satisfying certain conditions, lim n → ∞ d T V ( F ( C n • ) , C n ) = 0 \lim_{n→∞} d_{TV}(F(C_n^•), C_n) = 0 lim n → ∞ d T V ( F ( C n • ) , C n ) = 0 Cayley (1889) : Counting formula for labeled treesPólya (1937) : General counting theory and Pólya treesOtter (1948) : Asymptotic formula for unlabeled treesHarary (1955) : Alternative proof using dissymmetry equationsAnalytic combinatorics methods : Singularity analysis by Flajolet-SedgewickProbabilistic methods : Research on properties of random treesTransfer theorems : Result transfer from rooted to unrooted treesCompared to existing work, this paper is the first to:
Provide a probabilistic proof of Otter's formula Establish the strongest form of random tree equivalence Avoid complex dissymmetry equations Methodological contribution : Successful application of probabilistic methods in combinatorial enumerationTheoretical breakthrough : Establishing complete equivalence between random Pólya trees and random unlabeled treesTechnical innovation : Clever combination of symmetry analysis and moderate deviation inequalitiesTechnical complexity : The proof requires deep knowledge of probability theory and combinatoricsGeneralization constraints : Extending to non-tree structures requires additional technical conditionsComputational complexity : Precise computation of constants remains difficultUnlabeled planar graphs : Extending the method to more complex graph classesFunctional convergence : Studying limiting processes of contour functionsAlgorithmic applications : Applying theoretical results to random generation algorithmsTheoretical depth : Solving a classical problem in combinatorics from a novel perspectiveTechnical innovation : Cleverly combining symmetric group theory, probability theory, and generating function methodsResult strength : Not only reproving classical results but obtaining stronger equivalence theoremsMethod universality : Successfully extending to degree-restricted and graph class casesProof complexity : Numerous technical details with high barriers to understandingPractical applications : Primarily theoretical contributions with limited direct applicabilityComputational aspects : Limited assistance for practical problems such as computing constantsAcademic value : Providing important examples for interdisciplinary research between combinatorics and probability theoryMethodological significance : Demonstrating the powerful potential of probabilistic methods in enumerative combinatoricsSubsequent research : Providing new technical tools for related problem investigationsTheoretical research : Asymptotic analysis of combinatorial structuresRandomized algorithms : Random generation and analysis of tree structuresProbabilistic combinatorics : Properties of large random structuresThe paper cites 32 important references spanning from Cayley's classical work to modern developments in probabilistic combinatorics, particularly:
Otter (1948): Original asymptotic formula Pólya (1937): Foundational counting theory Flajolet & Sedgewick (2009): Analytic combinatorics methods Author's previous work: Limit theory of random trees This paper has significant theoretical value, not only providing a new proof of a classical result but more importantly establishing fundamental equivalences in random tree theory, laying a solid foundation for further development in this field.