Evolutionary algorithms are widely used for solving multi-objective optimization problems. A prominent example is NSGA-III, which is particularly well suited for solving problems involving more than three objectives, distinguishing it from the classical NSGA-II. Despite its empirical success, the theoretical understanding of NSGA III remains very limited, especially with respect to runtime analysis. A central open problem concerns its population dynamics, which involve controlling the maximum number of individuals sharing the same fitness value during the exploration process. In this paper, we make a significant step towards such an understanding by proving tight runtime bounds for NSGA-III on the bi-objective OneMinMax ($2$-OMM) problem. Firstly, we prove that NSGA-III requires $Ω(n^2 \log(n) / μ)$ generations in expectation to optimize $2$-OMM assuming the population size $μ$ satisfies $n+1 \leq μ=O(\log(n)^c(n+1))$ where $n$ denotes the problem size and $c<1$ is a constant. Apart from~\cite{opris2025multimodal}, this is the first proven lower runtime bound for NSGA-III on a classical benchmark problem. Complementing this, we secondly improve the best known upper bound of NSGA-III on the $m$-objective OneMinMax problem ($m$-OMM) of $O(n \log(n))$ generations by a factor of $μ/(2n/m + 1)^{m/2}$ for a constant number $m$ of objectives and population size $(2n/m + 1)^{m/2} \leq μ\in O(\sqrt{\log(n)} (2n/m + 1)^{m/2})$. This yields tight runtime bounds in the case $m = 2$, and the surprising result that NSGA-III beats NSGA-II by a factor of $μ/n$ in the expected runtime.
Towards a Rigorous Understanding of the Population Dynamics of the NSGA-III: Tight Runtime Bounds Paper ID : 2511.07125Title : Towards a Rigorous Understanding of the Population Dynamics of the NSGA-III: Tight Runtime BoundsAuthor : Andre Opris (University of Passau, Germany)Classification : cs.NE (Neural and Evolutionary Computing)Publication Date : November 2025 (arXiv preprint)Paper Link : https://arxiv.org/abs/2511.07125 This paper provides a rigorous theoretical runtime analysis of the NSGA-III algorithm, which is widely used in multi-objective optimization. Although NSGA-III has achieved empirical success in solving problems with more than three objectives, its theoretical understanding remains severely limited. By analyzing the bi-objective OneMinMax (2-OMM) problem, the paper proves that NSGA-III requires Ω(n²log(n)/μ) generations to optimize this problem (where μ is the population size satisfying n+1 ≤ μ = O(log(n)^c(n+1)), with c<1 being a constant). This is the first lower bound proof for NSGA-III on a classical benchmark problem. Furthermore, the paper improves the known upper bound on the m-OMM problem from O(n log(n)) by a factor of μ/(2n/m+1)^(m/2). For the case m=2, these results yield tight runtime bounds and reveal the surprising result that NSGA-III surpasses NSGA-II by a factor of μ/n in expected runtime.
Missing Theoretical Understanding : Although NSGA-III is widely applied in practice (approximately 6,000 citations), particularly excelling at problems with four or more objectives, its theoretical foundation lags far behind its practical impact. Runtime analysis studies have only recently emerged.Population Dynamics Control : A core open question is understanding NSGA-III's population dynamics, particularly how to control the maximum number of individuals sharing the same fitness value during exploration (the cover number β).Distinction from NSGA-II : NSGA-III and NSGA-II exhibit significant differences in population dynamics:NSGA-III systematically iterates through reference points, always selecting points associated with the least-selected reference points NSGA-II treats all points with zero crowding distance equally This enables NSGA-III to distribute solutions more uniformly Bridging Theoretical Gaps : Providing rigorous mathematical foundations for a practically successful algorithmUnderstanding Algorithm Behavior : Clarifying when and why NSGA-III performs wellGuiding Algorithm Improvement : Deep understanding can help develop enhanced versionsMulti-objective Optimization Foundations : Multi-objective optimization is widely applied in AI, bioinformatics, machine learning, and engineeringNSGA-II Limitations : Crowding distance becomes unreliable with three or more objectives (a solution may have zero crowding distance without being close to other solutions)Insufficient Theoretical Analysis : Before (Opris 2025a), there were no tight runtime bounds for NSGA-II or NSGA-III on classical benchmark functionsUnclear Population Dynamics : How NSGA-III distributes solutions during exploration (particularly before reaching local optima or when no local optima exist) remains unclearFirst Lower Bound Proof : Proves that NSGA-III requires Ω(n²ln(n)/μ) generations expected runtime on the 2-OMM problem, the first lower bound for classical benchmark problems besides (Opris 2025a)Improved Upper Bounds : Improves the known upper bound O(n ln(n)) on the m-OMM problem by a factor of μ/(2n/m+1)^(m/2) for constant m and appropriate population sizesEstablishing Tight Bounds : For the case m=2, lower and upper bounds match, establishing a tight runtime bound of Θ(n²ln(n)/μ)Surpassing NSGA-II : Proves NSGA-III outperforms NSGA-II by a factor of μ/n in expected runtime (NSGA-II's lower bound is Ω(n ln(n)))In-Depth Population Dynamics Analysis :Analyzing time required to cover Pareto front subsets (Lemma 3) Analyzing time required to uniformly distribute solutions on the subset (Lemma 4) Analyzing time lower bounds during exploration toward extreme point 1^n (Lemmas 5 and 6) Proving the decreasing behavior of maximum cover number β during exploration m-objective OneMinMax (m-OMM) Problem :
Input: Bit string x ∈ {0,1}^n, where n is a multiple of m/2 Divide the bit string into m/2 blocks, each of length 2n/m For block j (j ∈ m/2 ):
f_{2j-1}(x): counts the number of 1s in the block f_{2j}(x): counts the number of 0s in the block Objective: Maximize m-OMM(x) = (f_1(x), ..., f_m(x)) Key Properties :
Every search point is Pareto optimal (since all objective values sum to n) Pareto set cardinality is (2n/m+1)^(m/2) No local optima exist (unlike the previously studied OJZJ problem) 1. Cover Number
Definition: c_t(v) := |{x ∈ P_t | f(x) = v}|, the number of individuals in generation t with fitness vector v Maximum cover number: β := max{c_t(v) | v ∈ Pareto front} Key property (Lemma 1): For Pareto optimal solutions, β is non-increasing 2. NSGA-III Selection Mechanism
The algorithm uses a reference point set:
Rp = {(a_1/p, ..., a_m/p) | (a_1,...,a_m) ∈ N_0^m, Σa_i = p}
Selection process:
Compute normalized fitness function f^n Associate each individual with the nearest reference point Iteratively select individuals associated with the least-selected reference points Phase 1: Covering Subset (Lemma 3)
For a given set A ⊂ Pareto front with cardinality α ≤ 3n/8 Cover A within 64α generations with probability at least 1-e^(-Ω(α)) Proof approach: Utilizing random initialization and probabilistic analysis of standard bit mutation Phase 2: Uniform Distribution (Lemma 4)
After covering A, within 84α+46γ generations (γ = min{⌈n/ln(n)⌉, ⌈μ/α⌉}) Cover number of each v ∈ A is at most ⌈μ/α⌉ with probability 1-o(1) Two-case analysis:
Case 1: ⌈μ/α⌉ ≤ ⌈n/ln(n)⌉ Case 2: ⌈μ/α⌉ > ⌈n/ln(n)⌉ Phase 3: Exploration Control (Lemmas 5 and 6)
Lemma 5: Within cn/ln(n) generations, no individuals with |y|_1 ≥ 3n/4 are created with probability 1-o(1) Lemma 6: For constants 0 ≤ a < b ≤ 3/4
Assuming maximum cover number at most β = o(n^(1-b)) Reducing distance from n^b to n^a requires Ω(n ln(n)/β) generations Key insight: Smaller β reduces probability of selecting individuals close to 1^n 1. Iteratively Reducing Cover Number
Unlike previous work (analyzing distribution only after reaching local optima) Dynamically tracking and controlling β during exploration Combining Lemmas 3, 4, and 6 to iteratively reduce β 2. Fine-grained Probabilistic Analysis
Using stochastic dominance and independent sums of geometric distributions Applying Witt (2014)'s tail bound theorem Considering Chernoff bounds and union bounds 3. Phase-wise Analysis
Setting ℓ = ⌈(2c+1)/(1-c)⌉ = O(1) phases:
Phase j: Covers sets of size Ω(n ln(n)^j/ln(n)^((2+j)c)) Reduces β to e_j ln(n)^((2+j)c-j) Finally β = O(μ/n) (minimum possible value) Note : This is a pure theoretical paper with no experimental section. All results are obtained through rigorous mathematical proofs.
Problem Size : n (bit string length)Population Size : μ, satisfying n+1 ≤ μ = O(ln(n)^c(n+1)), where c < 1Number of Objectives : m (constant)Reference Point Parameter : p ≥ 4√2n (ensuring correct normalization)Probabilistic Tools :Chernoff bounds Stochastic dominance Tail bounds for geometric distributions (Witt 2014) Union bounds Key Lemmas :Lemma 1: Protection property of Pareto optimal solutions Standard bit mutation analysis Non-dominated sorting properties Theorem 7 (Lower Bound) :
For the 2-OMM problem, under conditions:
p ≥ 4√2n μ ≥ n+1 μ ∈ O(ln(n)^c n), c < 1 is a constant The expected number of generations for NSGA-III to cover the entire Pareto front is at least Ω(n²ln(n)/μ) .
Proof Highlights :
After initial 130⌊n/ln(n)⌋ generations:No individual y satisfies |y|_1 ≥ 3n/4 β ≤ 2ln(n)^(1+c) Iterative process (ℓ = O(1) iterations):Each iteration: explores distance n^(b_j) → n^(b_{j+1}) Requires Ω(n ln(n)/β) generations Then reduces β to new value Final phase:β = O(μ/n) Reducing from n^(1/8) to n^(1/16) requires Ω(n²ln(n)/μ) generations Theorem 8 (Upper Bound) :
For the m-OMM problem (m is constant), let S_m = (2n/m+1)^(m/2). If:
Then NSGA-III finds the Pareto optimal set in expected O(min{S_m n ln(n)/μ + nμ/S_m, n ln(n)}) generations.
Proof Highlights :
For each Pareto optimal vector v:First increase c_t(v) to ⌊μ/S_m⌋ Then reduce distance d_t to 0 Time decomposition:Increasing cover number: O(nμ/S_m) generations Covering v: O(S_m n ln(n)/μ) generations Union bound ensures high probability For the case m=2 :
Lower bound: Ω(n²ln(n)/μ) Upper bound: O(n²ln(n)/μ) Conclusion : Θ(n²ln(n)/μ) is a tight runtime boundComparison with NSGA-II :
NSGA-II: Ω(n ln(n)) generations (Doerr & Qu 2023a) NSGA-III: O(n²ln(n)/μ) = O(n ln(n)) when μ = Θ(n) Speedup Factor : μ/n (significant when μ = ω(n))Role of Population Size :Larger μ allows more individuals near targets Reduces β, thus accelerating exploration Optimal μ range exists: (2n/m+1)^(m/2) ≤ μ ∈ O(√ln(n)(2n/m+1)^(m/2)) Advantages of Uniform Distribution :NSGA-III's reference point mechanism ensures uniform solution distribution Particularly beneficial when no local optima exist More effective than NSGA-II's crowding distance Improvement Factors :Compared to Opris et al. (2024)'s O(n ln(n)) upper bound Improvement factor: min{S_m/μ, μ/(S_m ln(n))} Significant improvement for appropriate μ Pioneering Work :Zheng, Liu & Doerr (2022): First NSGA-II runtime analysis Sparked numerous subsequent studies Bi-objective Problems :Doerr & Qu (2022, 2023a,b): Multimodality, crossover operators Dang et al. (2023, 2024, 2025a,b): Robustness, crossover advantages Zheng & Doerr (2022): Crowding distance improvements Combinatorial Optimization :Cerf et al. (2023): Minimum spanning tree Deng et al. (2024): Subset selection NSGA-III :Wietheger & Doerr (2023): First runtime analysis Opris et al. (2024): O(n ln(n)) bounds on m-OMM Opris (2025a): Multimodal analysis on OJZJ Other Algorithms :SMS-EMOA: Zheng & Doerr (2024b) SPEA2: Related analysis PAES-25: Opris (2025b), Θ(n³) to Θ(n(2n/m)^(m/2)ln(n/m)) GSEMO :Doerr, Krejca & Opris (2025): Tight bounds on COCZ and OMM First Lower Bound : First NSGA-III lower bound on classical benchmarks besides Opris (2025a)Tight Bounds : Upper and lower bounds match (m=2)Population Dynamics : First analysis of β evolution during explorationSurpassing NSGA-II : Theoretical proof of NSGA-III's advantagesTheoretical Breakthrough :Establishes tight runtime bound Θ(n²ln(n)/μ) for NSGA-III on 2-OMM Proves NSGA-III surpasses NSGA-II by factor μ/n Population Dynamics Understanding :Maximum cover number β decreases during exploration β reduction directly affects exploration speed Final β = O(μ/n) is the minimum possible value Algorithm Behavior Insights :NSGA-III distributes solutions uniformly via reference points Particularly effective on problems without local optima Population size μ selection is critical Problem Type Restrictions :Analysis focuses on OMM problems (no local optima) Different dynamics from OJZJ (with local optima) More complex fitness landscapes remain unstudied Population Size Constraints :Tight bounds only hold for specific μ ranges n+1 ≤ μ = O(ln(n)^c(n+1)), c < 1 Bounds may not be tight for very large or very small μ Number of Objectives :Tight bounds for m=2 Room for improvement when m>2 (gap between bounds) Practical Application :Analysis based on pseudo-Boolean functions Real problem fitness landscapes are more complex Constant factors may be important in practice Extension to More Objectives :Establish tight bounds for m>2 Analyze complexity of high-dimensional Pareto fronts Complex Fitness Landscapes :Problems requiring reaching Pareto fronts Multimodal and deceptive problems Practical scheduling and graph problems Practical Improvements :Develop enhanced versions based on theoretical insights Adaptive population size strategies Improved reference point selection Other Algorithms :Apply population dynamics analysis to other MOEAs Compare theoretical performance of different selection mechanisms 1. Theoretical Rigor
All results have complete mathematical proofs Advanced probabilistic analysis techniques employed Both upper and lower bounds established, matching for m=2 2. Methodological Innovation
First to dynamically track cover number β during exploration Novel framework for iteratively reducing β Organic combination of covering, distribution, and exploration phases 3. Significant Results
First NSGA-III lower bound on classical benchmarks (besides one paper) Proves NSGA-III surpasses NSGA-II (latter has 60,000+ citations) Tight bounds provide complete picture of algorithm understanding 4. Technical Depth
Fine-grained probabilistic analysis (geometric distributions, tail bounds, union bounds) Multi-phase iterative analysis framework Considers multiple parameter ranges 5. Clear Presentation
Well-structured, logical flow Lemmas progressively build toward final theorem Sufficient technical detail without redundancy 1. Limited Application Scope
Only analyzes OMM benchmark problems Lacks analysis on practical problems Constant factors not optimized (may be important in practice) 2. Missing Experimental Validation
Pure theoretical paper, no experimental verification Cannot verify practical impact of constant factors No comparison with empirical studies 3. Parameter Range Restrictions
Limited population size μ range Cases like μ = Θ(n²) not covered Constant c < 1 constraint is strong 4. Objective Number Limitations
Gap between bounds remains for m>2 High-dimensional case complexity not fully resolved Many practical applications involve more objectives 5. Algorithm Variants Not Considered
Only analyzes standard NSGA-III Adaptive variants not considered Other selection strategies not compared 1. Theoretical Contribution
Fills important gap in NSGA-III theoretical analysis Provides new methods for population dynamics research Likely to inspire more runtime analysis studies 2. Practical Guidance
Reveals importance of population size selection Explains sources of NSGA-III's advantages Can guide algorithm parameter tuning 3. Academic Value
Suitable for top-tier conferences/journals (e.g., AAAI) High citation potential (connecting theory and practice) Likely to become benchmark work in the field 4. Limitations
Short-term practical impact may be limited (pure theory) More work needed to translate insights into practical improvements Practical importance of constant factors unknown 1. Theoretical Research
Methodological reference for runtime analysis Foundation for population dynamics research Starting point for analyzing other MOEAs 2. Algorithm Design
Guides new MOEA design (considering uniform distribution) Theoretical guidance for population size Improvements to reference point mechanisms 3. Benchmark Testing
OMM as theoretical analysis benchmark Validating theoretical performance of new algorithms Comparing different selection mechanisms 4. Educational Use
Teaching material for evolutionary algorithm courses Examples of probabilistic analysis techniques Case studies of theory-practice integration Not Applicable To :
Direct application to practical engineering problems (requires more work) Scenarios requiring rapid prototyping (theoretical analysis is time-consuming) Non-OMM type problems (requires new analysis) The paper cites extensive related work, with key references including:
NSGA-III Original Paper :Deb & Jain (2014): Introduction of NSGA-III NSGA-II Analysis :Zheng, Liu & Doerr (2022): First runtime analysis Doerr & Qu (2023a): First lower bound NSGA-III Analysis :Wietheger & Doerr (2023): First analysis Opris et al. (2024): Upper bounds on m-OMM Opris (2025a): Multimodal analysis on OJZJ Probabilistic Tools :Witt (2014): Tail bound theorem Badkobeh et al. (2015): Parallel search Benchmark Problems :Zheng & Doerr (2024a): OMM definition and NSGA-II inefficiency Overall Assessment : This is a high-quality theoretical paper achieving important breakthroughs in NSGA-III runtime analysis. By establishing tight bounds and revealing population dynamics, it provides solid theoretical foundations for understanding this widely-used algorithm in practice. Despite limited application scope, its methodology and insights hold significant value for the field. The paper is suitable for publication at top-tier venues and likely to become an important reference for future research.