A direct PinT algorithm for higher-order nonlinear time-evolution equations

Zhong, Zhao, Shu

Higher-order nonlinear time-evolution equations have widespread applications in science and engineering, such as in solid mechanics, materials science, and fluid mechanics. This paper mainly studies a direct time-parallel algorithm for solving time-dependent differential equations of orders 1 to 3. Different from the traditional time-stepping approach, we directly solve the all-at-once system from higher-order evolution equations by diagonalizing the time discretization matrix $B$. Based on the connection between the characteristic equation and Chebyshev polynomials, we give explicit formulas for the eigenvector matrix $V$ of $B$ and its inverse $V^{-1}$. We prove that $Cond_2\left( V \right) =\mathcal{O} \left( n^3 \right)$, where $n$ is the number of time steps. A direct parallel-in-time algorithm is designed by exploring the structure of the spectral decomposition of $B$. Numerical experiments are provided to show the significant computational speedup of the proposed algorithm.

Basic Information

Paper ID: 2507.05743
Title: A direct PinT algorithm for higher-order nonlinear time-evolution equations
Authors: Shun-Zhi Zhong, Yong-Liang Zhao, Qian-Yu Shu (School of Mathematical Sciences, Sichuan Normal University)
Classification: math.NA cs.NA
Publication Date: October 12, 2025 (arXiv v2)
Paper Link: https://arxiv.org/abs/2507.05743v2

Abstract

Higher-order nonlinear time-evolution equations have widespread applications in solid mechanics, materials science, and fluid mechanics. This paper investigates a direct parallel-in-time (PinT) algorithm for solving first- to third-order time-dependent differential equations. Unlike traditional time-stepping methods, this work directly solves the all-at-once system of higher-order evolution equations by diagonalizing the temporal discretization matrix $B$. Based on the connection between the characteristic equation and Chebyshev polynomials, explicit formulas are provided for the eigenvector matrix $V$ of $B$ and its inverse $V^{-1}$. It is proven that $\text{Cond}_2(V) = \mathcal{O}(n^3)$, where $n$ is the number of time steps. By exploring the spectral decomposition structure of $B$, a direct parallel-in-time algorithm is designed. Numerical experiments demonstrate significant computational acceleration.

Research Background and Motivation

Problem Background

Parallelization in time of time-evolution problems is an active research topic. Traditional time-stepping methods are inherently sequential in time, so they often cannot use modern supercomputers efficiently. Introducing parallelism in the time direction can significantly reduce computing time.

Limitations of Existing Methods

Limitations of iterative PinT algorithms: While existing parallel algorithms (such as MGRiT, PFASST) perform well for strongly dissipative problems, they show poor performance for wave propagation problems, as convergence speed largely depends on dissipativity.
Challenges in diagonalization methods:
- Traditional backward Euler discretization leads to non-diagonalizable matrices
- Using different time step sizes enables diagonalization but may result in large condition numbers for the eigenvector matrix, increasing rounding errors
- Existing methods restrict the number of time steps $n$ (typically to about 20 to 25)

Research Motivation

This work aims to remove the unfavorable restriction on $n$, extend the approach from special second-order partial differential equations to more general first- to third-order forms, and design a direct PinT algorithm for solving the all-at-once system.

Core Contributions

Theoretical Proof: Proves that matrix $B$ can be diagonalized as $B = VDV^{-1}$
Explicit Expressions: Provides analytical expressions for $V$, $V^{-1}$, and $D$, and rigorously proves that the condition number of $V$ satisfies $\text{Cond}_2(V) = \mathcal{O}(n^3)$
Fast Algorithm: Proposes a fast algorithm for computing $V^{-1}$ that outperforms MATLAB's built-in eig function combined with mrdivide for larger $n$ (in the reported timings, from $n = 256$ upward)
Algorithm Extension: Extends the direct PinT algorithm to first- to third-order nonlinear differential equations

Methodology Details

Problem Formulation

The paper considers nonlinear time-evolution equations of the following forms:

First-order problem: $u'(t) + f(u(t)) = 0$, $u(0) = u_0$
Second-order problem: $u''(t) + a_1u'(t) + f(u(t)) = 0$, $u(0) = u_0$, $u'(0) = \tilde{u}_0$
Third-order problem: $u'''(t) + a_1u''(t) + a_2u'(t) + f(u(t)) = 0$, with additional initial conditions

Core Algorithm Framework

Temporal Discretization Scheme

A hybrid temporal discretization scheme is employed:

Central finite difference formulas for the first $n-1$ steps
BDF2 formula for the final step

$\begin{cases} \frac{u_{j+1}-u_{j-1}}{2\Delta t} + Au_j = f_j, & j = 1,2,\ldots,n-1 \\ \frac{\frac{3}{2}u_n - 2u_{n-1} + \frac{1}{2}u_{n-2}}{\Delta t} + Au_n = f_n \end{cases}$

The corresponding temporal discretization matrix is: $B = \frac{1}{\Delta t}\begin{pmatrix} 0 & \frac{1}{2} & & & \\ -\frac{1}{2} & 0 & \frac{1}{2} & & \\ & \ddots & \ddots & \ddots & \\ & & -\frac{1}{2} & 0 & \frac{1}{2} \\ & & \frac{1}{2} & -2 & \frac{3}{2} \end{pmatrix}$
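As a minimal sketch of how this matrix enters the all-at-once system, the following Python snippet assembles $B$ and solves the scalar linear test problem $u' + au = 0$, $u(0) = u_0$ for all time levels at once. The scalar model problem, the names `build_B`, `solve`, and `all_at_once`, and the dense Gaussian elimination are assumptions made for illustration and are not taken from the paper. The known value $u_0$ in the first central-difference row is moved to the right-hand side.

```python
import math

def build_B(n, dt):
    """Rows 1..n-1: central differences; row n: BDF2 (requires n >= 3)."""
    B = [[0.0] * n for _ in range(n)]
    for j in range(n - 1):
        if j > 0:
            B[j][j - 1] = -0.5 / dt
        B[j][j + 1] = 0.5 / dt
    B[n - 1][n - 3] = 0.5 / dt
    B[n - 1][n - 2] = -2.0 / dt
    B[n - 1][n - 1] = 1.5 / dt
    return B

def solve(M, rhs):
    """Dense Gaussian elimination with partial pivoting (small demo only)."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(M, rhs)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(aug[i][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for i in range(c + 1, n):
            f = aug[i][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[i][k] -= f * aug[c][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def all_at_once(a, u0, T, n):
    """Solve (B + a I) u = b for u_1..u_n in one linear solve."""
    dt = T / n
    B = build_B(n, dt)
    M = [[B[i][j] + (a if i == j else 0.0) for j in range(n)] for i in range(n)]
    b = [0.0] * n
    b[0] = u0 / (2 * dt)  # u_0 from the first central-difference row
    return solve(M, b)

u = all_at_once(a=1.0, u0=1.0, T=1.0, n=64)
print("error at t = 1:", abs(u[-1] - math.exp(-1.0)))
```

The dense solve is only a stand-in for checking the discretization. The point of the paper's approach is to replace it with the diagonalization of $B$.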

Spectral Decomposition Theory

Theorem 3.1: The eigenvalues of the scaled matrix $\Delta t\,B$ are $ix_j$, so that $B$, which carries the factor $\frac{1}{\Delta t}$, has eigenvalues $\lambda_j = ix_j/\Delta t$. Here $\{x_j\}_{j=1}^n$ are the $n$ roots of the equation: $U_{n-1}(x) + iU_{n-2}(x) - iT_n(x) + T_{n-1}(x) = 0$

The corresponding eigenvectors are $P_j = [p_{j,0}, \ldots, p_{j,n-1}]^T$, where: $p_{j,k} = i^k U_k(x_j), \quad k = 0,\ldots,n-1$

Here $T_n(x)$ and $U_n(x)$ are Chebyshev polynomials of the first and second kind, respectively.
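The link between the eigenvector formula and the characteristic equation can be checked numerically. The sketch below takes $\Delta t = 1$ (an assumption made only for this check) and evaluates the residual of $Bp - (ix)p$ for $p_k = i^k U_k(x)$ at an arbitrary complex $x$. By the Chebyshev three-term recurrence, the first $n-1$ rows vanish for every $x$. A short calculation suggests that the last (BDF2) row equals $-i^{n-3}$ times the left-hand side of the characteristic equation, so it vanishes exactly at the roots $x_j$. The function names are hypothetical.

```python
def chebyshev(n_max, x):
    """T_k(x) and U_k(x) for k = 0..n_max via the three-term recurrence."""
    T, U = [1, x], [1, 2 * x]
    for _ in range(2, n_max + 1):
        T.append(2 * x * T[-1] - T[-2])
        U.append(2 * x * U[-1] - U[-2])
    return T, U

def char_poly(n, x):
    """Left-hand side of the characteristic equation in Theorem 3.1."""
    T, U = chebyshev(n, x)
    return U[n - 1] + 1j * U[n - 2] - 1j * T[n] + T[n - 1]

def residual(n, x):
    """Rows of B p - (i x) p with dt = 1 and p_k = i^k U_k(x), n >= 3."""
    _, U = chebyshev(n, x)
    p = [(1j ** k) * U[k] for k in range(n)]
    lam = 1j * x
    res = []
    for j in range(n - 1):  # central-difference rows
        left = -0.5 * p[j - 1] if j > 0 else 0
        res.append(left + 0.5 * p[j + 1] - lam * p[j])
    # BDF2 row
    res.append(0.5 * p[n - 3] - 2 * p[n - 2] + 1.5 * p[n - 1] - lam * p[n - 1])
    return res

n, x = 6, 0.3 - 0.2j  # arbitrary test point, not a root
res = residual(n, x)
print("max interior residual:", max(abs(r) for r in res[:-1]))
print("last row + i^(n-3) * q(x):", abs(res[-1] + (1j ** (n - 3)) * char_poly(n, x)))
```

Both printed quantities should be at rounding-error level, which is consistent with the theorem: only the last row imposes a condition on $x$.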

Direct PinT Algorithm

For nonlinear problems, simplified Newton iteration (SNI) is used: $(B \otimes I_x + I_t \otimes A^k)u^{k+1} = b + [(I_t \otimes A^k)u^k - F(u^k)]$

where $A^k = \frac{1}{n}\sum_{j=1}^n \nabla f(u_j^k)$ is the average Jacobian matrix.
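The simplified Newton iteration can be sketched for a scalar problem, where $I_x = 1$ and the averaged Jacobian $A^k$ reduces to a single number. The example below uses $f(u) = u^3$ with $u_0 = 0.5$ on $[0, 1]$, for which the exact solution is $u(t) = u_0/\sqrt{1 + 2u_0^2 t}$. The test problem, the stopping tolerance, the function names, and the dense solver are assumptions chosen for illustration, and each linear system is solved directly rather than by diagonalization.

```python
import math

def build_B(n, dt):
    """Central differences in rows 1..n-1, BDF2 in row n (n >= 3)."""
    B = [[0.0] * n for _ in range(n)]
    for j in range(n - 1):
        if j > 0:
            B[j][j - 1] = -0.5 / dt
        B[j][j + 1] = 0.5 / dt
    B[n - 1][n - 3], B[n - 1][n - 2], B[n - 1][n - 1] = 0.5 / dt, -2.0 / dt, 1.5 / dt
    return B

def solve(M, rhs):
    """Dense Gaussian elimination with partial pivoting (small demo only)."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(M, rhs)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(aug[i][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for i in range(c + 1, n):
            f = aug[i][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[i][k] -= f * aug[c][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def sni(f, df, u0, T, n, tol=1e-10, max_iter=100):
    """Simplified Newton iteration with one averaged Jacobian per sweep."""
    dt = T / n
    B = build_B(n, dt)
    b = [0.0] * n
    b[0] = u0 / (2 * dt)          # initial value moved to the right-hand side
    u = [u0] * n                  # constant initial guess
    for it in range(1, max_iter + 1):
        a = sum(df(v) for v in u) / n                      # A^k (scalar here)
        M = [[B[i][j] + (a if i == j else 0.0) for j in range(n)] for i in range(n)]
        rhs = [b[j] + a * u[j] - f(u[j]) for j in range(n)]
        new = solve(M, rhs)
        step = max(abs(p - q) for p, q in zip(new, u))
        u = new
        if step < tol:
            return u, it
    return u, max_iter

u, its = sni(lambda v: v ** 3, lambda v: 3 * v * v, u0=0.5, T=1.0, n=64)
exact = 0.5 / math.sqrt(1 + 2 * 0.25 * 1.0)
print("iterations:", its, "error at t = 1:", abs(u[-1] - exact))
```

Because the same scalar $a$ multiplies every time level, the coefficient matrix keeps the form $B + aI$ in each sweep, which is what allows the diagonalization of $B$ to be reused across iterations.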

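With the spectral decomposition $B = VDV^{-1}$, where $D = \text{diag}(\lambda_1,\ldots,\lambda_n)$, and with $r^k$ denoting the right-hand side of the SNI system above, each iteration reduces to three steps, of which Step b consists of $n$ independent solves that can run in parallel:

$\tilde{g} = (V^{-1} \otimes I_x)r^k$ (Step a)
$(\lambda_j I_x + A^k)z_j = \tilde{g}_j$, $j = 1,2,\ldots,n$ (Step b)
$u^{k+1} = (V \otimes I_x)z$ (Step c)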
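The three steps can be sketched for a scalar spatial operator ($I_x = 1$, $A^k = a$). The snippet is a rough illustration under several assumptions. The roots $x_j$ are found with a generic Durand-Kerner iteration rather than any method from the paper. The eigenvalues of $B$ are taken as $ix_j/\Delta t$ to account for the $\frac{1}{\Delta t}$ factor in $B$. Step a is done by a dense solve with $V$ instead of the paper's fast formula for $V^{-1}$. All function names are hypothetical. The result is compared with a direct solve of $(B + aI)u = r$.

```python
def cheb(n_max, x):
    """Chebyshev T_k(x), U_k(x) for k = 0..n_max via the recurrence."""
    T, U = [1, x], [1, 2 * x]
    for _ in range(2, n_max + 1):
        T.append(2 * x * T[-1] - T[-2])
        U.append(2 * x * U[-1] - U[-2])
    return T, U

def q(n, x):
    """Left-hand side of the characteristic equation in Theorem 3.1."""
    T, U = cheb(n, x)
    return U[n - 1] + 1j * U[n - 2] - 1j * T[n] + T[n - 1]

def roots(n, iters=500):
    """Durand-Kerner iteration (generic root finder assumed for this sketch)."""
    lead = -1j * 2 ** (n - 1)  # leading coefficient of q, from -i*T_n
    xs = [(0.4 + 0.9j) ** k for k in range(n)]
    for _ in range(iters):
        nxt = []
        for i, xi in enumerate(xs):
            d = lead
            for j, xj in enumerate(xs):
                if j != i:
                    d *= xi - xj
            nxt.append(xi - q(n, xi) / d)
        xs = nxt
    return xs

def solve(M, rhs):
    """Dense Gaussian elimination with partial pivoting (complex-safe)."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(M, rhs)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(aug[i][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for i in range(c + 1, n):
            f = aug[i][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[i][k] -= f * aug[c][k]
    x = [0j] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def build_B(n, dt):
    """Central differences in rows 1..n-1, BDF2 in row n (n >= 3)."""
    B = [[0.0] * n for _ in range(n)]
    for j in range(n - 1):
        if j > 0:
            B[j][j - 1] = -0.5 / dt
        B[j][j + 1] = 0.5 / dt
    B[n - 1][n - 3], B[n - 1][n - 2], B[n - 1][n - 1] = 0.5 / dt, -2.0 / dt, 1.5 / dt
    return B

def three_step(n, dt, a, r):
    xs = roots(n)
    lam = [1j * x / dt for x in xs]                  # eigenvalues of B
    Us = [cheb(n, x)[1] for x in xs]
    V = [[(1j ** k) * Us[j][k] for j in range(n)] for k in range(n)]
    g = solve(V, r)                                  # Step a: g = V^{-1} r
    z = [g[j] / (lam[j] + a) for j in range(n)]      # Step b: independent over j
    return [sum(V[k][j] * z[j] for j in range(n)) for k in range(n)]  # Step c

n, dt, a = 8, 0.125, 1.0
r = [1.0 / (2 * dt)] + [0.0] * (n - 1)               # u_0 = 1 in the first row
u_pint = three_step(n, dt, a, r)
B = build_B(n, dt)
u_ref = solve([[B[i][j] + (a if i == j else 0.0) for j in range(n)] for i in range(n)], r)
err = max(abs(p - s) for p, s in zip(u_pint, u_ref))
print("max difference to direct solve:", err)
```

In a real parallel run the list comprehension in Step b would be distributed over processors, with one shifted spatial solve per eigenvalue. Here it is a scalar division only because the spatial operator was reduced to a number.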

Technical Innovations

Chebyshev Polynomial Connection: Establishes the connection between characteristic equations and Chebyshev polynomials to obtain explicit spectral decomposition
Condition Number Control: Proves $\text{Cond}_2(V) = \mathcal{O}(n^3)$ , significantly improving upon existing methods
Fast Algorithm: Designs an $\mathcal{O}(n^2)$ complexity algorithm for computing $V^{-1}$
Higher-order Extension: Extends the algorithm to second and third-order nonlinear equations

Experimental Setup

Numerical Experiment Configuration

Computing Environment: Intel(R) Core(TM) i7-14700K 3.40GHz processor, 32GB memory
Software Platform: MATLAB 2022a
Parallel Cores: Up to 20 cores for acceleration testing

Evaluation Metrics

CPU Time: Measured using MATLAB's tic/toc functions
Relative Error: $\omega = \frac{\|B - VDV^{-1}\|_F}{\|B\|_F}$
Condition Number: $\text{Cond}_2(V)$
Speedup Ratio: Comparison of computation times across different core counts

Comparison Methods

MATLAB built-in eig function for spectral decomposition
Traditional time-stepping methods (as baseline)

Experimental Results

Fast Spectral Decomposition Performance

n	MATLAB eig+mrdivide	Fast Algorithm	Speedup
32	0.002s	0.003s	0.67×
256	0.050s	0.023s	2.17×
1024	1.285s	0.306s	4.20×
4096	67.599s	8.626s	7.84×
8192	580.663s	62.270s	9.32×
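The speedup column can be reproduced from the two timing columns, and the two largest sizes give a rough empirical growth rate for each method. The snippet below is a small arithmetic check on the numbers in the table. The helper name `growth_exponent` is hypothetical, and a two-point fit on single timings is only a coarse indication of scaling.

```python
import math

# (n, eig + mrdivide time in s, fast algorithm time in s), copied from the table
rows = [
    (32, 0.002, 0.003),
    (256, 0.050, 0.023),
    (1024, 1.285, 0.306),
    (4096, 67.599, 8.626),
    (8192, 580.663, 62.270),
]

# Speedup = baseline time / fast-algorithm time, rounded as in the table
speedups = {n: round(t_eig / t_fast, 2) for n, t_eig, t_fast in rows}

def growth_exponent(n1, t1, n2, t2):
    """Empirical p in t ~ n^p between two measurements."""
    return math.log(t2 / t1) / math.log(n2 / n1)

p_eig = growth_exponent(4096, 67.599, 8192, 580.663)
p_fast = growth_exponent(4096, 8.626, 8192, 62.270)
print(speedups)
print("growth exponents (eig, fast):", round(p_eig, 2), round(p_fast, 2))
```

For $n = 32$ the ratio is below 1, so the fast algorithm only pays off at larger sizes in these measurements.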

Parallel Acceleration Performance

Experiments demonstrate:

More pronounced acceleration when the number of time steps $N_t$ (the $n$ of the analysis above) is larger
At $N_t = 2^9 = 512$, using 20 cores shows significant CPU time reduction compared to single-core execution
When core count exceeds 8, acceleration gradually diminishes (likely due to increased communication overhead)

Numerical Test Cases

Four numerical examples were tested:

Example 1: Two-dimensional nonlinear equation (Dirichlet boundary conditions)
Example 2: Two-dimensional Sine-Gordon equation
Example 3: Third-order linear evolution equation
Example 4: Third-order nonlinear evolution equation

All examples validate the algorithm's effectiveness and parallel acceleration capability.

Related Work

Temporal Parallel Methods

Iterative PinT algorithms: Parareal, MGRiT, PFASST and other methods perform well on strongly dissipative problems
Diagonalization methods: Maday and Rønquist first proposed diagonalization-based PinT algorithms
Improved methods: Including space-time discretization, low-rank approximation techniques, domain decomposition algorithms, etc.

Advantages of This Work

Compared to existing work, this paper:

Eliminates restrictions on the number of time steps $n$
Provides explicit spectral decomposition formulas
Extends the method to higher-order nonlinear equations
Provides rigorous condition number analysis

Conclusions and Discussion

Main Conclusions

Successfully extends the diagonalization PinT algorithm to first to third-order nonlinear time-evolution equations
Provides explicit diagonalization formulas $B = VDV^{-1}$ for the temporal discretization matrix $B$
Proves that the condition number of the eigenvector matrix is $\mathcal{O}(n^3)$
Designs a fast algorithm with $\mathcal{O}(n^2)$ complexity

Limitations

Condition Number Growth: Although improved compared to existing methods, the condition number still grows as $n^3$
Communication Overhead: Large-scale parallelization may be limited by communication overhead
Applicable Scope: Primarily applicable to problems with certain dissipative properties

Future Directions

Further optimize the computational algorithm for $V^{-1}$
Investigate extensions to higher-order differential equations
Explore methods to reduce condition number growth
Application research in wave equations, fluid dynamics, and other fields

In-depth Evaluation

Strengths

Rigorous Theory: Provides complete mathematical theoretical analysis, including explicit expressions for eigenvalues, eigenvectors, and condition number estimates
Methodological Innovation: Cleverly utilizes Chebyshev polynomials to establish connections in characteristic equations, obtaining analytical solutions
Practical Value: The algorithm demonstrates significant computational acceleration on large-scale problems
Strong Extensibility: Extension from first-order to third-order nonlinear equations demonstrates good generality

Weaknesses

Condition Number Issue: Despite improvements over existing methods, the $\mathcal{O}(n^3)$ condition number growth may still cause numerical instability in extremely large-scale problems
Experimental Limitations: Numerical experiments focus mainly on relatively simple model problems, lacking verification on complex engineering applications
Parallel Efficiency: Parallel efficiency decreases with increasing core count, requiring further optimization of communication strategies

Impact

Academic Contribution: Provides new theoretical tools and methods for the temporal parallel algorithms field
Application Prospects: Has important application value in scientific computing, engineering simulation, and other fields requiring solutions to large-scale time-evolution problems
Reproducibility: The paper provides detailed algorithm descriptions and mathematical derivations, facilitating reproduction and further research

Applicable Scenarios

Large-scale time-evolution problems: Particularly suitable for physical simulations requiring long-time integration
High-performance computing environments: Can fully leverage parallel advantages in multi-core or cluster environments
Scientific and engineering applications: Numerical simulations in solid mechanics, materials science, fluid mechanics, and other fields

References

The paper cites 44 related references, primarily including:

Lions, Maday, Turinici (2001): Pioneering work on the Parareal algorithm
Gander, Halpern et al.: Theoretical analysis of temporal parallel methods
Liu, Wu, Zhou et al.: Recent research on diagonalization PinT algorithms
Classical textbooks on Chebyshev polynomials and numerical linear algebra

Overall Assessment: This is a high-quality numerical analysis paper with contributions in both theoretical analysis and algorithm design. It addresses important limitations of existing diagonalization-based PinT algorithms and provides an effective parallel solution scheme for higher-order nonlinear time-evolution equations. Despite the limitations noted above, it offers both theoretical and practical value and advances the development of parallel-in-time algorithms.