The performance of relativistic particle pushers has long been a topic of interest in the field of computational plasma physics, particularly from the point of view of the particle-in-cell approach. Previous works undertaken to compare such integrators have predominantly targeted the ultra-relativistic regime. In this paper, we utilize a custom-built code to study the core run-times of the Boris, the Vay, and the Higuera-Cary particle pushers for low-, high-, and ultra-relativistic particles. This is followed by a comparison of the three integrators in terms of accuracy and error. A fitness parameter is then proposed that can serve as a one-stop value to determine which method is more suitable for a particular simulation setup. It is hoped that through knowledge of such intricacies, the choice for the integrator will be easier to make depending on the problem at hand.
A Comparative Analysis of Relativistic Particle Pushers vis-à-vis Computation Time & Accuracy Paper ID : 2410.03352Title : A Comparative Analysis of Relativistic Particle Pushers vis-à-vis Computation Time & AccuracyAuthors : Mohammad Yasir, Vikrant Saxena (Indian Institute of Technology Delhi)Classification : physics.plasm-ph (Plasma Physics)Publication Date : October 7, 2024Paper Link : https://arxiv.org/abs/2410.03352 The performance of relativistic particle pushers has been a research focus in computational plasma physics, particularly in particle-in-cell (PIC) methods. Previous comparative studies have primarily targeted the ultrarelativistic regime. This paper employs the self-developed code PaTriC to investigate the core runtime of three particle pushers—Boris, Vay, and Higuera-Cary—across non-relativistic, relativistic, and ultrarelativistic particle regimes, subsequently comparing the accuracy and errors of the three integrators. The paper proposes a "fitness parameter" that serves as a comprehensive metric for determining which method is most suitable for specific simulation scenarios.
Particle trajectory calculation is one of the most computationally intensive steps in relativistic plasma dynamics simulations. Multiple explicit integrators (Boris, Vay, Higuera-Cary, etc.) each have advantages and disadvantages, but systematic comparative studies across different relativistic regimes (non-relativistic, relativistic, ultrarelativistic) are lacking.
Computational Bottleneck : Plasma systems involve multiple temporal, energetic, and spatial scales (spatial scales spanning 10^8, temporal scales from femtoseconds to days), with limited computational resourcesBroad Applications : Particle trajectory calculations are not only used in plasma simulations but also applied to molecular dynamics, astrophysics, and other fieldsSpeed-Accuracy Trade-off : Requires finding optimal balance between computational speed and simulation accuracyResearch Bias : Previous comparative studies have primarily focused on the ultrarelativistic regime (e.g., Ripperda et al. 2017)Lack of Unified Standards : No unified evaluation metrics comprehensively considering computational cost and accuracyUnclear Application Scenarios : Researchers struggle to select the most appropriate integrator for specific problemsDevelop a comprehensive performance evaluation framework covering all relativistic regimes (LR, HR, UR) and propose quantitative metrics to help researchers select optimal integrators for specific applications.
Systematic Comparative Study : First comprehensive comparison of Boris, Vay, and Higuera-Cary pushers across three regimes: non-relativistic (LR), relativistic (HR), and ultrarelativistic (UR)Precise Computational Cost Analysis :Pure magnetic field: Boris (24 FLOPs), Vay (41 FLOPs), HC (38 FLOPs) Electromagnetic field: Boris (55 FLOPs), Vay (91 FLOPs), HC (88 FLOPs) Measured runtime data (600,000 iterations) Multi-dimensional Accuracy Assessment : Systematic evaluation of phase error, gyroradius error, relativistic factor γ error, and cross-field drift performanceFitness Parameter Proposal : Innovatively proposes fitness parameter: f = (1/κ)e^(-ε), where κ is computational cost and ε is the logarithm of relative error, providing quantitative criteria for integrator selectionCustom Code Development : Constructed PaTriC (Particle Tracker in C++) for single-particle simulation and rigorous analysisNumerical integration of relativistic charged particle trajectories in electromagnetic fields:
Input : Initial position x⃗, velocity u⃗ (u⃗ = γv⃗), electromagnetic fields E⃗ and B⃗, time step δtOutput : Particle position and velocity at each time stepConstraints : Maintain time reversibility, near-symplecticity, energy conservation (without electric field)All three methods are based on the staggered leapfrog framework with equations of motion:
d u ⃗ d t = q m ( E ⃗ + v ⃗ ˉ × B ⃗ ) \frac{d\vec{u}}{dt} = \frac{q}{m}(\vec{E} + \bar{\vec{v}} \times \vec{B}) d t d u = m q ( E + v ˉ × B )
d x ⃗ d t = v ⃗ ˉ \frac{d\vec{x}}{dt} = \bar{\vec{v}} d t d x = v ˉ
where γ = 1/√(1-(v/c)²), with the key difference being different choices for average velocity v̄.
Discrete form:
u ⃗ i + 1 = u ⃗ i + ( q / m ) ( E ⃗ i + 1 / 2 + v ⃗ ˉ × B ⃗ i + 1 / 2 ) d t \vec{u}_{i+1} = \vec{u}_i + (q/m)(\vec{E}_{i+1/2} + \bar{\vec{v}} \times \vec{B}_{i+1/2}) dt u i + 1 = u i + ( q / m ) ( E i + 1/2 + v ˉ × B i + 1/2 ) d t
Common symbol definitions:
f = qδt/(2m) ε⃗ = fE⃗ β⃗ = fB⃗ Γ(u⃗) = √(1 + u⃗·u⃗/c²) Mathematical Expression :
v ⃗ ˉ = ( 2 Γ ( u ⃗ e ) ) − 1 ( u ⃗ i + 1 + u ⃗ i ) \bar{\vec{v}} = (2\Gamma(\vec{u}_e))^{-1}(\vec{u}_{i+1} + \vec{u}_i) v ˉ = ( 2Γ ( u e ) ) − 1 ( u i + 1 + u i )
Update Steps :
u⃗_e = u⃗_i + ε⃗
τ⃗ = β⃗/Γ(u⃗_e)
s⃗ = 2τ⃗/(1 + τ⃗²)
u⃗_m = u⃗_e + (u⃗_e + (u⃗_e × τ⃗)) × s⃗
u⃗_{i+1} = u⃗_m + ε⃗
Computational Cost :
Pure magnetic field: 24 FLOPs Electromagnetic field: 55 FLOPs Characteristics :
Classical method, widely used in EPOCH, SMILEI, and other software Near-symplectic integrator with excellent accuracy Lowest computational cost Uses simplified form (original form more accurate but 46% higher cost) Design Motivation : Boris pusher exhibits spurious forces in non-trivial cases, leading to trajectory deviations in relativistic situations
Update Steps :
u⃗_{i+1/2} = u⃗_i + f(E⃗_{i+1/2} + v⃗_i × B⃗_{i+1/2})
u⃗_e = u⃗_{i+1/2} + ε⃗
u⃗_{i+1} = s(u⃗_e + (u⃗_e · t⃗)t⃗ + u⃗_e × t⃗)
where:
u* = u⃗_e · β⃗/c σ = (Γ(u⃗_e))² - β⃗² γ_{i+1} = √((σ + √(σ + 4(β⃗² + u*²)))/2) t⃗ = β⃗/γ_{i+1} s = 1/(1 + t⃗²) Computational Cost :
Pure magnetic field: 41 FLOPs Electromagnetic field: 91 FLOPs Characteristics :
Prevents spurious forces, maintains Lorentz invariance Excellent cross-field drift performance Higher computational cost Design Motivation : Seeks an integrator that correctly captures cross-field drift while preserving phase space volume (Vay does not preserve volume)
Average Velocity Definition :
v ⃗ ˉ = ( 1 / 2 γ ˉ ) − 1 ( u ⃗ i + u ⃗ i + 1 ) \bar{\vec{v}} = (1/2\bar{\gamma})^{-1}(\vec{u}_i + \vec{u}_{i+1}) v ˉ = ( 1/2 γ ˉ ) − 1 ( u i + u i + 1 )
where:
γ ˉ = Γ ( u ⃗ i + u ⃗ i + 1 2 ) \bar{\gamma} = \Gamma\left(\frac{\vec{u}_i + \vec{u}_{i+1}}{2}\right) γ ˉ = Γ ( 2 u i + u i + 1 )
Update Steps :
u⃗_e = u⃗_i + ε⃗
u⃗_m = s(u⃗_e + (u⃗_e · t⃗)t⃗ + u⃗_e × t⃗)
u⃗_{i+1} = u⃗_m + ε⃗ + (u⃗_m × t⃗)
Computational Cost :
Pure magnetic field: 38 FLOPs Electromagnetic field: 88 FLOPs Characteristics :
Preserves phase space volume Excellent cross-field drift performance Smaller phase error Computational cost between Boris and Vay Comprehensive Regime Coverage : Unlike previous studies concentrated on UR regime, this research covers γ ranging from 1.27 to 56.95Actual Runtime Measurement : Beyond theoretical FLOPs calculation, actual runtime measured on Intel Xeon processor for 600,000 iterationsMulti-dimensional Error Analysis :Phase error Gyroradius error Relativistic factor γ error Cross-field drift slope error Fitness Parameter Design :Comprehensively considers computational cost and error accumulation Uses error from 5 slowest process cycles for evaluation Time step set to 1/50 of fastest process cycle Exponential form enhances discrimination Positron, with charge sign choice not affecting accuracy results
Low-speed Relativistic (LR) :γ = 1.27282 ω = 6.90×10^11 rad/s Time step: dt ≈ T_c/18 High-speed Relativistic (HR) :γ = 3.589 T_c = 2.56545×10^-11 s Time step: dt ≈ T_c/25 Ultrarelativistic (UR) :γ = 56.95 T_c = 4.07×10^-10 s Time step: dt ≈ T_c/20 1. Magnetic Gyration :
Pure magnetic field: B⃗ = 5T (ẑ direction) Particle executes helical motion Evaluates trajectory accuracy, phase error, gyroradius error 2. Cross-Field Drift :
Magnetic field: B⃗ = 10^-4 T (ẑ direction) Electric field: E⃗ = 100 V/m (x̂ direction) Drift velocity: |v⃗_D| = E/B = 10^6 m/s (negative ŷ direction) UR case analyzed in drift reference frame (κ = 1.002) Processor : Intel Xeon W-1270 @ 3.40GHzMemory : 64GBCompiler : gcc 13.2.0Architecture : x86_64Core Runtime : Actual CPU time for 600,000 iterationsPhase Error : Δφ = φ_simulated - φ_analytical (radians)Gyroradius Relative Error : (r_simulated - r_analytical)/r_analyticalγ Relative Error : (γ_simulated - γ_analytical)/γ_analyticalCross-field Drift Slope Error : Slope deviation of ⟨y⟩ over timeFixed Time Step : Avoids variable step length breaking symplecticityCoarse Grid Testing : Uses T_c/18 to T_c/25 coarse grids to amplify errorsMultiple Run Averaging : Each integrator run 10 times with averaging to eliminate noiseData Output Frequency : Record once every 25,000 iterationsPure Magnetic Field (600,000 iterations):
Method Average Time/600 steps (ms) Total Time (ms) Boris 0.14162 141.94 Vay 0.20398 204.37 HC 0.19303 193.43
Electromagnetic Field (600,000 iterations):
Method Average Time/600 steps (ms) Total Time (ms) Increment (ms) Boris 0.21390 214.29 +72.4 Vay 0.24878 250.56 +46.2 HC 0.26081 262.51 +69.1
Key Findings :
Boris method fastest in all scenarios Vay and HC runtimes similar, consistent with theoretical expectations Electric field introduction affects different methods differently Magnetic Gyration (γ=1.27, 8 cycles):
Phase Error :
Boris: 0.489 rad Vay: 0.489 rad HC: 0.374 rad ✓ (optimal)Gyroradius Error (theoretical value 173.74 µm):
Boris: 0.057 µm (0.033%) Vay: 0.057 µm (0.033%) ✓ (optimal) HC: 0.45 µm (0.259%) γ Relative Error :
Boris: ~10^-9 ✓ Vay: ~10^-8 HC: ~10^-9 ✓ Cross-field Drift (7 cycles):
Phase Error :
Boris: 0.24 rad Vay: 0.24 rad HC: 0.17 rad ✓ (optimal)Drift Slope Error :
Method Slope Absolute Error Percentage Error Analytical -0.46631 - - Boris -0.46192 0.00439 0.942% Vay -0.46192 0.00439 0.941% ✓HC -0.46179 0.00452 0.969%
Magnetic Gyration (γ=3.589, 6 cycles):
Phase Error :
Boris: 0.1926 rad Vay: 0.1926 rad HC: 0.1583 rad ✓ (optimal)Gyroradius Relative Error (theoretical value 0.4083 mm):
Boris: 0.0325% Vay: 0.0325% ✓ HC: 0.1141% γ Relative Error : ~10^-6% (negligible for all methods)
Cross-field Drift (9 cycles):
Phase Error :
Boris: 0.3264 rad Vay: 0.3263 rad HC: 0.2663 rad ✓ (optimal)Drift Slope Error :
Method Slope Absolute Error Percentage Error Analytical -1.42000 - - Boris -1.38363 0.03636 2.561% Vay -1.38364 0.03635 2.560% ✓HC -1.38730 0.03269 2.302% ✓
Magnetic Gyration (γ=56.95, 9 cycles):
Phase Error :
Boris: 0.4173 rad Vay: 0.4173 rad HC: 0.3213 rad ✓ (optimal, significant difference)Gyroradius Error (theoretical value 7.4826 mm):
Boris: 0.00235 mm ✓ Vay: 0.00235 mm ✓ HC: 0.015 mm γ Error Behavior :
Boris and Vay exhibit staggered pattern HC method shows more pronounced stepping Vay initial stability inferior Cross-field Drift (drift reference frame, 8 cycles):
Phase Error :
Boris: 0.263 rad Vay: 0.263 rad HC: 0.194 rad ✓ (optimal)Gyroradius Relative Error (boosted frame):
Boris: 0.937 ✓ Vay: 0.939 HC: 1.1002 γ Relative Error (boosted frame):
Boris: ~0.1% ✓ HC: ~0.1% ✓ Vay: unstable, inferior Definition: f = (1/κ)e^(-ε), where:
κ: computational cost per iteration (FLOPs) ε: log₁₀(relative error over 5 cycles) Calculation Results :
Method Pure Magnetic Field Electromagnetic Field Cost (FLOPs) f Parameter Cost (FLOPs) f Parameter Boris 24 68.35 55 0.0308 Vay 41 19.91 91 0.0186 HC 38 43.17 88 0.0192
Interpretation :
Pure Magnetic Field : Boris f parameter far exceeds others, optimal choiceElectromagnetic Field : Boris still optimal but advantage reduced; Vay and HC comparablef parameter successfully synthesizes computational cost and accuracy, providing quantitative selection basis Computational Efficiency :Boris fastest in all scenarios Vay and HC computational costs similar (theory and measurement consistent) Electric field impact differs across methods Accuracy Characteristics :Phase Error : HC optimal across all regimes and scenariosGyroradius Error : Boris and Vay superior in most casesCross-field Drift : Vay and HC generally superior to Boris, Vay slightly betterEnergy Conservation : All methods maintain good conservation in pure magnetic fieldRelativistic Effects :Larger γ makes relative phase error differences more pronounced UR regime shows more obvious spurious force effects in Boris and Vay HC's volume preservation advantage evident in HR and UR regimes Error Evolution :Phase error grows linearly Gyroradius error exhibits periodic oscillation γ error shows staggered pattern in UR regime Explicit Integrators :Boris (1971): Classical method, widely applied Vay (2008): Improved for relativistic cases Higuera-Cary (2017): Preserves structural properties Implicit Integrators :Cohen et al. (1982): Larger steps, fewer errors Pétri (2017): Fully implicit numerical integration Zhang et al. (2024): Preserves volume, energy, and Lorentz invariance in strong magnetic fields High-order Methods :Winkel et al. (2015): High-order Boris integrator Iterative methods achieving arbitrary order accuracy Comparative Studies :Ripperda et al. (2017) : Comprehensive ultrarelativistic regime comparisonVay (2008), Higuera-Cary (2017): Scenario-specific comparisons This paper: First coverage of LR, HR, UR three regimes Coverage Range : Comprehensive analysis from γ=1.27 to γ=56.95Measured Data : Actual runtime on real hardware, not just theoretical analysisUnified Standard : Proposes fitness parameter f as selection criterionPractical Orientation : Clear method recommendations for different scenariosComputational Efficiency Ranking : Boris > HC ≈ Vay (all scenarios)Application Recommendations :Pure Magnetic Gyration : Boris (fastest and sufficiently accurate)Cross-field Drift : Vay (optimal accuracy) or HC (volume preservation)Computationally Constrained Scenarios : Boris (efficiency priority)Accuracy Priority Scenarios : HC (small phase error) or Vay (precise drift)HC Method Evaluation :Excellent phase error performance Higher computational cost, other errors not necessarily superior Volume preservation property valuable in specific applications Fitness Parameter :Successfully quantifies "cost-effectiveness" Boris advantage clear in pure magnetic field (f=68.35 vs 19.91/43.17) Boris still optimal in electromagnetic field but gap narrowed No "Universal" Method : Different application scenarios require different choices; this research provides decision basisTest Range :Single-particle dynamics only Uniform fields, no field gradients or time-varying effects Fixed time step, no adaptive step testing Scenario Constraints :No extreme nonlinear cases tested No particle-particle interactions considered Lack of systematic testing in actual PIC simulations Fitness Parameter :Energy error as sole metric potentially incomplete Weight choices (1/κ and e^(-ε)) lack rigorous theoretical basis Universal applicability across applications needs more verification Implementation Dependence :Runtime depends on specific implementation and hardware Compiler optimization may alter relative performance GPU accelerators not tested Adaptive Time Stepping : Study symplectic-preserving adaptive algorithms and performanceMulti-particle Systems : Validate conclusions in actual PIC codesMore Integrators : Include implicit and high-order methods in comparisonFitness Parameter Optimization :Explore different weighting schemes Multi-metric synthesis (beyond energy) Application-specific customization Extreme Condition Testing :Strongly time-varying fields Extreme relativistic regime (γ > 1000) Strong nonlinear effects Strong Systematicity :First coverage of complete LR, HR, UR regimes Multi-dimensional error analysis (phase, radius, γ, drift) Combines theoretical analysis with measured data High Practical Value :Provides clear application recommendations Fitness parameter f enables quick decision-making Measured runtime valuable for practical applications Technical Rigor :Detailed FLOPs calculation (Table 1) Multiple run averaging eliminates noise Coarse grid amplifies errors for analysis Clear Presentation :Complete mathematical derivations Rich figures and tables (18 figures, 5 tables) Clear logical structure Open-source Potential : Self-developed PaTriC code can provide tools for communityTheoretical Depth :Lacks theoretical analysis of error propagation Fitness parameter f mathematical foundation insufficient No exploration of symplecticity-error relationship Experimental Limitations :Large gap between single-particle simulation and actual PIC Uniform field assumption overly idealized Limited test cases (2 scenarios per regime) Comparison Gaps :Original Boris form (tan form) not included No implicit method comparison Boris high-order variants not tested Statistical Analysis :Lacks statistical significance testing of errors 10 runs may insufficient for complete performance characterization Standard deviations or confidence intervals not reported Practical Details :No discussion of implementation in actual PIC codes Lacks systematic parameter sweeps over field strength and particle energy Multi-particle extrapolation (Section 3.3) oversimplified Academic Contribution :Fills systematic research gap in LR and HR regimes Fitness parameter concept may be adopted by subsequent research Provides quantitative criteria for integrator selection Practical Value :Plasma simulation researchers can directly apply recommendations Guides PIC code development Particularly valuable in computationally constrained scenarios Reproducibility :Detailed method description Complete parameter specification Code open-sourcing would greatly enhance reproducibility Limitations :Single-particle conclusions to multi-particle generalization needs verification Hardware and implementation-specific dependencies Fitness parameter universal applicability needs testing Strongly Recommended :
Plasma acceleration simulations (positron acceleration, etc.) Magnetic confinement fusion simulations Astrophysical relativistic jets Teaching and prototype development Applicable but Requires Caution :
Large-scale PIC simulations (needs further verification) Adaptive time step scenarios (requires modification) Strongly time-varying or non-uniform fields (extrapolation validity unknown) Not Suitable :
Scenarios requiring long-term energy conservation (consider implicit methods) Extreme relativistic regime (γ > 100, beyond test range) Scenarios with strict volume preservation requirements (HC potentially superior but insufficiently evaluated) Boris (1971) : Original Boris method proposalVay (2008) : Physics of Plasmas 15:056701 - Vay integratorHiguera & Cary (2017) : ICOPS 2017 - HC integratorRipperda et al. (2017) : ApJS 235 - Comprehensive ultrarelativistic regime comparisonQin et al. (2013) : Physics of Plasmas 20:084503 - Boris method advantages analysisZenitani & Umeda (2018) : Physics of Plasmas 25(11) - Boris simplified form analysisOverall Assessment : This is a high-quality, practically-oriented comparative research paper with strong systematicity and clear value to the plasma simulation community. The fitness parameter proposal demonstrates innovation, though theoretical foundations could be strengthened. Main shortcomings lie in the gap between single-particle simulation and practical applications, and lack of deeper theoretical analysis. Recommended future work includes validating conclusions in actual PIC codes and open-sourcing the PaTriC code to enhance impact.