We present a strategy to accelerate molecular dynamics simulations using foundation neural network models. To do so, we apply a dual-level neural network multi-time-step (MTS) strategy where the target accurate potential is coupled to a simpler but faster model obtained via a distillation process. The 3.5 Å-cutoff distilled model is sufficient to capture the fast-varying forces, i.e. mainly bonded interactions, from the accurate potential, allowing its use in a reversible reference system propagator algorithm (RESPA)-like formalism. The approach conserves accuracy, preserving both static and dynamical properties, while requiring the costly model to be evaluated only every 3 to 6 fs depending on the system. Consequently, large simulation speedups over standard 1 fs integration are observed: 4-fold in homogeneous systems and 2.7-fold in large solvated proteins. Such a strategy is applicable to any neural network potential and reduces their performance gap with classical force fields.
Accelerating Molecular Dynamics Simulations with Foundation Neural Network Models using Multiple Time-Step and Distillation
This paper proposes a strategy for accelerating molecular dynamics simulations using foundation neural network models. The method employs a dual-level neural network multiple time-step (MTS) strategy, coupling the accurate target potential with a simpler yet faster model obtained through a distillation process. A distilled model with a 3.5 Å cutoff is sufficient to capture the rapidly varying forces of the accurate potential (primarily bonded interactions), enabling its use in a reversible reference system propagator algorithm (RESPA)-like framework. The method preserves both static and dynamical properties while requiring evaluation of the expensive model only every 3 to 6 fs, depending on the system. Consequently, significant simulation acceleration is observed compared to standard 1 fs integration: 4-fold in homogeneous systems and 2.7-fold in large solvated proteins.
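The relation between the outer time step and the achievable speedup can be illustrated with a rough cost estimate. The sketch below rests on an assumed cost model that is not taken from the paper: single-step integration costs one large-model evaluation per reference step, while the MTS scheme costs one large-model evaluation plus `n_inner` small-model evaluations per outer step. The function name `mts_speedup` and the numbers in the usage line are hypothetical.

```python
def mts_speedup(dt_outer_fs, n_inner, cost_ratio, dt_ref_fs=1.0):
    """Estimate the speedup of a dual-level MTS scheme over single-step integration.

    Assumed cost model (illustrative only, not from the paper):
      - reference: one large-model evaluation per dt_ref_fs
      - MTS: one large-model plus n_inner small-model evaluations per dt_outer_fs
    cost_ratio is cost(small model) / cost(large model), between 0 and 1.
    """
    if dt_outer_fs <= 0 or dt_ref_fs <= 0:
        raise ValueError("time steps must be positive")
    if n_inner < 1:
        raise ValueError("n_inner must be at least 1")
    if not 0.0 <= cost_ratio <= 1.0:
        raise ValueError("cost_ratio must lie in [0, 1]")

    # Cost per femtosecond of simulated time, in units of one large-model call.
    ref_cost_per_fs = 1.0 / dt_ref_fs
    mts_cost_per_fs = (1.0 + n_inner * cost_ratio) / dt_outer_fs
    return ref_cost_per_fs / mts_cost_per_fs


# Made-up inputs: 6 fs outer step, 6 inner steps, small model 10x cheaper.
estimate = mts_speedup(dt_outer_fs=6.0, n_inner=6, cost_ratio=0.1)  # about 3.75
```

Under this model the speedup can never exceed the ratio of the outer step to the reference step, and it approaches that bound only when the distilled model is much cheaper than the accurate one.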
Although neural network potentials (NNPs) provide near quantum mechanical accuracy, their computational cost is significantly higher than traditional empirical potentials, limiting their application to large systems and long timescale simulations. The primary bottlenecks are:
Time integration requirements for high-frequency motions: Molecular dynamics must employ small timesteps (0.5-1 fs) to resolve high-frequency motions such as bond vibrations
Expensive force evaluations: Small time steps require many force evaluations, each of which is costly for an ML model
Performance gap with classical force fields: The computational cost of NNPs hinders their widespread adoption
Multiple time-step (MTS) integrators have proven effective in classical molecular simulations but have not yet been adapted to the ML potential domain. This research aims to:
Develop the first RESPA-based MTS scheme applicable to ML potentials
Implement an efficient MTS scheme using multiple neural networks of different complexity and inference cost
Reduce the performance gap between NNPs and classical force fields
First MTS scheme for ML potentials: Proposes the first RESPA-based multiple time-step integration scheme specifically designed for machine learning potentials
Knowledge distillation strategy: Develops two distillation strategies (system-specific and general models) to create fast short-range models
Significant computational acceleration: Achieves 4-fold (homogeneous systems) and 2.7-fold (protein-ligand complexes) acceleration while maintaining accuracy
Broad applicability: The strategy is applicable to any neural network potential
Complete implementation and validation: Implemented in the FeNNol library and Tinker-HP package, validated across multiple systems
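The idea behind distillation, training a cheap short-range model on labels produced by the accurate one, can be sketched with a one-dimensional toy. Everything here is an assumption made for illustration: the `teacher_force` function, the linear student, the least-squares fit, and the arbitrary units. It does not reproduce the architectures or loss functions used in the paper.

```python
import random


def teacher_force(r):
    """Hypothetical 'accurate' model: stiff bond term plus a weak, smooth tail."""
    return -500.0 * (r - 1.0) + 0.5 / (r * r)


def distill_linear(samples):
    """Fit F(r) ~ a*r + b by ordinary least squares on teacher-labelled samples."""
    n = len(samples)
    mean_r = sum(r for r, _ in samples) / n
    mean_f = sum(f for _, f in samples) / n
    var_r = sum((r - mean_r) ** 2 for r, _ in samples)
    cov_rf = sum((r - mean_r) * (f - mean_f) for r, f in samples)
    a = cov_rf / var_r
    b = mean_f - a * mean_r
    return a, b


def make_student(a, b, cutoff):
    """Return a short-range student force that vanishes beyond the cutoff."""
    def student_force(r):
        return a * r + b if r < cutoff else 0.0
    return student_force


# Label random bond lengths with the teacher, then fit the student to them.
rng = random.Random(0)
train = [(r, teacher_force(r)) for r in (rng.uniform(0.8, 1.2) for _ in range(200))]
a, b = distill_linear(train)
student_force = make_student(a, b, cutoff=3.5)

# Residual the expensive model would still have to supply on the outer time step.
max_residual = max(abs(teacher_force(r) - student_force(r)) for r, _ in train)
```

In this toy the student absorbs the stiff bonded part of the force, so the residual between teacher and student is small and smooth, which is the property a RESPA-like splitting relies on.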
Algorithm 1: MTS Integration Step with FENNIX Force Splitting
if first step then
    Fsmall ← FENNIXsmall(x)
    F ← FENNIXlarge(x)
end if
v ← v + Δt/(2m) · (F - Fsmall)
for i = 1 to nslow do
    v ← v + Δt/(2m·nslow) · Fsmall
    x ← x + Δt/(2·nslow) · v
    v ← thermo(v, Δt/nslow)    # apply thermostat
    x ← x + Δt/(2·nslow) · v
    Fsmall ← FENNIXsmall(x)
    v ← v + Δt/(2m·nslow) · Fsmall
end for
F ← FENNIXlarge(x)
v ← v + Δt/(2m) · (F - Fsmall)
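The algorithm above can be sketched in plain Python for a single one-dimensional particle. This is a minimal illustration, not the FeNNol or Tinker-HP implementation: the two harmonic force functions stand in for the FENNIX models, the optional `thermostat` callable is a hypothetical placeholder, and all constants are in arbitrary units.

```python
def mts_step(x, v, f_large, f_small, force_large, force_small,
             mass, dt, n_slow, thermostat=None):
    """One RESPA-like MTS step for a 1D particle.

    f_large and f_small are the forces at the current x, carried over from
    the previous step (or computed once before the first step).
    """
    # Outer half kick with the slowly varying residual force.
    v += 0.5 * dt / mass * (f_large - f_small)

    dt_in = dt / n_slow
    for _ in range(n_slow):
        v += 0.5 * dt_in / mass * f_small   # inner half kick
        x += 0.5 * dt_in * v                # half drift
        if thermostat is not None:
            v = thermostat(v, dt_in)        # optional thermostat (hypothetical hook)
        x += 0.5 * dt_in * v                # half drift
        f_small = force_small(x)            # cheap model at every inner step
        v += 0.5 * dt_in / mass * f_small   # inner half kick

    f_large = force_large(x)                # expensive model once per outer step
    v += 0.5 * dt / mass * (f_large - f_small)
    return x, v, f_large, f_small


# Toy stand-ins for the two models: a stiff "bonded" spring and the full force.
K_FAST, K_SLOW, MASS = 100.0, 1.0, 1.0


def force_small(x):
    return -K_FAST * x


def force_large(x):
    return -(K_FAST + K_SLOW) * x


def total_energy(x, v):
    return 0.5 * MASS * v * v + 0.5 * (K_FAST + K_SLOW) * x * x


x, v = 1.0, 0.0
f_small, f_large = force_small(x), force_large(x)   # the "first step" branch
e0 = total_energy(x, v)
for _ in range(1000):
    x, v, f_large, f_small = mts_step(x, v, f_large, f_small,
                                      force_large, force_small,
                                      mass=MASS, dt=0.04, n_slow=4)
e1 = total_energy(x, v)
```

With no thermostat the step is time-reversible, so the total energy of this toy system should fluctuate within a narrow band rather than drift, even though the full force is evaluated only once every four inner steps.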
This paper cites 49 important references covering classical and recent work in key areas including neural network potentials, multiple time-step methods, and knowledge distillation, providing a solid theoretical foundation for the research.
Overall Assessment: This is a high-quality research paper that successfully introduces multiple time-step methods into the machine learning potential domain, providing an innovative and practical solution to address the computational efficiency challenges of NNPs. Despite some technical limitations, its pioneering contributions and significant practical value make it an important advance in the field.