2025-11-13T10:34:10.524110

Accelerating Molecular Dynamics Simulations with Foundation Neural Network Models using Multiple Time-Step and Distillation

Cattin, PlÃ©, Adjoua et al.

We present a strategy to accelerate molecular dynamics simulations using foundation neural network models. To do so, we apply a dual-level neural network multi-time-step (MTS) strategy where the target accurate potential is coupled to a simpler but faster model obtained via a distillation process. Thus, the 3.5 Ã-cutoff distilled model is sufficient to capture the fast varying forces, i.e. mainly bonded interactions, from the accurate potential allowing its use in a reversible reference system propagator algorithms (RESPA)-like formalism. The approach conserves accuracy, preserving both static and dynamical properties, while enabling to evaluate the costly model only every 3 to 6 fs depending on the system. Consequently, large simulation speedups over standard 1 fs integration are observed: 4-fold in homogeneous systems and 2.7-fold in large solvated proteins. Such a strategy is applicable to any neural network potential and reduces their performance gap with classical force fields.

academic

Accelerating Molecular Dynamics Simulations with Foundation Neural Network Models using Multiple Time-Step and Distillation

基本信息

论文ID: 2510.06562
标题: Accelerating Molecular Dynamics Simulations with Foundation Neural Network Models using Multiple Time-Step and Distillation
作者: Côme Cattin, Thomas Plé, Olivier Adjoua, Nicoläı Gouraud, Louis Lagardère, Jean-Philip Piquemal
分类: physics.chem-ph
发表时间: 2025年10月14日 (arXiv v2)
论文链接: https://arxiv.org/abs/2510.06562

摘要

本文提出了一种使用基础神经网络模型加速分子动力学模拟的策略。该方法采用双层神经网络多时间步长(MTS)策略，将目标精确势能与通过蒸馏过程获得的更简单但更快的模型耦合。3.5 Å截止的蒸馏模型足以捕获精确势能中快速变化的力（主要是成键相互作用），允许在可逆参考系统传播算法(RESPA)类似的形式中使用。该方法保持了准确性，保留了静态和动态性质，同时根据系统的不同，只需每3到6 fs评估一次昂贵的模型。因此，相比标准1 fs积分观察到了大幅的模拟加速：在均匀系统中4倍，在大型溶剂化蛋白质中2.7倍。

研究背景与动机

问题定义

神经网络势能(NNPs)虽然能提供接近量子力学的精度，但计算成本显著高于传统经验势能，这限制了它们在大系统和长时间尺度模拟中的应用。主要瓶颈在于：

高频运动的时间积分要求：分子动力学必须用小时间步长(0.5-1 fs)来解决高频运动如键振动
昂贵的力评估：ML模型的计算密集性导致大量昂贵的力评估
与经典力场的性能差距：NNPs的计算成本阻碍了其广泛应用

研究动机

多时间步长(MTS)积分器在经典分子模拟中已被证明有效，但尚未适配到ML势能领域。本研究旨在：

开发首个适用于ML势能的RESPA-based MTS方案
利用不同复杂度和推理成本的多个神经网络实现高效MTS方案
减少NNPs与经典力场之间的性能差距

核心贡献

首次实现ML势能的MTS方案：提出了首个针对机器学习势能的RESPA-based多时间步长积分方案
知识蒸馏策略：开发了两种蒸馏策略（系统特定模型和通用模型）来创建快速的短程模型
显著的计算加速：在保持精度的同时实现了4倍（均匀系统）和2.7倍（蛋白质-配体复合物）的加速
广泛适用性：该策略适用于任何神经网络势能，具有通用性
完整的实现和验证：在FeNNol库和Tinker-HP包中实现，并通过多种系统验证

方法详解

任务定义

本研究的任务是设计一种多时间步长积分方案，使用两个不同复杂度的神经网络势能：

输入：分子系统的坐标和速度
输出：加速的MD轨迹，保持与单时间步长方案相同的精度
约束：保持静态和动态性质的准确性

模型架构

双层神经网络设计

参考模型：FeNNix-Bio1(M) - 基于范围分离等变Transformer架构
- 感受野：11 Å（两次消息传递）
- 包含近程和远程注意力头
- 高精度但计算昂贵
快速模型：蒸馏的轻量级模型
- 感受野：3.5 Å（一次消息传递）
- 移除远程注意力头
- 专注于快变"成键"力
- 推理速度提升约10倍

BAOAB-RESPA积分方案

算法流程如下：

Algorithm 1: MTS Integration Step with FENNIX Force Splitting
1: if first step then
2:   Fsmall ← FENNIXsmall(x)
3:   F ← FENNIXlarge(x)
4: end if
5: v ← v + Δt/(2m) · (F - Fsmall)
6: for i = 1 to nslow do
7:   v ← v + Δt/(2m·nslow) · Fsmall
8:   x ← x + Δt/(2·nslow) · v
9:   v ← thermo(v, Δt/nslow)  # Apply thermostat
10:  x ← x + Δt/(2·nslow) · v
11:  Fsmall ← FENNIXsmall(x)
12:  v ← v + Δt/(2m·nslow) · Fsmall
13: end for
14: F ← FENNIXlarge(x)
15: v ← v + Δt/(2m) · (F - Fsmall)