GO-Diff: Data-free and amortized global structure optimization

Rønne, Vegge, Bhowmik

We introduce GO-Diff, a diffusion-based method for global structure optimization that learns to directly sample low-energy atomic configurations without requiring prior data or explicit relaxation. GO-Diff is trained from scratch using a Boltzmann-weighted score-matching loss, leveraging only the known energy function to guide generation toward thermodynamically favorable regions. The method operates in a two-stage loop of self-sampling and model refinement, progressively improving its ability to target low-energy structures. Compared to traditional optimization pipelines, GO-Diff achieves competitive results with significantly fewer energy evaluations. Moreover, by reusing pretrained models across related systems, GO-Diff supports amortized optimization - enabling faster convergence on new tasks without retraining from scratch.

Basic information

Paper ID: 2510.13448
Title: GO-Diff: Data-free and amortized global structure optimization
Authors: Nikolaj Rønne, Tejs Vegge, Arghya Bhowmik (Technical University of Denmark)
Categories: physics.comp-ph cond-mat.dis-nn cond-mat.mtrl-sci cs.CE
Published: October 15, 2025 (Preprint)
Paper link: https://arxiv.org/abs/2510.13448

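Discovery of new catalytic surfaces
Design of functional materials
Prediction of stable atomic configurations
Understanding of material properties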

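Limitations of existing methods

Traditional global optimization methods have the following problems:

High computational cost: random structure search (RSS), basin hopping, genetic algorithms, simulated annealing, and similar methods rely on local relaxation and gradient-based optimizers, which require many energy and force evaluations.
Confined to local optimization: they easily get trapped in local optima, which limits exploration of complex energy landscapes.
Data dependence: machine-learned interatomic potentials need carefully selected training data to capture the relevant minima; otherwise they can get stuck in self-reinforcing local minima.
Lack of transferability: existing methods struggle to reuse learned knowledge across related systems.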

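Research motivation

Diffusion models have shown promise for structure generation in molecular and materials science, but applying them to global optimization is challenging. The goal is to sample the rare low-energy configurations that correspond to the global minimum of the potential energy surface (PES), yet the data distribution of such structures is usually unknown or unavailable.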

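Core contributions

A data-free generative optimization method: it samples minima of the potential energy surface directly, without prior data or explicit relaxation.
A Boltzmann-weighted loss function: combined with an annealing strategy, it steers sampling toward low-energy regions while preserving exploration.
Amortized optimization: knowledge is reused by transferring pretrained models across related systems.
Validated sample efficiency: the method is more sample-efficient than classical search methods.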

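The model generates atomic structures through reverse diffusion
The energies of the generated structures are evaluated
The resulting samples are used to refine the model

A replay buffer $B = \{(x_0^{(i)}, E^{(i)})\}$ is maintained to store the generated configurations and their energies.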
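
The three steps and the replay buffer can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `toy_energy`, `ReplayBuffer`, `run_loop`, and the Gaussian "model" whose mean is moved to the best sample so far are all hypothetical stand-ins, chosen only so that the loop is runnable without a real diffusion model or energy calculator.

```python
import numpy as np

def toy_energy(x):
    """Hypothetical stand-in for the known energy function (not from the paper)."""
    return float(np.sum((x - 1.0) ** 2))

class ReplayBuffer:
    """Stores generated configurations x0 together with their energies E."""
    def __init__(self):
        self.configs = []
        self.energies = []

    def add(self, x0, energy):
        self.configs.append(np.asarray(x0, dtype=float))
        self.energies.append(float(energy))

    def __len__(self):
        return len(self.configs)

def run_loop(sample_fn, refine_fn, energy_fn, n_rounds, n_samples):
    """Alternate self-sampling and model refinement for n_rounds."""
    buffer = ReplayBuffer()
    for _ in range(n_rounds):
        for x0 in sample_fn(n_samples):      # 1. generate structures
            buffer.add(x0, energy_fn(x0))    # 2. evaluate their energies
        refine_fn(buffer)                    # 3. refine the model on the buffer
    return buffer

# Toy "model": a Gaussian sampler (assumption; stands in for reverse diffusion).
rng = np.random.default_rng(0)
state = {"mean": np.zeros(3)}

def sample_fn(n):
    return state["mean"] + 0.5 * rng.standard_normal((n, 3))

def refine_fn(buffer):
    # Crude refinement: recentre the sampler on the lowest-energy entry.
    best = int(np.argmin(buffer.energies))
    state["mean"] = buffer.configs[best]

buffer = run_loop(sample_fn, refine_fn, toy_energy, n_rounds=5, n_samples=8)
```

In a real setting, `sample_fn` would run the reverse diffusion process and `refine_fn` would take gradient steps on the training loss using the buffer contents; the buffer grows by `n_samples` entries per round.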

Boltzmann-weighted score matching

The core innovation is a Boltzmann-weighted score-matching loss:

$\mathcal{L}_{\theta}^{\mathrm{Boltzmann}} = \mathbb{E}_{t\sim U(0,1)}\left[\lambda(t)\,\mathbb{E}_{x_0\sim q,\; x_t\sim p_{t|0}(x_t|x_0)}\left[ w(E)\, \|s_\theta(x_t,t) - \nabla_{x_t}\log p_{t|0}(x_t|x_0)\|_2^2\right]\right]$
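
To make the structure of this loss concrete, here is a small NumPy sketch of a Monte Carlo estimate over a batch drawn from the buffer. Several pieces are assumptions, since this summary does not specify them: the weight is taken as $w(E) \propto \exp(-E / kT)$ normalized over the batch, the perturbation kernel $p_{t|0}$ is taken to be Gaussian with a geometric noise schedule, and $\lambda(t) = \sigma(t)^2$. The names `boltzmann_weights`, `weighted_dsm_loss`, `kT`, `sigma_min`, and `sigma_max` are hypothetical.

```python
import numpy as np

def boltzmann_weights(energies, kT):
    """Assumed form w(E) proportional to exp(-E / kT), normalized over the batch."""
    e = np.asarray(energies, dtype=float)
    logits = -(e - e.min()) / kT      # shift by the minimum for numerical stability
    w = np.exp(logits)
    return w / w.sum()

def weighted_dsm_loss(score_fn, x0, energies, kT, rng, sigma_min=0.01, sigma_max=1.0):
    """Monte Carlo estimate of a Boltzmann-weighted score-matching loss."""
    n = x0.shape[0]
    t = rng.uniform(0.0, 1.0, size=(n, 1))              # t ~ U(0, 1)
    sigma = sigma_min * (sigma_max / sigma_min) ** t    # assumed noise schedule
    noise = rng.standard_normal(x0.shape)
    xt = x0 + sigma * noise                             # x_t ~ p_{t|0}(x_t | x_0)
    target = -(xt - x0) / sigma**2                      # grad of log p_{t|0} for a Gaussian kernel
    lam = sigma[:, 0] ** 2                              # assumed lambda(t) = sigma(t)^2
    sq_err = np.sum((score_fn(xt, t) - target) ** 2, axis=1)
    w = boltzmann_weights(energies, kT)                 # low-energy samples count more
    return float(np.sum(w * lam * sq_err))
```

Under this assumed weight, lowering `kT` concentrates the weight on the lowest-energy entries, while a large `kT` approaches uniform weighting; that is one plausible way to read the annealing strategy, not a confirmed detail of the method.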