强大而可扩展的不确定性估计，并对机器学习的原子势进行保形预测

论文标题

强大而可扩展的不确定性估计，并对机器学习的原子势进行保形预测

Robust and scalable uncertainty estimation with conformal prediction for machine-learned interatomic potentials

论文作者

Hu, Yuge, Musielewicz, Joseph, Ulissi, Zachary, Medford, Andrew J.

论文摘要

不确定性定量（UQ）对机器学习（ML）力场很重要，以评估预测过程中的置信度，因为ML模型并非固有的物理，因此可能产生灾难性不正确的预测。建立的A-posteriori UQ方法，包括集合方法，辍学方法，Delta方法和各种启发式距离指标具有局限性，例如由于模型重新训练而对大型模型的计算挑战。另外，不确定性估计通常不会严格校准。在这项工作中，我们建议将无分布的UQ方法（称为共形预测（CP））与神经网络潜在空间的距离相结合，以估计神经网络力场预测的能量的不确定性。我们在两个基本方面（校准和清晰度）上评估了该方法（CP+潜在）以及其他UQ方法，并在独立且分布相同的（I.I.D.）数据的假设下找到了这种方法。我们表明该方法对所选的超参数不敏感，并在I.I.D.时测试该方法的局限性。违反了假设。最后，我们证明，该方法可以轻松地应用于具有传统和图形神经网络体系结构的训练有素的神经网络力场，以在100万张图像的培训数据集中获得低计算成本的不确定性估计，以展示其可扩展性和可移植性。将CP方法与潜在距离结合起来提供了校准，清晰有效的策略，以估计神经网络力场的不确定性。此外，CP方法还可以作为校准其他方法估计的不确定性的有希望的策略。

Uncertainty quantification (UQ) is important to machine learning (ML) force fields to assess the level of confidence during prediction, as ML models are not inherently physical and can therefore yield catastrophically incorrect predictions. Established a-posteriori UQ methods, including ensemble methods, the dropout method, the delta method, and various heuristic distance metrics, have limitations such as being computationally challenging for large models due to model re-training. In addition, the uncertainty estimates are often not rigorously calibrated. In this work, we propose combining the distribution-free UQ method, known as conformal prediction (CP), with the distances in the neural network's latent space to estimate the uncertainty of energies predicted by neural network force fields. We evaluate this method (CP+latent) along with other UQ methods on two essential aspects, calibration, and sharpness, and find this method to be both calibrated and sharp under the assumption of independent and identically-distributed (i.i.d.) data. We show that the method is relatively insensitive to hyperparameters selected, and test the limitations of the method when the i.i.d. assumption is violated. Finally, we demonstrate that this method can be readily applied to trained neural network force fields with traditional and graph neural network architectures to obtain estimates of uncertainty with low computational costs on a training dataset of 1 million images to showcase its scalability and portability. Incorporating the CP method with latent distances offers a calibrated, sharp and efficient strategy to estimate the uncertainty of neural network force fields. In addition, the CP approach can also function as a promising strategy for calibrating uncertainty estimated by other approaches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题