Robust and scalable uncertainty estimation with conformal prediction for machine-learned interatomic potentials

Hu, Yuge and Musielewicz, Joseph and Ulissi, Zachary W and Medford, Andrew J (2022) Robust and scalable uncertainty estimation with conformal prediction for machine-learned interatomic potentials. Machine Learning: Science and Technology, 3 (4). 045028. ISSN 2632-2153

[thumbnail of Hu_2022_Mach._Learn.__Sci._Technol._3_045028.pdf] Text
Hu_2022_Mach._Learn.__Sci._Technol._3_045028.pdf - Published Version

Download (4MB)

Abstract

Uncertainty quantification (UQ) is important to machine learning (ML) force fields to assess the level of confidence during prediction, as ML models are not inherently physical and can therefore yield catastrophically incorrect predictions. Established a-posteriori UQ methods, including ensemble methods, the dropout method, the delta method, and various heuristic distance metrics, have limitations such as being computationally challenging for large models due to model re-training. In addition, the uncertainty estimates are often not rigorously calibrated. In this work, we propose combining the distribution-free UQ method, known as conformal prediction (CP), with the distances in the neural network's latent space to estimate the uncertainty of energies predicted by neural network force fields. We evaluate this method (CP+latent) along with other UQ methods on two essential aspects, calibration, and sharpness, and find this method to be both calibrated and sharp under the assumption of independent and identically-distributed (i.i.d.) data. We show that the method is relatively insensitive to hyperparameters selected, and test the limitations of the method when the i.i.d. assumption is violated. Finally, we demonstrate that this method can be readily applied to trained neural network force fields with traditional and graph neural network architectures to obtain estimates of uncertainty with low computational costs on a training dataset of 1 million images to showcase its scalability and portability. Incorporating the CP method with latent distances offers a calibrated, sharp and efficient strategy to estimate the uncertainty of neural network force fields. In addition, the CP approach can also function as a promising strategy for calibrating uncertainty estimated by other approaches.

Item Type: Article
Subjects: OA STM Library > Multidisciplinary
Depositing User: Unnamed user with email support@oastmlibrary.com
Date Deposited: 03 Jun 2024 12:48
Last Modified: 03 Jun 2024 12:48
URI: http://geographical.openscholararchive.com/id/eprint/1292

Actions (login required)

View Item
View Item