Machine learning potential

Beginning in the 1990s, researchers have employed machine learning methods to construct interatomic potentials that map atomic structures to their potential energies. Such machine learning potentials (MLPs) promised to fill the gap between density functional theory, a highly accurate but computationally intensive modeling method, and empirically derived or intuitively approximated potentials, which are far lighter computationally but substantially less accurate. Improvements in artificial intelligence technology have increased the accuracy of MLPs while lowering their computational cost, expanding machine learning's role in fitting potentials.[1][2] Early machine learning potentials used neural networks to tackle low-dimensional systems. While promising, these models could not systematically account for interatomic energy interactions; they could be applied to small molecules in a vacuum and to molecules interacting with frozen surfaces, but not much else, and even in these applications they often relied on force fields or potentials derived empirically or from simulations.[1] These models thus remained confined to academia.

Modern neural networks construct highly accurate and computationally light potentials because theoretical understanding of materials science has increasingly been built into their architectures and preprocessing. Almost all models are local, accounting for all interactions between an atom and its neighbors within some cutoff radius. Some nonlocal models exist, but these have remained experimental for almost a decade. For most systems, a reasonable cutoff radius yields highly accurate results.[1][3]
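To make the locality assumption concrete, the following is a minimal Python (NumPy) sketch, not drawn from any particular MLP package; the function name, cutoff value, and coordinates are invented for illustration. It finds the neighbors that a local model would allow to contribute to one atom's energy.

```python
import numpy as np

def neighbors_within_cutoff(positions, i, r_cut):
    """Indices of atoms within r_cut of atom i (excluding i itself).

    positions : (N, 3) array of Cartesian coordinates, in the same
    length unit as r_cut. Name and units are illustrative.
    """
    deltas = positions - positions[i]        # displacement vectors to atom i
    dists = np.linalg.norm(deltas, axis=1)   # Euclidean distances
    mask = (dists < r_cut) & (dists > 0.0)   # drop atom i and anything outside the sphere
    return np.nonzero(mask)[0]

# Example: only atoms 1 and 2 fall inside a 3.0-unit cutoff around atom 0,
# so a local model would build atom 0's energy from them alone.
pos = np.array([[0.0, 0.0, 0.0],
                [1.5, 0.0, 0.0],
                [0.0, 2.5, 0.0],
                [5.0, 5.0, 5.0]])
print(neighbors_within_cutoff(pos, 0, 3.0))  # -> [1 2]
```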

Almost all neural network potentials take atomic coordinates as input and output potential energies. In some models, these coordinates are first converted into atom-centered symmetry functions. A separate atomic neural network is then trained for each element; each atomic network is evaluated for every atom of its element in a given structure, and the resulting atomic energies are summed to give the total energy. This process, in particular the atom-centered symmetry functions, which encode translational, rotational, and permutational invariance, has greatly improved machine learning potentials by significantly constraining the networks' search space. Other models use a similar process but emphasize bonds over atoms, using pair symmetry functions and training one neural network per atom pair.[1][4]
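As a sketch of this pipeline, the Python example below computes a Behler-Parrinello-style radial symmetry function with a cosine cutoff, then evaluates one "network" per element and sums the atomic energies. The (eta, r_s) parameter values and the placeholder lambdas standing in for trained atomic neural networks are invented for the example.

```python
import numpy as np

def f_cut(r, r_cut):
    """Cosine cutoff: decays smoothly to zero at r_cut (Behler-Parrinello form)."""
    return np.where(r < r_cut, 0.5 * (np.cos(np.pi * r / r_cut) + 1.0), 0.0)

def radial_symmetry_fn(dists, eta, r_s, r_cut):
    """G2-type radial symmetry function for one atom, given neighbor distances.

    Depends only on the unordered set of interatomic distances, so it is
    invariant to translation, rotation, and permutation of like atoms.
    """
    return np.sum(np.exp(-eta * (dists - r_s) ** 2) * f_cut(dists, r_cut))

# One 'network' per element. These lambdas are placeholders for trained
# atomic neural networks; the weights are invented for the example.
atomic_nets = {
    "H": lambda g: 0.1 * g.sum(),
    "O": lambda g: -0.4 * g.sum(),
}

def total_energy(elements, descriptors):
    """Pool atomic energies: E_total is the sum of per-atom energies E_i(G_i)."""
    return sum(atomic_nets[el](g) for el, g in zip(elements, descriptors))

# Example: a water-like geometry and two illustrative (eta, r_s) pairs.
pos = np.array([[0.0, 0.0, 0.0], [0.96, 0.0, 0.0], [-0.24, 0.93, 0.0]])
elements = ["O", "H", "H"]
params = [(1.0, 0.0), (0.5, 1.0)]
r_cut = 6.0

descriptors = []
for i in range(len(pos)):
    d = np.linalg.norm(pos - pos[i], axis=1)
    d = d[d > 0.0]                                  # distances to neighbors only
    descriptors.append(np.array([radial_symmetry_fn(d, eta, r_s, r_cut)
                                 for eta, r_s in params]))

print(total_energy(elements, descriptors))
```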

Still other models, rather than relying on predetermined symmetry-dictating functions, learn their own descriptors. These models, called message-passing neural networks (MPNNs), are graph neural networks. Treating molecules as three-dimensional graphs (where atoms are nodes and bonds are edges), the model takes feature vectors describing the atoms as input and iteratively updates these feature vectors as information about neighboring atoms is processed through message functions and convolutions. The final feature vectors are then used to predict the potentials. This method gives the models more flexibility, often resulting in stronger and more generalizable potentials. In 2017, a deep tensor neural network, one of the first MPNN models, was used to calculate the properties of small organic molecules. Such technology was commercialized, leading to the development of Matlantis in 2022, which extracts properties through both the forward and backward passes: energies are predicted in the forward pass, and forces follow from the backward pass as gradients of the energy. Matlantis, which can simulate 72 elements, handle up to 20,000 atoms at a time, and execute calculations up to 20,000,000 times faster than density functional theory with almost indistinguishable accuracy, showcases the power of machine learning potentials in the age of artificial intelligence.[5][1][6][7]
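Below is a minimal PyTorch sketch of the message-passing idea; it is not the DTNN or Matlantis architecture, and the class names, layer sizes, cutoff, and toy atomic-number embedding are assumptions made for illustration. It updates per-atom feature vectors with distance-conditioned messages from graph neighbors, reads out a total energy in the forward pass, and recovers forces in the backward pass as the negative gradient of the energy with respect to positions.

```python
import torch

class MessagePassingLayer(torch.nn.Module):
    """One round of message passing: each atom's feature vector is updated
    with learned messages from its graph neighbors, conditioned on the
    interatomic distance. A deliberately simplified scheme, not DTNN itself."""

    def __init__(self, dim):
        super().__init__()
        self.message_fn = torch.nn.Linear(2 * dim + 1, dim)  # message from (h_i, h_j, r_ij)
        self.update_fn = torch.nn.Linear(2 * dim, dim)       # merges h_i with pooled messages

    def forward(self, h, positions, edges):
        msgs = torch.zeros_like(h)
        for i, j in edges:
            r_ij = torch.linalg.norm(positions[i] - positions[j]).reshape(1)
            m = torch.tanh(self.message_fn(torch.cat([h[i], h[j], r_ij])))
            msgs = msgs.index_add(0, torch.tensor([i]), m.unsqueeze(0))
        return torch.tanh(self.update_fn(torch.cat([h, msgs], dim=1)))

class ToyMPNNPotential(torch.nn.Module):
    """Atoms are graph nodes, pairs within a cutoff are edges, and the total
    energy is the sum of per-atom energies read out from the final features."""

    def __init__(self, dim=16, rounds=2):
        super().__init__()
        self.embed = torch.nn.Linear(1, dim)    # toy embedding of the atomic number
        self.layers = torch.nn.ModuleList(
            MessagePassingLayer(dim) for _ in range(rounds))
        self.readout = torch.nn.Linear(dim, 1)  # per-atom energy head

    def forward(self, z, positions, r_cut=3.0):
        n = len(z)
        edges = [(i, j) for i in range(n) for j in range(n)
                 if i != j and torch.linalg.norm(positions[i] - positions[j]) < r_cut]
        h = torch.tanh(self.embed(z.unsqueeze(1)))
        for layer in self.layers:
            h = layer(h, positions, edges)
        return self.readout(h).sum()

# Energies come from the forward pass; forces come from the backward pass,
# as the negative gradient of the energy with respect to atomic positions.
z = torch.tensor([8.0, 1.0, 1.0])                       # toy water-like input
pos = torch.tensor([[0.0, 0.0, 0.0], [0.96, 0.0, 0.0],
                    [-0.24, 0.93, 0.0]], requires_grad=True)
model = ToyMPNNPotential()
energy = model(z, pos)
forces = -torch.autograd.grad(energy, pos)[0]
```

In production MPNN potentials the message and update functions are deeper networks, neighbor sums are vectorized, and the learned descriptors are far richer; the explicit Python loops here are chosen only for readability.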

References

  1. Kocer, Emir; Ko, Tsz Wai; Behler, Jörg (2022). "Neural Network Potentials: A Concise Overview of Methods". Annual Review of Physical Chemistry 73: 163–86. doi:10.1146/annurev-physchem-082720-034254. PMID 34982580. Bibcode: 2022ARPC...73..163K.
  2. Blank, TB; Brown, SD; Calhoun, AW; Doren, DJ (1995). "Neural network models of potential energy surfaces". Journal of Chemical Physics 103 (10): 4129–37. doi:10.1063/1.469597. Bibcode: 1995JChPh.103.4129B.
  3. Ghasemi, SA; Hofstetter, A; Saha, S; Goedecker, S (2015). "Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network". Physical Review B 92 (4): 045131. doi:10.1103/PhysRevB.92.045131. Bibcode: 2015PhRvB..92d5131G.
  4. Behler, J; Parrinello, M (2007). "Generalized neural-network representation of high-dimensional potential-energy surfaces". Physical Review Letters 98 (14): 146401. doi:10.1103/PhysRevLett.98.146401. Bibcode: 2007PhRvL..98n6401B.
  5. Schütt, KT; Arbabzadah, F; Chmiela, S; Müller, KR; Tkatchenko, A (2017). "Quantum-chemical insights from deep tensor neural networks". Nature Communications 8: 13890. doi:10.1038/ncomms13890. PMID 28067221. Bibcode: 2017NatCo...813890S.
  6. Takamoto, So; Shinagawa, Chikashi; Motoki, Daisuke; Nakago, Kosuke (May 30, 2022). "Towards universal neural network potential for material discovery applicable to arbitrary combinations of 45 elements". Nature Communications 13 (1): 2991. doi:10.1038/s41467-022-30687-9. PMID 35637178. Bibcode: 2022NatCo..13.2991T.
  7. "Matlantis". https://matlantis.com/.