[PDF] Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error. | Semantic Scholar (2024)

Figures and Tables from this paper

  • table 1
  • table 3
  • figure 3
  • table 4

486 Citations

Chemical diversity in molecular orbital energy predictions with kernel ridge regression.
    A. StukeM. Todorović P. Rinke

    Chemistry, Materials Science

    The Journal of chemical physics

  • 2019

This work investigates the performance of machine learning with kernel ridge regression (KRR) for the prediction of molecular orbital energies on three large datasets: the standard QM9 small organic molecules set, amino acid and dipeptide conformers, and organic crystal-forming molecules extracted from the Cambridge Structural Database.

Machine Learning Prediction of Nine Molecular Properties Based on the SMILES Representation of the QM9 Quantum-Chemistry Dataset.
    Gabriel A. PinheiroJ. MuceliniM. SoaresR. PratiJ. L. D. Da SilvaM. G. Quiles

    Chemistry, Computer Science

    The journal of physical chemistry. A

  • 2020

This work evaluates a feed-forward neural network (FNN) model's prediction performance over five feature selection methods and nine ground-state properties from a public data set composed of ∼130k organic molecules and investigates the impact of choosing free-coordinate descriptors based on the Simplified Molecular Input Line Entry System (SMILES) representation.

  • 50
  • PDF
Applying machine learning techniques to predict the properties of energetic materials
    Daniel C. EltonZois BoukouvalasMark S. ButricoM. FugePeter W. Chung

    Materials Science, Chemistry

    Scientific Reports

  • 2018

This work presents a comprehensive comparison of machine learning models and several molecular featurization methods - sum over bonds, custom descriptors, Coulomb matrices, Bag of Bonds, and fingerprints - and concludes that the best featurizing was sum over bond counting, and the best model was kernel ridge regression.

Predicting Molecular Energy Using Force-Field Optimized Geometries and Atomic Vector Representations Learned from an Improved Deep Tensor Neural Network.
    Jianing LuCheng WangYingkai Zhang

    Chemistry, Computer Science

    Journal of chemical theory and computation

  • 2019

DFT-level accuracy of molecular energy prediction can be achieved using force-field optimized geometries and atomic vector representations learned from deep tensor neural network, and integrated molecular modeling and machine learning would be a promising approach to develop more powerful computational tools for molecular conformation analysis.

  • 25
  • Highly Influenced
  • PDF
BAND NN: A Deep Learning Framework for Energy Prediction and Geometry Optimization of Organic Small Molecules

A transferable and molecule size‐independent machine learning model bonds (B), angles (A), nonbonded (N) interactions, and dihedrals (D) neural network (BAND NN) based on a chemically intuitive representation inspired by molecular mechanics force fields is presented.

  • 27
  • PDF
Machine learning of free energies in chemical compound space using ensemble representations: Reaching experimental uncertainty for solvation.
    J. WeinreichN. BrowningO. A. von Lilienfeld

    Chemistry, Computer Science

    The Journal of chemical physics

  • 2021

This work presents a Free energy Machine Learning model applicable throughout chemical compound space and based on a representation that employs Boltzmann averages to account for an approximated sampling of configurational space and showcases its usefulness through analysis of solvation free energies.

Constant size descriptors for accurate machine learning models of molecular properties.
    Christopher R. CollinsGeoffrey J. GordonO. A. von LilienfeldD. Yaron

    Chemistry, Computer Science

    The Journal of chemical physics

  • 2018

Two different classes of molecular representations for use in machine learning of thermodynamic and electronic properties are studied, including the Coulomb matrix and Bag of Bonds, and Encoded Bonds, which encode such lists into a feature vector whose length is independent of molecular size.

  • 95
Transferable Multilevel Attention Neural Network for Accurate Prediction of Quantum Chemistry Properties via Multitask Learning
    Ziteng LiuLiqiang Lin Jing Ma

    Chemistry, Computer Science

    J. Chem. Inf. Model.

  • 2021

A multilevel attention neural network is proposed, named DeepMoleNet, to enable chemical interpretable insights being fused into multitask learning through weighting contributions from various atoms and taking the atom-centered symmetry functions (ACSFs) as the teacher descriptor.

Construction of Machine Learned Force Fields with Quantum Chemical Accuracy: Applications and Chemical Insights
    H. E. SaucedaStefan ChmielaI. PoltavskyKlaus-Robert MullerA. Tkatchenko

    Chemistry, Computer Science

    Machine Learning Meets Quantum Physics

  • 2020

The hierarchical pathway from generating the dataset of reference calculations to the construction of the machine learning model, and the validation of the physics generated by the model are discussed, and empirical evidence that a higher level of theory generates a smoother PES is provided.

Resolving Transition Metal Chemical Space: Feature Selection for Machine Learning and Structure-Property Relationships.
    J. JanetH. Kulik

    Chemistry, Computer Science

    The journal of physical chemistry. A

  • 2017

A series of revised autocorrelation functions (RACs) that encode relationships of the heuristic atomic properties (e.g., size, connectivity, and electronegativity) on a molecular graph are introduced to make these RACs amenable to inorganic chemistry.

...

...

65 References

Fast machine learning models of electronic and energetic properties consistently reach approximation errors better than DFT accuracy
    Felix A FaberLuke Hutchinson A. V. Lilienfeld

    Chemistry, Computer Science

  • 2017

Numerical evidence is presented that ML model predictions for all properties can reach an approximation error to DFT which is on par with chemical accuracy.

  • 18
Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity.
    B. HuangO. A. von Lilienfeld

    Chemistry, Computer Science

    The Journal of chemical physics

  • 2016

This work introduces a hierarchy of representations which meet uniqueness and target similarity criteria, and relies on interatomic many body expansions, as implemented in universal force-fields, including Bonding, Angular (BA), and higher order terms to systematically control target similarity.

Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.
    K. HansenG. Montavon Plus Environment

    Chemistry, Computer Science

    Journal of chemical theory and computation

  • 2013

A number of established machine learning techniques are outlined and the influence of the molecular representation on the methods performance is investigated, finding the best methods achieve prediction errors of 3 kcal/mol for the atomization energies of a wide variety of molecules.

  • 513
  • PDF
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach.
    R. RamakrishnanM. Rupp von Lilienfeld

    Chemistry, Computer Science

    Journal of chemical theory and computation

  • 2015

The transferability of the approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.

Many Molecular Properties from One Kernel in Chemical Space.
    R. RamakrishnanO. A. von Lilienfeld

    Chemistry, Computer Science

    Chimia

  • 2015

Property-independent kernels for machine learning models of arbitrarily many molecular properties enable the instantaneous generation of machine learning Models which can be systematically improved through the addition of more data.

Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations
    Pavlo O. DralO. A. von LilienfeldW. Thiel

    Chemistry, Computer Science

    Journal of chemical theory and computation

  • 2015

The ML-OM2 results show improved average accuracy and a much reduced error range compared with those of standard OM2 results, with mean absolute errors in atomization enthalpies dropping from 6.3 to 1.7 kcal/mol.

  • 79
  • PDF
Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space
    K. HansenFranziska Biegler A. Tkatchenko

    Chemistry, Computer Science

    The journal of physical chemistry letters

  • 2015

A systematic hierarchy of efficient empirical methods to estimate atomization and total energies of molecules and is achieved by a vectorized representation of molecules (so-called Bag of Bonds model) that exhibits strong nonlocality in chemical space.

  • 607
  • Highly Influential
  • PDF
Constant Size Molecular Descriptors For Use With Machine Learning
    Christopher R. CollinsGeoffrey J. GordonO. V. LilienfeldD. Yaron

    Computer Science, Chemistry

  • 2017

A set of molecular descriptors whose length is independent of molecular size is developed for machine learning models that target thermodynamic and electronic properties of molecules, which have the advantage of leading to models that may be trained on smaller molecules and then used successfully on larger molecules.

Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
    O. V. LilienfeldR. RamakrishnanM. RuppA. Knoll

    Chemistry, Computer Science

  • 2013

A fingerprint representation of molecules based on a Fourier series of atomic radial distribution functions that requires no preconceived knowledge about chemical bonding, topology, or electronic orbitals is introduced, suggesting its usefulness for machine learning models of molecular properties trained across chemical compound space.

Fast and accurate modeling of molecular atomization energies with machine learning.
    M. RuppA. TkatchenkoK. MüllerO. A. von Lilienfeld

    Computer Science, Chemistry

    Physical review letters

  • 2012

A machine learning model is introduced to predict atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only, and applicability is demonstrated for the prediction of molecular atomization potential energy curves.

...

...

Related Papers

Showing 1 through 3 of 0 Related Papers

    [PDF] Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error. | Semantic Scholar (2024)
    Top Articles
    Latest Posts
    Article information

    Author: Zonia Mosciski DO

    Last Updated:

    Views: 5732

    Rating: 4 / 5 (51 voted)

    Reviews: 82% of readers found this page helpful

    Author information

    Name: Zonia Mosciski DO

    Birthday: 1996-05-16

    Address: Suite 228 919 Deana Ford, Lake Meridithberg, NE 60017-4257

    Phone: +2613987384138

    Job: Chief Retail Officer

    Hobby: Tai chi, Dowsing, Poi, Letterboxing, Watching movies, Video gaming, Singing

    Introduction: My name is Zonia Mosciski DO, I am a enchanting, joyous, lovely, successful, hilarious, tender, outstanding person who loves writing and wants to share my knowledge and understanding with you.