Data-driven Coarse-grained Modeling of Non-equilibrium Systems

Data-driven coarse-grained modeling of polymers in solution with structural and dynamic properties conserved

Soft Matter ◽

10.1039/d0sm01019g ◽

2020 ◽

Vol 16 (36) ◽

pp. 8330-8344

Author(s):

Shu Wang ◽

Zhan Ma ◽

Wenxiao Pan

Keyword(s):

Structural Properties ◽

Dynamic Properties ◽

Data Driven ◽

Coarse Grained ◽

System P

We present data-driven coarse-grained (CG) modeling for polymers in solution, which conserves the dynamic as well as structural properties of the underlying atomistic system.

Implicit-solvent coarse-grained modeling for polymer solutions via Mori-Zwanzig formalism

Soft Matter ◽

10.1039/c9sm01211g ◽

2019 ◽

Vol 15 (38) ◽

pp. 7567-7582 ◽

Cited By ~ 3

Author(s):

Shu Wang ◽

Zhen Li ◽

Wenxiao Pan

Keyword(s):

Polymer Solutions ◽

Dynamic Properties ◽

Coarse Graining ◽

Coarse Grained ◽

Implicit Solvent ◽

Bottom Up ◽

System P

We present a bottom-up coarse-graining (CG) method to establish implicit-solvent CG modeling for polymers in solution, which conserves the dynamic properties of the reference microscopic system.

A Sparse Data-Driven Polynomial Chaos Expansion Method for Uncertainty Propagation

Volume 2A: 42nd Design Automation Conference ◽

10.1115/detc2016-59795 ◽

2016 ◽

Cited By ~ 1

Author(s):

F. Wang ◽

F. Xiong ◽

S. Yang ◽

Y. Xiong

Keyword(s):

Polynomial Chaos ◽

Uncertainty Propagation ◽

Computational Cost ◽

Expansion Method ◽

Polynomial Chaos Expansion ◽

Data Driven ◽

High Dimensional ◽

Chaos Expansion ◽

Least Angle Regression ◽

Structure Problem

The data-driven polynomial chaos expansion (DD-PCE) method is claimed to be a more general approach to uncertainty propagation (UP). However, as is common to all full PCE approaches, the number of polynomial terms in the full DD-PCE model grows rapidly with the dimension of the random inputs and the order of the PCE model, which greatly increases the computational cost, especially for high-dimensional and highly non-linear problems. Therefore, a sparse DD-PCE (sDD-PCE) is developed by employing the least angle regression technique and a stepwise regression strategy to adaptively remove insignificant terms. Comparative studies between the sparse and the full DD-PCE on three mathematical examples with random inputs given by raw data, common distributions, and nontrivial distributions, and on a ten-bar structure problem for UP, show that both methods generally yield comparably accurate results, while sDD-PCE significantly reduces the computational cost, especially for high-dimensional problems, which demonstrates the effectiveness and advantage of the proposed method.
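To make the growth in the number of polynomial terms concrete, the sketch below counts the basis terms of a full expansion. It assumes the common total-degree truncation, for which a PCE of order p in d random inputs has C(d + p, p) terms; the abstract does not state which truncation the authors use, and the function name `full_pce_terms` is hypothetical.

```python
from math import comb


def full_pce_terms(dim: int, order: int) -> int:
    """Count basis terms of a full PCE, ASSUMING total-degree truncation.

    Under that assumption the count is the binomial coefficient
    C(dim + order, order).
    """
    return comb(dim + order, order)


if __name__ == "__main__":
    # Print how the term count scales with input dimension and order.
    for dim in (2, 5, 10, 20):
        counts = [full_pce_terms(dim, order) for order in (2, 3, 4)]
        print(f"dim={dim:2d}  orders 2,3,4 -> {counts}")
```

Under this assumption the count rises from 6 terms (2 inputs, order 2) to 10626 terms (20 inputs, order 4), which is the kind of growth that motivates pruning insignificant terms.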

Acceleration of Chemical Kinetics Computation with the Learned Intelligent Tabulation (LIT) Method

Energies ◽

10.3390/en14237851 ◽

2021 ◽

Vol 14 (23) ◽

pp. 7851

Author(s):

Majid Haghshenas ◽

Peetak Mitra ◽

Niccolò Dal Santo ◽

David P. Schmidt

Keyword(s):

Regression Model ◽

Initial Conditions ◽

Computational Cost ◽

Machine Learning Algorithms ◽

Bayesian Optimization ◽

Data Driven ◽

High Dimensional ◽

Self Organizing Map ◽

Composition Space ◽

Wide Range

In this work, a data-driven methodology for modeling combustion kinetics, Learned Intelligent Tabulation (LIT), is presented. LIT aims to accelerate the tabulation of combustion mechanisms via machine learning algorithms such as Deep Neural Networks (DNNs). The high-dimensional composition space is sampled from high-fidelity simulations covering a wide range of initial conditions to train these DNNs. The input data are clustered into subspaces, and each subspace is trained with a DNN regression model targeted to a particular part of the high-dimensional composition space. This localized approach has proven to be more tractable than having a global ANN regression model, which fails to generalize across various composition spaces. The clustering is performed using an unsupervised method, Self-Organizing Map (SOM), which automatically subdivides the space. A dense network composed of fully connected layers is considered for the regression model, while the network hyperparameters are optimized using Bayesian optimization. A nonlinear transformation of the parameters is used to improve sensitivity to minor species and enhance the prediction of ignition delay. The LIT method is employed to model the chemical kinetics of zero-dimensional H2–O2 and CH4–air combustion. The data-driven method achieves good agreement with the benchmark method while being cheaper in terms of computational cost. LIT is naturally extensible to different combustion models such as flamelet and PDF transport models.

Parameterization of Divalent Cations for Coarse-Grained Simulations

10.26434/chemrxiv.11881716 ◽

2020 ◽

Author(s):

Florencia Klein ◽

Daniela Cáceres-Rojas ◽

Monica Carrasco ◽

Juan Carlos Tapia ◽

Julio Caballero ◽

...

Keyword(s):

Molecular Dynamics ◽

Metal Ions ◽

Molecular Dynamics Simulations ◽

Divalent Cations ◽

Computational Cost ◽

Data Bank ◽

Coarse Grained ◽

Interaction Parameters ◽

Dynamics Simulations ◽

Dynamical Description

Although molecular dynamics simulations allow for the study of interactions among virtually all biomolecular entities, metal ions still pose significant challenges to achieving an accurate structural and dynamical description of many biological assemblies. This is particularly the case for coarse-grained (CG) models. Although the reduced computational cost of CG methods often makes them the technique of choice for the study of large biomolecular systems, the parameterization of metal ions is still very crude or simply not available for the vast majority of CG force fields. Here, we show that incorporating statistical data retrieved from the Protein Data Bank (PDB) to set specific Lennard-Jones interactions can produce structurally accurate CG molecular dynamics simulations. Using this simple approach, we provide a set of interaction parameters for Calcium, Magnesium, and Zinc ions, which cover more than 80% of the metal-bound structures reported in the PDB. Simulations performed using the SIRAH force field on several protein and DNA systems show that, using the present approach, it is possible to obtain non-bonded interaction parameters that obviate the use of topological constraints.

BAND NN: A Deep Learning Framework For Energy Prediction and Geometry Optimization of Organic Small Molecules

10.26434/chemrxiv.9763094 ◽

2019 ◽

Author(s):

Siddhartha Laghuvarapu ◽

Yashaswi Pathak ◽

U. Deva Priyakumar

Keyword(s):

Machine Learning ◽

Density Functional ◽

Computational Cost ◽

Geometry Optimization ◽

Dft Methods ◽

Energy Prediction ◽

Machine Learning Model ◽

Equilibrium Structures ◽

High Level ◽

Non Equilibrium

Recent advances in artificial intelligence, along with the development of large datasets of energies calculated using quantum mechanical (QM)/density functional theory (DFT) methods, have enabled prediction of accurate molecular energies at reasonably low computational cost. However, machine learning models that have been reported so far require the atomic positions obtained from geometry optimizations using high-level QM/DFT methods as input in order to predict the energies, and do not allow for geometry optimization. In this paper, a transferable and molecule-size-independent machine learning model (BAND NN) based on a chemically intuitive representation inspired by molecular mechanics force fields is presented. The model predicts the atomization energies of equilibrium and non-equilibrium structures as a sum of energy contributions from bonds (B), angles (A), nonbonds (N) and dihedrals (D) with remarkable accuracy. The robustness of the proposed model is further validated by calculations that span the conformational, configurational and reaction space. The transferability of this model to systems larger than the ones in the dataset is demonstrated by performing calculations on select large molecules. Importantly, employing the BAND NN model, it is possible to perform geometry optimizations starting from non-equilibrium structures along with predicting their energies.

Computing Expectiles Using k-Nearest Neighbours Approach

Symmetry ◽

10.3390/sym13040645 ◽

2021 ◽

Vol 13 (4) ◽

pp. 645

Author(s):

Muhammad Farooq ◽

Sehrish Sarfraz ◽

Christophe Chesneau ◽

Mahmood Ul Hassan ◽

Muhammad Ali Raza ◽

...

Keyword(s):

Computational Cost ◽

Real Life ◽

Distance Measures ◽

Computational Time ◽

High Dimensional ◽

Test Error ◽

Nearest Neighbours ◽

Comparable Performance ◽

Asymmetric Least Squares ◽

Low Computational Cost

Expectiles have gained considerable attention in recent years due to their wide applications in many areas. In this study, the k-nearest neighbours approach, together with the asymmetric least squares loss function, called ex-kNN, is proposed for computing expectiles. Firstly, the effect of various distance measures on ex-kNN in terms of test error and computational time is evaluated. It is found that the Canberra, Lorentzian, and Soergel distance measures lead to minimum test error, whereas Euclidean, Canberra, and Average of (L1,L∞) lead to a low computational cost. Secondly, the performance of ex-kNN is compared with the existing packages er-boost and ex-svm for computing expectiles, based on nine real-life examples. Depending on the nature of the data, ex-kNN showed two to ten times better performance than er-boost and comparable performance with ex-svm regarding test error. Computationally, ex-kNN is found to be two to five times faster than ex-svm and much faster than er-boost, particularly in the case of high-dimensional data.
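The idea of combining k-nearest neighbours with the asymmetric least squares loss can be sketched as follows. This is a minimal illustration and not the authors' ex-kNN package: it assumes Euclidean distance (only one of the measures the abstract compares), and the function names `expectile` and `ex_knn_predict` are hypothetical. The tau-expectile of the neighbours' responses is found by iterating the weighted-mean condition that minimises the asymmetric squared loss.

```python
import math


def expectile(values, tau=0.5, tol=1e-12, max_iter=200):
    """Sample tau-expectile: minimiser of sum |tau - 1(v <= m)| * (v - m)^2.

    Solved by fixed-point iteration on the first-order condition,
    i.e. m equals the weighted mean with weight tau above m and 1 - tau below.
    """
    m = sum(values) / len(values)  # tau = 0.5 gives the ordinary mean
    for _ in range(max_iter):
        weights = [tau if v > m else 1.0 - tau for v in values]
        m_new = sum(w * v for w, v in zip(weights, values)) / sum(weights)
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    return m


def ex_knn_predict(x_train, y_train, x_query, k=3, tau=0.5):
    """Predict the tau-expectile at x_query from its k nearest neighbours.

    ASSUMPTION: Euclidean distance; other distance measures could be swapped in.
    """
    order = sorted(range(len(x_train)),
                   key=lambda i: math.dist(x_train[i], x_query))
    neighbours = [y_train[i] for i in order[:k]]
    return expectile(neighbours, tau)


if __name__ == "__main__":
    x_train = [[0.0], [1.0], [2.0], [10.0]]
    y_train = [0.0, 1.0, 2.0, 100.0]
    print(ex_knn_predict(x_train, y_train, [1.0], k=3, tau=0.5))
    print(ex_knn_predict(x_train, y_train, [1.0], k=3, tau=0.9))
```

With tau = 0.5 the prediction reduces to the plain k-NN mean; larger tau pulls the estimate towards the upper tail of the neighbours' responses.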

A new one-site coarse-grained model for water: Bottom-up many-body projected water (BUMPer). II. Temperature transferability and structural properties at low temperature

The Journal of Chemical Physics ◽

10.1063/5.0026652 ◽

2021 ◽

Vol 154 (4) ◽

pp. 044105

Author(s):

Jaehyeok Jin ◽

Alexander J. Pak ◽

Yining Han ◽

Gregory A. Voth

Keyword(s):

Low Temperature ◽

Structural Properties ◽

Coarse Grained ◽

Many Body ◽

Bottom Up ◽

Coarse Grained Model ◽

Water Bottom

Data-driven Feature Selection for Long Longitudinal Breadth and High Dimensional Dataset

Proceedings of the 2020 12th International Conference on Machine Learning and Computing ◽

10.1145/3383972.3383992 ◽

2020 ◽

Author(s):

Ji-Han Liu ◽

Cheng-Tse Wu ◽

Ta-Wei Chu ◽

Jyh-Shing Roger Jang

Keyword(s):

Feature Selection ◽

Data Driven ◽

High Dimensional ◽

Selection For

Application of non-equilibrium plasmas in top-down and bottom-up nanotechnologies and biomedicine

2010 27th International Conference on Microelectronics Proceedings ◽

10.1109/miel.2010.5490535 ◽

2010 ◽

Cited By ~ 1

Author(s):

Z. Lj. Petrovic ◽

M. Radmilovic-Radenovic ◽

P. Maguire ◽

M. Radetic ◽

N. Puac ◽

...

Keyword(s):

Top Down ◽

Bottom Up ◽

Non Equilibrium
