scholarly journals Unsupervised Machine Learning for Analysis of Coexisting Lipid Phases and Domain Growth in Biological Membranes

2019 ◽  
Author(s):  
Cesar A. López ◽  
Velimir V. Vesselinov ◽  
Sandrasegaram Gnanakaran ◽  
Boian S. Alexandrov

ABSTRACTPhase separation in mixed lipid systems has been extensively studied both experimentally and theoretically because of its biological importance. A detailed description of such complex systems undoubtedly requires novel mathematical frameworks that are capable to decompose and categorize the evolution of thousands if not millions of lipids involved in the phenomenon. The interpretation and analysis of Molecular Dynamics (MD) simulations representing temporal and spatial changes in such systems is still a challenging task. Here, we present a new unsupervised machine learning approach based on Nonnegative Matrix Factorization, called NMFk, that successfully extracts physically meaningful features from neighborhood profiles derived from coarse-grained MD simulations of ternary lipid mixture. Our results demonstrate that leveraging NMFk can (a) determine the role of different lipid molecules in phase separation, (b) characterize the formation of nano-domains of lipids, (c) determine the timescales of interest and (d) extract physically meaningful features that uniquely describe the phase separation with broad implications.

2019 ◽  
Vol 15 (11) ◽  
pp. 6343-6357 ◽  
Author(s):  
Cesar A. Löpez ◽  
Velimir V. Vesselinov ◽  
S. Gnanakaran ◽  
Boian S. Alexandrov

2017 ◽  
Author(s):  
Sabrina Jaeger ◽  
Simone Fulle ◽  
Samo Turk

Inspired by natural language processing techniques we here introduce Mol2vec which is an unsupervised machine learning approach to learn vector representations of molecular substructures. Similarly, to the Word2vec models where vectors of closely related words are in close proximity in the vector space, Mol2vec learns vector representations of molecular substructures that are pointing in similar directions for chemically related substructures. Compounds can finally be encoded as vectors by summing up vectors of the individual substructures and, for instance, feed into supervised machine learning approaches to predict compound properties. The underlying substructure vector embeddings are obtained by training an unsupervised machine learning approach on a so-called corpus of compounds that consists of all available chemical matter. The resulting Mol2vec model is pre-trained once, yields dense vector representations and overcomes drawbacks of common compound feature representations such as sparseness and bit collisions. The prediction capabilities are demonstrated on several compound property and bioactivity data sets and compared with results obtained for Morgan fingerprints as reference compound representation. Mol2vec can be easily combined with ProtVec, which employs the same Word2vec concept on protein sequences, resulting in a proteochemometric approach that is alignment independent and can be thus also easily used for proteins with low sequence similarities.


2021 ◽  
Vol 224 (2) ◽  
pp. S121-S122
Author(s):  
Ramamurthy Siripuram ◽  
Nathan R. Blue ◽  
Robert M. Silver ◽  
William A. Grobman ◽  
Uma M. Reddy ◽  
...  

BJS Open ◽  
2021 ◽  
Vol 5 (1) ◽  
Author(s):  
F Torresan ◽  
F Crimì ◽  
F Ceccato ◽  
F Zavan ◽  
M Barbot ◽  
...  

Abstract Background The main challenge in the management of indeterminate incidentally discovered adrenal tumours is to differentiate benign from malignant lesions. In the absence of clear signs of invasion or metastases, imaging techniques do not always precisely define the nature of the mass. The present pilot study aimed to determine whether radiomics may predict malignancy in adrenocortical tumours. Methods CT images in unenhanced, arterial, and venous phases from 19 patients who had undergone resection of adrenocortical tumours and a cohort who had undergone surveillance for at least 5 years for incidentalomas were reviewed. A volume of interest was drawn for each lesion using dedicated software, and, for each phase, first-order (histogram) and second-order (grey-level colour matrix and run-length matrix) radiological features were extracted. Data were revised by an unsupervised machine learning approach using the K-means clustering technique. Results Of operated patients, nine had non-functional adenoma and 10 carcinoma. There were 11 patients in the surveillance group. Two first-order features in unenhanced CT and one in arterial CT, and 14 second-order parameters in unenhanced and venous CT and 10 second-order features in arterial CT, were able to differentiate adrenocortical carcinoma from adenoma (P < 0.050). After excluding two malignant outliers, the unsupervised machine learning approach correctly predicted malignancy in seven of eight adrenocortical carcinomas in all phases. Conclusion Radiomics with CT texture analysis was able to discriminate malignant from benign adrenocortical tumours, even by an unsupervised machine learning approach, in nearly all patients.


2019 ◽  
Vol 116 (3) ◽  
pp. 364a
Author(s):  
Cesar A. López ◽  
Boian Alexandrov ◽  
S. Gnanakaran

Molecules ◽  
2019 ◽  
Vol 24 (11) ◽  
pp. 2097 ◽  
Author(s):  
Ambrose Plante ◽  
Derek M. Shore ◽  
Giulia Morra ◽  
George Khelashvili ◽  
Harel Weinstein

G protein-coupled receptors (GPCRs) play a key role in many cellular signaling mechanisms, and must select among multiple coupling possibilities in a ligand-specific manner in order to carry out a myriad of functions in diverse cellular contexts. Much has been learned about the molecular mechanisms of ligand-GPCR complexes from Molecular Dynamics (MD) simulations. However, to explore ligand-specific differences in the response of a GPCR to diverse ligands, as is required to understand ligand bias and functional selectivity, necessitates creating very large amounts of data from the needed large-scale simulations. This becomes a Big Data problem for the high dimensionality analysis of the accumulated trajectories. Here we describe a new machine learning (ML) approach to the problem that is based on transforming the analysis of GPCR function-related, ligand-specific differences encoded in the MD simulation trajectories into a representation recognizable by state-of-the-art deep learning object recognition technology. We illustrate this method by applying it to recognize the pharmacological classification of ligands bound to the 5-HT2A and D2 subtypes of class-A GPCRs from the serotonin and dopamine families. The ML-based approach is shown to perform the classification task with high accuracy, and we identify the molecular determinants of the classifications in the context of GPCR structure and function. This study builds a framework for the efficient computational analysis of MD Big Data collected for the purpose of understanding ligand-specific GPCR activity.


2020 ◽  
Author(s):  
Daniel Oluwadara Fadokun ◽  
Ishioma Bridget Oshilike ◽  
Mike Obi Onyekonwu

Sign in / Sign up

Export Citation Format

Share Document