Correlation between Virtual Screening Performance and Binding Site Descriptors of Protein Targets

CompScore: boosting structure-based virtual screening performance by incorporating docking scoring functions components into consensus scoring

10.1101/550590 ◽

2019 ◽

Author(s):

Yunierkis Perez-Castillo ◽

Stellamaris Sotomayor-Burneo ◽

Karina Jimenes-Vargas ◽

Mario Gonzalez-Rodriguez ◽

Maykel Cruz-Monteagudo ◽

...

Keyword(s):

Genetic Algorithms ◽

Virtual Screening ◽

High Performance ◽

Scoring Function ◽

Scoring Functions ◽

Traditional Use ◽

Screening Performance ◽

Consensus Scoring ◽

Improved Performance ◽

Virtual Screening Performance

AbstractConsensus scoring has become a commonly used strategy within structure-based virtual screening (VS) workflows with improved performance compared to those based in a single scoring function. However, no research has been devoted to analyze the worth of docking scoring functions components in consensus scoring. We implemented and tested a method that incorporates docking scoring functions components into the setting of high performance VS workflows. This method uses genetic algorithms for finding the combination of scoring components that maximizes the VS enrichment for any target. Our methodology was validated using a dataset that contains ligands and decoys for 102 targets that has been widely used in VS validation studies. Results show that our approach outperforms other methods for all targets. It also boosts the initial enrichment performance of the traditional use of whole scoring functions in consensus scoring by an average of 45%. CompScore is freely available at: http://bioquimio.udla.edu.ec/compscore/

Download Full-text

Incorporating structural similarity into a scoring function to enhance the prediction of binding affinities

Journal of Cheminformatics ◽

10.1186/s13321-021-00493-4 ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Beihong Ji ◽

Xibing He ◽

Yuzhao Zhang ◽

Jingchen Zhai ◽

Viet Hoang Man ◽

...

Keyword(s):

Computational Cost ◽

Scoring Function ◽

Structural Similarity ◽

Scoring Functions ◽

Binding Affinities ◽

Autodock Vina ◽

Predictive Index ◽

Drug Lead ◽

Screening Performance ◽

Calibration Algorithm

AbstractIn this study, we developed a novel algorithm to improve the screening performance of an arbitrary docking scoring function by recalibrating the docking score of a query compound based on its structure similarity with a set of training compounds, while the extra computational cost is neglectable. Two popular docking methods, Glide and AutoDock Vina were adopted as the original scoring functions to be processed with our new algorithm and similar improvement performance was achieved. Predicted binding affinities were compared against experimental data from ChEMBL and DUD-E databases. 11 representative drug receptors from diverse drug target categories were applied to evaluate the hybrid scoring function. The effects of four different fingerprints (FP2, FP3, FP4, and MACCS) and the four different compound similarity effect (CSE) functions were explored. Encouragingly, the screening performance was significantly improved for all 11 drug targets especially when CSE = S4 (S is the Tanimoto structural similarity) and FP2 fingerprint were applied. The average predictive index (PI) values increased from 0.34 to 0.66 and 0.39 to 0.71 for the Glide and AutoDock vina scoring functions, respectively. To evaluate the performance of the calibration algorithm in drug lead identification, we also imposed an upper limit on the structural similarity to mimic the real scenario of screening diverse libraries for which query ligands are general-purpose screening compounds and they are not necessarily structurally similar to reference ligands. Encouragingly, we found our hybrid scoring function still outperformed the original docking scoring function. The hybrid scoring function was further evaluated using external datasets for two systems and we found the PI values increased from 0.24 to 0.46 and 0.14 to 0.42 for A2AR and CFX systems, respectively. In a conclusion, our calibration algorithm can significantly improve the virtual screening performance in both drug lead optimization and identification phases with neglectable computational cost.

Download Full-text

CompScore: Boosting Structure-Based Virtual Screening Performance by Incorporating Docking Scoring Function Components into Consensus Scoring

Journal of Chemical Information and Modeling ◽

10.1021/acs.jcim.9b00343 ◽

2019 ◽

Vol 59 (9) ◽

pp. 3655-3666 ◽

Cited By ~ 7

Author(s):

Yunierkis Perez-Castillo ◽

Stellamaris Sotomayor-Burneo ◽

Karina Jimenes-Vargas ◽

Mario Gonzalez-Rodriguez ◽

Maykel Cruz-Monteagudo ◽

...

Keyword(s):

Virtual Screening ◽

Scoring Function ◽

Screening Performance ◽

Consensus Scoring ◽

Virtual Screening Performance

Download Full-text

Virtual Screening with Gnina 1.0

10.20944/preprints202111.0329.v1 ◽

2021 ◽

Author(s):

Jocelyn Sunseri ◽

David Koes

Keyword(s):

Virtual Screening ◽

Scoring Function ◽

Compound Library ◽

Autodock Vina ◽

Convolutional Networks ◽

Development Costs ◽

Screening Performance ◽

Computationally Intensive ◽

Speed And Accuracy ◽

Virtual Screening Performance

Virtual screening - predicting which compounds within a specified compound library bind to a target molecule, typically a protein - is a fundamental task in the field of drug discovery. Doing virtual screening well provides tangible practical benefits, including reduced drug development costs, faster time to therapeutic viability, and fewer unforeseen side effects. As with most applied computational tasks, the algorithms currently used to perform virtual screening feature inherent tradeoffs between speed and accuracy. Furthermore, even theoretically rigorous, computationally intensive methods may fail to account for important effects relevant to whether a given compound will ultimately be usable as a drug. Here we investigate the virtual screening performance of the recently released Gnina molecular docking software, which uses deep convolutional networks to score protein-ligand structures. We find, on average, that Gnina outperforms conventional empirical scoring. The default scoring in Gnina outperforms the empirical AutoDock Vina scoring function on 89 of the 117 targets of the DUD-E and LIT-PCBA virtual screening benchmarks with a median 1% early enrichment factor that is more than twice that of Vina. However, we also find that issues of bias linger in these sets, even when not used directly to train models, and this bias obfuscates to what extent machine learning models are achieving their performance through a sophisticated interpretation of molecular interactions versus fitting to non-informative simplistic property distributions.

Download Full-text

Virtual Screening with Gnina 1.0

Molecules ◽

10.3390/molecules26237369 ◽

2021 ◽

Vol 26 (23) ◽

pp. 7369

Author(s):

Jocelyn Sunseri ◽

David Ryan Koes

Keyword(s):

Virtual Screening ◽

Scoring Function ◽

Compound Library ◽

Autodock Vina ◽

Convolutional Networks ◽

Development Costs ◽

Screening Performance ◽

Computationally Intensive ◽

Speed And Accuracy ◽

Virtual Screening Performance

Virtual screening—predicting which compounds within a specified compound library bind to a target molecule, typically a protein—is a fundamental task in the field of drug discovery. Doing virtual screening well provides tangible practical benefits, including reduced drug development costs, faster time to therapeutic viability, and fewer unforeseen side effects. As with most applied computational tasks, the algorithms currently used to perform virtual screening feature inherent tradeoffs between speed and accuracy. Furthermore, even theoretically rigorous, computationally intensive methods may fail to account for important effects relevant to whether a given compound will ultimately be usable as a drug. Here we investigate the virtual screening performance of the recently released Gnina molecular docking software, which uses deep convolutional networks to score protein-ligand structures. We find, on average, that Gnina outperforms conventional empirical scoring. The default scoring in Gnina outperforms the empirical AutoDock Vina scoring function on 89 of the 117 targets of the DUD-E and LIT-PCBA virtual screening benchmarks with a median 1% early enrichment factor that is more than twice that of Vina. However, we also find that issues of bias linger in these sets, even when not used directly to train models, and this bias obfuscates to what extent machine learning models are achieving their performance through a sophisticated interpretation of molecular interactions versus fitting to non-informative simplistic property distributions.

Download Full-text

Selecting Machine-Learning Scoring Functions for Structure-Based Virtual Screening

10.26434/chemrxiv.12967160 ◽

2020 ◽

Author(s):

Pedro Ballester

Keyword(s):

Machine Learning ◽

Drug Discovery ◽

Virtual Screening ◽

Predictive Accuracy ◽

Scoring Function ◽

3D Models ◽

Large Datasets ◽

Scoring Functions ◽

Discovery Process ◽

Drug Discovery Process

Interest in docking technologies has grown parallel to the ever increasing number and diversity of 3D models for macromolecular therapeutic targets. Structure-Based Virtual Screening (SBVS) aims at leveraging these experimental structures to discover the necessary starting points for the drug discovery process. It is now established that Machine Learning (ML) can strongly enhance the predictive accuracy of scoring functions for SBVS by exploiting large datasets from targets, molecules and their associations. However, with greater choice, the question of which ML-based scoring function is the most suitable for prospective use on a given target has gained importance. Here we analyse two approaches to select an existing scoring function for the target along with a third approach consisting in generating a scoring function tailored to the target. These analyses required discussing the limitations of popular SBVS benchmarks, the alternatives to benchmark scoring functions for SBVS and how to generate them or use them using freely-available software.

Download Full-text

Learning RMSD to Improve Protein-Ligand Scoring and Pose Selection

10.26434/chemrxiv.11910870.v1 ◽

2020 ◽

Author(s):

Rishal Aggarwal ◽

David R. Koes

Keyword(s):

Scoring Function ◽

Target Site ◽

Scoring Functions ◽

Autodock Vina ◽

Binding Ability ◽

Structure Based Drug Design ◽

Chemical Structures ◽

Novel Approach ◽

Experimental Values ◽

Ligand Conformation

Docking algorithms are an essential part of the Structure Based Drug Design (SBDD) process as they aim to effectively identify the binding poses of chemical structures at the target site. These algorithms are reliant on scoring functions that evaluate the binding ability of a ligand conformation. Typically, scoring functions are designed to predict the binding affinity of various poses at the target site. In this work, we design a novel approach where the scoring function attempts to predict the Root Mean Square Deviation (RMSD) of a pose to the true binding pose. We show that a Convolutional Neural Network (CNN) can be trained to learn these RMSD values with high correlation between predicted and experimental values. Furthermore we show that this scoring function can improve pose selection performance when used in combination with orthogonal scoring functions like Autodock Vina.

Download Full-text

DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity

PeerJ ◽

10.7717/peerj.7362 ◽

2019 ◽

Vol 7 ◽

pp. e7362 ◽

Cited By ~ 13

Author(s):

Haiping Zhang ◽

Linbu Liao ◽

Konda Mani Saravanan ◽

Peng Yin ◽

Yanjie Wei

Keyword(s):

Deep Learning ◽

Ligand Binding ◽

Binding Affinity ◽

Mean Squared Error ◽

Scoring Function ◽

Binding Mode ◽

Scoring Functions ◽

Autodock Vina ◽

Cellular Functions ◽

Ligand Complex

Proteins interact with small molecules to modulate several important cellular functions. Many acute diseases were cured by small molecule binding in the active site of protein either by inhibition or activation. Currently, there are several docking programs to estimate the binding position and the binding orientation of protein–ligand complex. Many scoring functions were developed to estimate the binding strength and predict the effective protein–ligand binding. While the accuracy of current scoring function is limited by several aspects, the solvent effect, entropy effect, and multibody effect are largely ignored in traditional machine learning methods. In this paper, we proposed a new deep neural network-based model named DeepBindRG to predict the binding affinity of protein–ligand complex, which learns all the effects, binding mode, and specificity implicitly by learning protein–ligand interface contact information from a large protein–ligand dataset. During the initial data processing step, the critical interface information was preserved to make sure the input is suitable for the proposed deep learning model. While validating our model on three independent datasets, DeepBindRG achieves root mean squared error (RMSE) value of pKa (−logKd or −logKi) about 1.6–1.8 and R value around 0.5–0.6, which is better than the autodock vina whose RMSE value is about 2.2–2.4 and R value is 0.42–0.57. We also explored the detailed reasons for the performance of DeepBindRG, especially for several failed cases by vina. Furthermore, DeepBindRG performed better for four challenging datasets from DUD.E database with no experimental protein–ligand complexes. The better performance of DeepBindRG than autodock vina in predicting protein–ligand binding affinity indicates that deep learning approach can greatly help with the drug discovery process. We also compare the performance of DeepBindRG with a 4D based deep learning method “pafnucy”, the advantage and limitation of both methods have provided clues for improving the deep learning based protein–ligand prediction model in the future.

Download Full-text

Selecting Machine-Learning Scoring Functions for Structure-Based Virtual Screening

10.26434/chemrxiv.12967160.v1 ◽

2020 ◽

Author(s):

Pedro Ballester

Keyword(s):

Machine Learning ◽

Drug Discovery ◽

Virtual Screening ◽

Predictive Accuracy ◽

Scoring Function ◽

3D Models ◽

Large Datasets ◽

Scoring Functions ◽

Discovery Process ◽

Drug Discovery Process

Interest in docking technologies has grown parallel to the ever increasing number and diversity of 3D models for macromolecular therapeutic targets. Structure-Based Virtual Screening (SBVS) aims at leveraging these experimental structures to discover the necessary starting points for the drug discovery process. It is now established that Machine Learning (ML) can strongly enhance the predictive accuracy of scoring functions for SBVS by exploiting large datasets from targets, molecules and their associations. However, with greater choice, the question of which ML-based scoring function is the most suitable for prospective use on a given target has gained importance. Here we analyse two approaches to select an existing scoring function for the target along with a third approach consisting in generating a scoring function tailored to the target. These analyses required discussing the limitations of popular SBVS benchmarks, the alternatives to benchmark scoring functions for SBVS and how to generate them or use them using freely-available software.

Download Full-text

Machine learning classification can reduce false positives in structure-based virtual screening

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2000585117 ◽

2020 ◽

Vol 117 (31) ◽

pp. 18477-18488 ◽

Cited By ~ 6

Author(s):

Yusuf O. Adeshina ◽

Eric J. Deeds ◽

John Karanicolas

Keyword(s):

Virtual Screening ◽

General Purpose ◽

Training Dataset ◽

Scoring Functions ◽

Protein Targets ◽

Machine Learning Classification ◽

Biochemical Assays ◽

Active Chemical ◽

Early Drug ◽

Better Than

With the recent explosion in the size of libraries available for screening, virtual screening is positioned to assume a more prominent role in early drug discovery’s search for active chemical matter. In typical virtual screens, however, only about 12% of the top-scoring compounds actually show activity when tested in biochemical assays. We argue that most scoring functions used for this task have been developed with insufficient thoughtfulness into the datasets on which they are trained and tested, leading to overly simplistic models and/or overtraining. These problems are compounded in the literature because studies reporting new scoring methods have not validated their models prospectively within the same study. Here, we report a strategy for building a training dataset (D-COID) that aims to generate highly compelling decoy complexes that are individually matched to available active complexes. Using this dataset, we train a general-purpose classifier for virtual screening (vScreenML) that is built on the XGBoost framework. In retrospective benchmarks, our classifier shows outstanding performance relative to other scoring functions. In a prospective context, nearly all candidate inhibitors from a screen against acetylcholinesterase show detectable activity; beyond this, 10 of 23 compounds have IC50better than 50 μM. Without any medicinal chemistry optimization, the most potent hit has IC50280 nM, corresponding toKiof 173 nM. These results support using the D-COID strategy for training classifiers in other computational biology tasks, and for vScreenML in virtual screening campaigns against other protein targets. Both D-COID and vScreenML are freely distributed to facilitate such efforts.

Download Full-text