Loading…
PharmRF: A machine‐learning scoring function to identify the best protein‐ligand complexes for structure‐based pharmacophore screening with high enrichments
Structure‐based pharmacophore models are often developed by selecting a single protein‐ligand complex with good resolution and better binding affinity data which prevents the analysis of other structures having a similar potential to act as better templates. PharmRF is a pharmacophore‐based scoring...
Saved in:
Published in: | Journal of computational chemistry 2022-05, Vol.43 (12), p.847-863 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Structure‐based pharmacophore models are often developed by selecting a single protein‐ligand complex with good resolution and better binding affinity data which prevents the analysis of other structures having a similar potential to act as better templates. PharmRF is a pharmacophore‐based scoring function for selecting the best crystal structures with the potential to attain high enrichment rates in pharmacophore‐based virtual screening prospectively. The PharmRF scoring function is trained and tested on the PDBbind v2018 protein‐ligand complex dataset and employs a random forest regressor to correlate protein pocket descriptors and ligand pharmacophoric elements with binding affinity. PharmRF score represents the calculated binding affinity which identifies high‐affinity ligands by thorough pruning of all the PDB entries available for a particular protein of interest with a high PharmRF score. Ligands with high PharmRF scores can provide a better basis for structure‐based pharmacophore enumerations with a better enrichment rate. Evaluated on 10 protein‐ligand systems of the DUD‐E dataset, PharmRF achieved superior performance (average success rate: 77.61%, median success rate: 87.16%) than Vina docking score (75.47%, 79.39%). PharmRF was further evaluated using the CASF‐2016 benchmark set yielding a moderate correlation of 0.591 with experimental binding affinity, similar in performance to 25 scoring functions tested on this dataset. Independent assessment of PharmRF on 8 protein‐ligand systems of LIT‐PCBA dataset exhibited average and median success rates of 57.55% and 74.72% with 4 targets attaining success rate > 90%. The PharmRF scoring model, scripts, and related resources can be accessed at https://github.com/Prasanth-Kumar87/PharmRF.
A machine‐learning scoring function to identify protein‐ligand complexes with desirable pharmacophoric elements with the potential to secure high active enrichments in database screening of small molecules. |
---|---|
ISSN: | 0192-8651 1096-987X |
DOI: | 10.1002/jcc.26840 |