Loading…

Exploiting cheminformatic and machine learning to navigate the available chemical space of potential small molecule inhibitors of SARS-CoV-2

[Display omitted] •Scaffold, flexophore-similarity and activity cliff based chemical space analysis.•Clustering, scaffold hopping and activity cliff analysis of antiviral compounds.•Highlighted most frequent fragments and polypharmacological ligands.•Machine learning to predict structures with high...

Full description

Saved in:
Bibliographic Details
Published in:Computational and structural biotechnology journal 2021-01, Vol.19, p.424-438
Main Authors: Kumar, Abhinit, Loharch, Saurabh, Kumar, Sunil, Ringe, Rajesh P., Parkesh, Raman
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •Scaffold, flexophore-similarity and activity cliff based chemical space analysis.•Clustering, scaffold hopping and activity cliff analysis of antiviral compounds.•Highlighted most frequent fragments and polypharmacological ligands.•Machine learning to predict structures with high probability to bind SARS-CoV-2. The current life-threatening and tenacious pandemic eruption of coronavirus disease in 2019 (COVID-19) has posed a significant global hazard concerning high mortality rate, economic meltdown, and everyday life distress. The rapid spread of COVID-19 demands countermeasures to combat this deadly virus. Currently, there are no drugs approved by the FDA to treat COVID-19. Therefore, discovering small molecule therapeutics for treating COVID-19 infection is essential. So far, only a few small molecule inhibitors are reported for coronaviruses. There is a need to expand the small chemical space of coronaviruses inhibitors by adding potent and selective scaffolds with anti-COVID activity. In this context, the huge antiviral chemical space already available can be analysed using cheminformatic and machine learning to unearth new scaffolds. We created three specific datasets called “antiviral dataset” (N = 38,428) “drug-like antiviral dataset” (N = 20,963) and “anticorona dataset” (N = 433) for this purpose. We analyzed the 433 molecules of “anticorona dataset” for their scaffold diversity, physicochemical distributions, principal component analysis, activity cliffs, R-group decomposition, and scaffold mapping. The scaffold diversity of the “anticorona dataset” in terms of Murcko scaffold analysis demonstrates a thorough representation of diverse chemical scaffolds. However, physicochemical descriptor analysis and principal component analysis demonstrated negligible drug-like features for the “anticorona dataset” molecules. The “antiviral dataset” and “drug-like antiviral dataset” showed low scaffold diversity as measured by the Gini coefficient. The hierarchical clustering of the “antiviral dataset” against the “anticorona dataset” demonstrated little molecular similarity. We generated a library of frequent fragments and polypharmacological ligands targeting various essential viral proteins such as main protease, helicase, papain-like protease, and replicase polyprotein 1ab. Further structural and chemical features of the “anticorona dataset” were compared with SARS-CoV-2 repurposed drugs, FDA-approved drugs, natural products, and drugs c
ISSN:2001-0370
2001-0370
DOI:10.1016/j.csbj.2020.12.028