Loading…

MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network

Abstract Motivation Predicting new drug–target interactions is an important step in new drug development, understanding of its side effects and drug repositioning. Heterogeneous data sources can provide comprehensive information and different perspectives for drug–target interaction prediction. Thus...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics 2021-12, Vol.37 (23), p.4485-4492
Main Authors: Zhou, Deshan, Xu, Zhijian, Li, WenTao, Xie, Xiaolan, Peng, Shaoliang
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03
cites cdi_FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03
container_end_page 4492
container_issue 23
container_start_page 4485
container_title Bioinformatics
container_volume 37
creator Zhou, Deshan
Xu, Zhijian
Li, WenTao
Xie, Xiaolan
Peng, Shaoliang
description Abstract Motivation Predicting new drug–target interactions is an important step in new drug development, understanding of its side effects and drug repositioning. Heterogeneous data sources can provide comprehensive information and different perspectives for drug–target interaction prediction. Thus, there have been many calculation methods relying on heterogeneous networks. Most of them use graph-related algorithms to characterize nodes in heterogeneous networks for predicting new drug–target interactions (DTI). However, these methods can only make predictions in known heterogeneous network datasets, and cannot support the prediction of new chemical entities outside the heterogeneous network, which hinder further drug discovery and development. Results To solve this problem, we proposed a multi-modal DTI prediction model named ‘MultiDTI’ which uses our proposed joint learning framework based on heterogeneous networks. It combines the interaction or association information of the heterogeneous network and the drug/target sequence information, and maps the drugs, targets, side effects and disease nodes in the heterogeneous network into a common space. In this way, ‘MultiDTI’ can map the new chemical entity to this learned common space based on the chemical structure of the new entity. That is, bridging the gap between new chemical entities and known heterogeneous network. Our model has strong predictive performance, and the area under the receiver operating characteristic curve of the model is 0.961 and the area under the precision recall curve is 0.947 with 10-fold cross validation. In addition, some predicted new DTIs have been confirmed by ChEMBL database. Our results indicate that ‘MultiDTI’ is a powerful and practical tool for predicting new DTI, which can promote the development of drug discovery or drug repositioning. Availability and implementation Python codes and dataset are available at https://github.com/Deshan-Zhou/MultiDTI/. Supplementary information Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btab473
format article
fullrecord <record><control><sourceid>oup_TOX</sourceid><recordid>TN_cdi_crossref_primary_10_1093_bioinformatics_btab473</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btab473</oup_id><sourcerecordid>10.1093/bioinformatics/btab473</sourcerecordid><originalsourceid>FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03</originalsourceid><addsrcrecordid>eNqNkEtOwzAURS0E4r8F5A0E7NpJGmaIvwRiUsaRP8-pobEj21HFjD2wDHbFSjC0IDFj9O7gnPuki9ARJceUNOxEWm-d8aEXyap4IpOQvGYbaJfyihQTUjabObOqLviUsB20F-MTISXlnG-jHcbplDQ12UXv9-Mi2YvZ7SnWYew-Xt-SCB0kbF2CIFSy3uEhgLarKEUEjXPov7yi91oscIBMRHBJfDMLEMFZ1-HksQxWd4DTHHAnBiwhLQEcdrDEag69VVnPok0WIhZO42fnlw7PIX_3HTjwY8x0WvrwfIC2jFhEOFzfffR4dTk7vynuHq5vz8_uCsVpk4ppbaayUoQzIaXSEyXBGK0rY8A0QKisqoZww2qtclZASlODmZSCMw2lImwfVateFXyMAUw7BNuL8NJS0n6t3_5dv12vn8WjlTiMsgf9q_3MnQG6Avw4_Lf0E3ENoNE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network</title><source>Oxford University Press Open Access</source><creator>Zhou, Deshan ; Xu, Zhijian ; Li, WenTao ; Xie, Xiaolan ; Peng, Shaoliang</creator><contributor>Przytycka, Teresa</contributor><creatorcontrib>Zhou, Deshan ; Xu, Zhijian ; Li, WenTao ; Xie, Xiaolan ; Peng, Shaoliang ; Przytycka, Teresa</creatorcontrib><description>Abstract Motivation Predicting new drug–target interactions is an important step in new drug development, understanding of its side effects and drug repositioning. Heterogeneous data sources can provide comprehensive information and different perspectives for drug–target interaction prediction. Thus, there have been many calculation methods relying on heterogeneous networks. Most of them use graph-related algorithms to characterize nodes in heterogeneous networks for predicting new drug–target interactions (DTI). However, these methods can only make predictions in known heterogeneous network datasets, and cannot support the prediction of new chemical entities outside the heterogeneous network, which hinder further drug discovery and development. Results To solve this problem, we proposed a multi-modal DTI prediction model named ‘MultiDTI’ which uses our proposed joint learning framework based on heterogeneous networks. It combines the interaction or association information of the heterogeneous network and the drug/target sequence information, and maps the drugs, targets, side effects and disease nodes in the heterogeneous network into a common space. In this way, ‘MultiDTI’ can map the new chemical entity to this learned common space based on the chemical structure of the new entity. That is, bridging the gap between new chemical entities and known heterogeneous network. Our model has strong predictive performance, and the area under the receiver operating characteristic curve of the model is 0.961 and the area under the precision recall curve is 0.947 with 10-fold cross validation. In addition, some predicted new DTIs have been confirmed by ChEMBL database. Our results indicate that ‘MultiDTI’ is a powerful and practical tool for predicting new DTI, which can promote the development of drug discovery or drug repositioning. Availability and implementation Python codes and dataset are available at https://github.com/Deshan-Zhou/MultiDTI/. Supplementary information Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btab473</identifier><identifier>PMID: 34180970</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Algorithms ; Drug Development ; Drug Discovery ; Drug Repositioning ; Drug-Related Side Effects and Adverse Reactions ; Humans</subject><ispartof>Bioinformatics, 2021-12, Vol.37 (23), p.4485-4492</ispartof><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2021</rights><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03</citedby><cites>FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03</cites><orcidid>0000-0002-6725-9303 ; 0000-0002-3063-8473</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,1604,27924,27925</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btab473$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34180970$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Przytycka, Teresa</contributor><creatorcontrib>Zhou, Deshan</creatorcontrib><creatorcontrib>Xu, Zhijian</creatorcontrib><creatorcontrib>Li, WenTao</creatorcontrib><creatorcontrib>Xie, Xiaolan</creatorcontrib><creatorcontrib>Peng, Shaoliang</creatorcontrib><title>MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract Motivation Predicting new drug–target interactions is an important step in new drug development, understanding of its side effects and drug repositioning. Heterogeneous data sources can provide comprehensive information and different perspectives for drug–target interaction prediction. Thus, there have been many calculation methods relying on heterogeneous networks. Most of them use graph-related algorithms to characterize nodes in heterogeneous networks for predicting new drug–target interactions (DTI). However, these methods can only make predictions in known heterogeneous network datasets, and cannot support the prediction of new chemical entities outside the heterogeneous network, which hinder further drug discovery and development. Results To solve this problem, we proposed a multi-modal DTI prediction model named ‘MultiDTI’ which uses our proposed joint learning framework based on heterogeneous networks. It combines the interaction or association information of the heterogeneous network and the drug/target sequence information, and maps the drugs, targets, side effects and disease nodes in the heterogeneous network into a common space. In this way, ‘MultiDTI’ can map the new chemical entity to this learned common space based on the chemical structure of the new entity. That is, bridging the gap between new chemical entities and known heterogeneous network. Our model has strong predictive performance, and the area under the receiver operating characteristic curve of the model is 0.961 and the area under the precision recall curve is 0.947 with 10-fold cross validation. In addition, some predicted new DTIs have been confirmed by ChEMBL database. Our results indicate that ‘MultiDTI’ is a powerful and practical tool for predicting new DTI, which can promote the development of drug discovery or drug repositioning. Availability and implementation Python codes and dataset are available at https://github.com/Deshan-Zhou/MultiDTI/. Supplementary information Supplementary data are available at Bioinformatics online.</description><subject>Algorithms</subject><subject>Drug Development</subject><subject>Drug Discovery</subject><subject>Drug Repositioning</subject><subject>Drug-Related Side Effects and Adverse Reactions</subject><subject>Humans</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNkEtOwzAURS0E4r8F5A0E7NpJGmaIvwRiUsaRP8-pobEj21HFjD2wDHbFSjC0IDFj9O7gnPuki9ARJceUNOxEWm-d8aEXyap4IpOQvGYbaJfyihQTUjabObOqLviUsB20F-MTISXlnG-jHcbplDQ12UXv9-Mi2YvZ7SnWYew-Xt-SCB0kbF2CIFSy3uEhgLarKEUEjXPov7yi91oscIBMRHBJfDMLEMFZ1-HksQxWd4DTHHAnBiwhLQEcdrDEag69VVnPok0WIhZO42fnlw7PIX_3HTjwY8x0WvrwfIC2jFhEOFzfffR4dTk7vynuHq5vz8_uCsVpk4ppbaayUoQzIaXSEyXBGK0rY8A0QKisqoZww2qtclZASlODmZSCMw2lImwfVateFXyMAUw7BNuL8NJS0n6t3_5dv12vn8WjlTiMsgf9q_3MnQG6Avw4_Lf0E3ENoNE</recordid><startdate>20211207</startdate><enddate>20211207</enddate><creator>Zhou, Deshan</creator><creator>Xu, Zhijian</creator><creator>Li, WenTao</creator><creator>Xie, Xiaolan</creator><creator>Peng, Shaoliang</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-6725-9303</orcidid><orcidid>https://orcid.org/0000-0002-3063-8473</orcidid></search><sort><creationdate>20211207</creationdate><title>MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network</title><author>Zhou, Deshan ; Xu, Zhijian ; Li, WenTao ; Xie, Xiaolan ; Peng, Shaoliang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Drug Development</topic><topic>Drug Discovery</topic><topic>Drug Repositioning</topic><topic>Drug-Related Side Effects and Adverse Reactions</topic><topic>Humans</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Deshan</creatorcontrib><creatorcontrib>Xu, Zhijian</creatorcontrib><creatorcontrib>Li, WenTao</creatorcontrib><creatorcontrib>Xie, Xiaolan</creatorcontrib><creatorcontrib>Peng, Shaoliang</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhou, Deshan</au><au>Xu, Zhijian</au><au>Li, WenTao</au><au>Xie, Xiaolan</au><au>Peng, Shaoliang</au><au>Przytycka, Teresa</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2021-12-07</date><risdate>2021</risdate><volume>37</volume><issue>23</issue><spage>4485</spage><epage>4492</epage><pages>4485-4492</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract Motivation Predicting new drug–target interactions is an important step in new drug development, understanding of its side effects and drug repositioning. Heterogeneous data sources can provide comprehensive information and different perspectives for drug–target interaction prediction. Thus, there have been many calculation methods relying on heterogeneous networks. Most of them use graph-related algorithms to characterize nodes in heterogeneous networks for predicting new drug–target interactions (DTI). However, these methods can only make predictions in known heterogeneous network datasets, and cannot support the prediction of new chemical entities outside the heterogeneous network, which hinder further drug discovery and development. Results To solve this problem, we proposed a multi-modal DTI prediction model named ‘MultiDTI’ which uses our proposed joint learning framework based on heterogeneous networks. It combines the interaction or association information of the heterogeneous network and the drug/target sequence information, and maps the drugs, targets, side effects and disease nodes in the heterogeneous network into a common space. In this way, ‘MultiDTI’ can map the new chemical entity to this learned common space based on the chemical structure of the new entity. That is, bridging the gap between new chemical entities and known heterogeneous network. Our model has strong predictive performance, and the area under the receiver operating characteristic curve of the model is 0.961 and the area under the precision recall curve is 0.947 with 10-fold cross validation. In addition, some predicted new DTIs have been confirmed by ChEMBL database. Our results indicate that ‘MultiDTI’ is a powerful and practical tool for predicting new DTI, which can promote the development of drug discovery or drug repositioning. Availability and implementation Python codes and dataset are available at https://github.com/Deshan-Zhou/MultiDTI/. Supplementary information Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>34180970</pmid><doi>10.1093/bioinformatics/btab473</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0002-6725-9303</orcidid><orcidid>https://orcid.org/0000-0002-3063-8473</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2021-12, Vol.37 (23), p.4485-4492
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_crossref_primary_10_1093_bioinformatics_btab473
source Oxford University Press Open Access
subjects Algorithms
Drug Development
Drug Discovery
Drug Repositioning
Drug-Related Side Effects and Adverse Reactions
Humans
title MultiDTI: drug–target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T12%3A33%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-oup_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MultiDTI:%20drug%E2%80%93target%20interaction%20prediction%20based%20on%20multi-modal%20representation%20learning%20to%20bridge%20the%20gap%20between%20new%20chemical%20entities%20and%20known%20heterogeneous%20network&rft.jtitle=Bioinformatics&rft.au=Zhou,%20Deshan&rft.date=2021-12-07&rft.volume=37&rft.issue=23&rft.spage=4485&rft.epage=4492&rft.pages=4485-4492&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btab473&rft_dat=%3Coup_TOX%3E10.1093/bioinformatics/btab473%3C/oup_TOX%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c419t-87f8b6c043abbcd2cbeffdd6ffef9e01b66904f37dc1b6ce05f7ef25a43de5c03%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/34180970&rft_oup_id=10.1093/bioinformatics/btab473&rfr_iscdi=true