Loading…

Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression

Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug resp...

Full description

Saved in:
Bibliographic Details
Published in:Genomics & informatics 2021, Vol.19 (1), p.10.1-10.7
Main Authors: Qiu, Kexin, Lee, JoongHo, Kim, HanByeol, Yoon, Seokhyun, Kang, Keunsoo
Format: Article
Language:Korean
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 10.7
container_issue 1
container_start_page 10.1
container_title Genomics & informatics
container_volume 19
creator Qiu, Kexin
Lee, JoongHo
Kim, HanByeol
Yoon, Seokhyun
Kang, Keunsoo
description Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC50 and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC50 and 0.81 for AUC. We identify common predictor genes for IC50 and AUC, with which the performance was similar to those with genes separately found for IC50 and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC50, 0.7945 for AUC) with 321 predictor genes.
format article
fullrecord <record><control><sourceid>kisti</sourceid><recordid>TN_cdi_kisti_ndsl_JAKO202123162133735</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>JAKO202123162133735</sourcerecordid><originalsourceid>FETCH-kisti_ndsl_JAKO2021231621337353</originalsourceid><addsrcrecordid>eNqNjjFrwzAQhUVpoabNf7ilo8GWbMUeQ0kpKaVLhmxGli6OiDgbnQNZ-88rQ7J3egfvu_feg8ikVFVerCv5KLKybpu80frwLFbMvi9qpdZat20mfr-NPXlCCGgieRqgN4wODM0-t4YsRnDxMkBEnkZihCmi83b2IyXIAac_e4LjGO9OugYkZLjwkncLsRgChKVpMQGviU5bRnoVT0cTGFc3fRFvH9v9-2d-9jz7jhyHbrf5-pGFLKUqtSzTeFWr_3J_7QlR7w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression</title><source>Open Access: PubMed Central</source><creator>Qiu, Kexin ; Lee, JoongHo ; Kim, HanByeol ; Yoon, Seokhyun ; Kang, Keunsoo</creator><creatorcontrib>Qiu, Kexin ; Lee, JoongHo ; Kim, HanByeol ; Yoon, Seokhyun ; Kang, Keunsoo</creatorcontrib><description>Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC50 and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC50 and 0.81 for AUC. We identify common predictor genes for IC50 and AUC, with which the performance was similar to those with genes separately found for IC50 and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC50, 0.7945 for AUC) with 321 predictor genes.</description><identifier>ISSN: 1598-866X</identifier><identifier>EISSN: 2234-0742</identifier><language>kor</language><ispartof>Genomics &amp; informatics, 2021, Vol.19 (1), p.10.1-10.7</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,4024</link.rule.ids></links><search><creatorcontrib>Qiu, Kexin</creatorcontrib><creatorcontrib>Lee, JoongHo</creatorcontrib><creatorcontrib>Kim, HanByeol</creatorcontrib><creatorcontrib>Yoon, Seokhyun</creatorcontrib><creatorcontrib>Kang, Keunsoo</creatorcontrib><title>Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression</title><title>Genomics &amp; informatics</title><addtitle>Genomics &amp; informatics</addtitle><description>Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC50 and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC50 and 0.81 for AUC. We identify common predictor genes for IC50 and AUC, with which the performance was similar to those with genes separately found for IC50 and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC50, 0.7945 for AUC) with 321 predictor genes.</description><issn>1598-866X</issn><issn>2234-0742</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNjjFrwzAQhUVpoabNf7ilo8GWbMUeQ0kpKaVLhmxGli6OiDgbnQNZ-88rQ7J3egfvu_feg8ikVFVerCv5KLKybpu80frwLFbMvi9qpdZat20mfr-NPXlCCGgieRqgN4wODM0-t4YsRnDxMkBEnkZihCmi83b2IyXIAac_e4LjGO9OugYkZLjwkncLsRgChKVpMQGviU5bRnoVT0cTGFc3fRFvH9v9-2d-9jz7jhyHbrf5-pGFLKUqtSzTeFWr_3J_7QlR7w</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Qiu, Kexin</creator><creator>Lee, JoongHo</creator><creator>Kim, HanByeol</creator><creator>Yoon, Seokhyun</creator><creator>Kang, Keunsoo</creator><scope>JDI</scope></search><sort><creationdate>2021</creationdate><title>Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression</title><author>Qiu, Kexin ; Lee, JoongHo ; Kim, HanByeol ; Yoon, Seokhyun ; Kang, Keunsoo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-kisti_ndsl_JAKO2021231621337353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>kor</language><creationdate>2021</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Qiu, Kexin</creatorcontrib><creatorcontrib>Lee, JoongHo</creatorcontrib><creatorcontrib>Kim, HanByeol</creatorcontrib><creatorcontrib>Yoon, Seokhyun</creatorcontrib><creatorcontrib>Kang, Keunsoo</creatorcontrib><collection>KoreaScience</collection><jtitle>Genomics &amp; informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qiu, Kexin</au><au>Lee, JoongHo</au><au>Kim, HanByeol</au><au>Yoon, Seokhyun</au><au>Kang, Keunsoo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression</atitle><jtitle>Genomics &amp; informatics</jtitle><addtitle>Genomics &amp; informatics</addtitle><date>2021</date><risdate>2021</risdate><volume>19</volume><issue>1</issue><spage>10.1</spage><epage>10.7</epage><pages>10.1-10.7</pages><issn>1598-866X</issn><eissn>2234-0742</eissn><abstract>Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC50 and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC50 and 0.81 for AUC. We identify common predictor genes for IC50 and AUC, with which the performance was similar to those with genes separately found for IC50 and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC50, 0.7945 for AUC) with 321 predictor genes.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1598-866X
ispartof Genomics & informatics, 2021, Vol.19 (1), p.10.1-10.7
issn 1598-866X
2234-0742
language kor
recordid cdi_kisti_ndsl_JAKO202123162133735
source Open Access: PubMed Central
title Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T21%3A52%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kisti&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20learning%20based%20anti-cancer%20drug%20response%20prediction%20and%20search%20for%20predictor%20genes%20using%20cancer%20cell%20line%20gene%20expression&rft.jtitle=Genomics%20&%20informatics&rft.au=Qiu,%20Kexin&rft.date=2021&rft.volume=19&rft.issue=1&rft.spage=10.1&rft.epage=10.7&rft.pages=10.1-10.7&rft.issn=1598-866X&rft.eissn=2234-0742&rft_id=info:doi/&rft_dat=%3Ckisti%3EJAKO202123162133735%3C/kisti%3E%3Cgrp_id%3Ecdi_FETCH-kisti_ndsl_JAKO2021231621337353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true