Loading…
Abstract 4415: Exploiting convergent evolution to derive a cisplatin sensitivity gene expression signature in epithelial based cancer
Significance: Gene expression signatures have been successfully used in radiotherapy dosing, predicting benefit of adjuvant chemotherapy, and prognosticating based on hypoxic state. Yet, despite these successes, these tools often fall short in validation between datasets and translation to clinical...
Saved in:
Published in: | Cancer research (Chicago, Ill.) Ill.), 2020-08, Vol.80 (16_Supplement), p.4415-4415 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Significance: Gene expression signatures have been successfully used in radiotherapy dosing, predicting benefit of adjuvant chemotherapy, and prognosticating based on hypoxic state. Yet, despite these successes, these tools often fall short in validation between datasets and translation to clinical use. We propose a novel method for deriving more robust gene expression signatures predictive of chemotherapeutic response. Although this method may be applied to any drug, here, we derive a signature to predict cisplatin sensitivity in epithelial-based cancer cell lines.
Methodology: We ranked epithelial-based cell lines from the Genomics of Drug Discovery in Cancer dataset based on their response to cisplatin (IC50). Differential gene expression analysis between resistant and sensitive cell lines was performed using the SAM, limma, and multtest algorithms implemented with R. The intersection of the DE results from each method became the seed genes in a co-expression network built from TCGA gene expression data. The most highly co-expressed seed genes are extracted and they are termed “connectivity genes.” We perform this analysis 5 times, each time excluding 20% of the GDSC dataset. We formed the final signature by combining genes found in at least 3 of the 5 connectivity gene sets.
A cell line's median expression of the signature genes is termed its “signature score.” Using 5-fold cross validation, models are built to predict IC50 given a cell line's signature score or expression of all signature genes. For every model built testing our gene signature, a null distribution of performance metrics was produced by repeating the procedure 1000 times with random gene signatures of the same length as the signature in question.
Results: The final signature contains 13 genes: ADA, NPM3, CSTA, KRT5, KRT14, ATP1B3, LY6K, USP31, BNC1, MAP7D3, LRRC8C, C15orf41, and SLFN11.
Using linear regression to predict a cell line's IC50 from its signature score, the predicted vs. actual rank of IC50 has a Spearman correlation coefficient of 0.408 and a p-value of |
---|---|
ISSN: | 0008-5472 1538-7445 |
DOI: | 10.1158/1538-7445.AM2020-4415 |