Loading…

Abstract 4415: Exploiting convergent evolution to derive a cisplatin sensitivity gene expression signature in epithelial based cancer

Significance: Gene expression signatures have been successfully used in radiotherapy dosing, predicting benefit of adjuvant chemotherapy, and prognosticating based on hypoxic state. Yet, despite these successes, these tools often fall short in validation between datasets and translation to clinical...

Full description

Saved in:
Bibliographic Details
Published in:Cancer research (Chicago, Ill.) Ill.), 2020-08, Vol.80 (16_Supplement), p.4415-4415
Main Authors: Scarborough, Jessica, Dhawan, Andrew, Scott, Jacob
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Significance: Gene expression signatures have been successfully used in radiotherapy dosing, predicting benefit of adjuvant chemotherapy, and prognosticating based on hypoxic state. Yet, despite these successes, these tools often fall short in validation between datasets and translation to clinical use. We propose a novel method for deriving more robust gene expression signatures predictive of chemotherapeutic response. Although this method may be applied to any drug, here, we derive a signature to predict cisplatin sensitivity in epithelial-based cancer cell lines. Methodology: We ranked epithelial-based cell lines from the Genomics of Drug Discovery in Cancer dataset based on their response to cisplatin (IC50). Differential gene expression analysis between resistant and sensitive cell lines was performed using the SAM, limma, and multtest algorithms implemented with R. The intersection of the DE results from each method became the seed genes in a co-expression network built from TCGA gene expression data. The most highly co-expressed seed genes are extracted and they are termed “connectivity genes.” We perform this analysis 5 times, each time excluding 20% of the GDSC dataset. We formed the final signature by combining genes found in at least 3 of the 5 connectivity gene sets. A cell line's median expression of the signature genes is termed its “signature score.” Using 5-fold cross validation, models are built to predict IC50 given a cell line's signature score or expression of all signature genes. For every model built testing our gene signature, a null distribution of performance metrics was produced by repeating the procedure 1000 times with random gene signatures of the same length as the signature in question. Results: The final signature contains 13 genes: ADA, NPM3, CSTA, KRT5, KRT14, ATP1B3, LY6K, USP31, BNC1, MAP7D3, LRRC8C, C15orf41, and SLFN11. Using linear regression to predict a cell line's IC50 from its signature score, the predicted vs. actual rank of IC50 has a Spearman correlation coefficient of 0.408 and a p-value of
ISSN:0008-5472
1538-7445
DOI:10.1158/1538-7445.AM2020-4415