polyClustR: defining communities of reconciled cancer subtypes with biological and prognostic significance

Bibliographic Details
Published in: BMC Bioinformatics 2018-05, Vol. 19 (1), p. 182, Article 182
Main Authors: Eason, Katherine; Nyamundanda, Gift; Sadanandam, Anguraj
Format: Article
Language: English
Description
Summary: To ensure cancer patients are stratified towards treatments that are optimally beneficial, it is a priority to define robust molecular subtypes using clustering methods applied to high-dimensional biological data. If each of these methods produces different numbers of clusters for the same data, it is difficult to achieve an optimal solution. Here, we introduce "polyClustR", a tool that reconciles clusters identified by different methods into subtype "communities" using a hypergeometric test or a measure of relative proportion of common samples. The polyClustR pipeline was initially tested using a breast cancer dataset to demonstrate how results are compatible with and add to the understanding of this well-characterised cancer. Two uveal melanoma datasets were then utilised to identify and validate novel subtype communities with significant metastasis-free prognostic differences and associations with known chromosomal aberrations. We demonstrate the value of the polyClustR approach of applying multiple consensus clustering algorithms and systematically reconciling the results in identifying novel subtype communities of two cancer types, which nevertheless are compatible with established understanding of these diseases. An R implementation of the pipeline is available at: https://github.com/syspremed/polyClustR.
ISSN: 1471-2105
DOI: 10.1186/s12859-018-2204-4
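
Illustration: The summary describes reconciling clusters from different methods into "communities" using a hypergeometric test on shared samples. The sketch below shows one way that idea could work and is not the polyClustR implementation, which is written in R. It is in Python, and the function names (`hypergeom_sf`, `reconcile`), the 0.05 threshold, the toy sample labels, and the use of connected components to form communities are all assumptions made for illustration.

```python
from itertools import product
from math import comb


def hypergeom_sf(overlap, n_total, size_a, size_b):
    """P(X >= overlap) for X ~ Hypergeometric(N=n_total, K=size_a, n=size_b).

    This is the chance of seeing at least `overlap` shared samples if a
    cluster of size_b were drawn at random from n_total samples, of which
    size_a belong to the other cluster.
    """
    upper = min(size_a, size_b)
    favourable = sum(
        comb(size_a, k) * comb(n_total - size_a, size_b - k)
        for k in range(overlap, upper + 1)
    )
    return favourable / comb(n_total, size_b)


def reconcile(clusters_a, clusters_b, n_total, alpha=0.05):
    """Group clusters from two methods into communities (illustrative only).

    Assumption: two clusters are linked when their overlap is significant
    at `alpha`, and a community is a connected component of those links.
    Cluster names must be unique across the two methods.
    """
    neighbours = {name: set() for name in list(clusters_a) + list(clusters_b)}
    for (name_a, a), (name_b, b) in product(clusters_a.items(), clusters_b.items()):
        p_value = hypergeom_sf(len(a & b), n_total, len(a), len(b))
        if p_value < alpha:
            neighbours[name_a].add(name_b)
            neighbours[name_b].add(name_a)

    communities, seen = [], set()
    for start in neighbours:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:  # depth-first walk over linked clusters
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbours[node] - component)
        seen |= component
        communities.append(sorted(component))
    return communities


# Toy data (hypothetical): 12 samples clustered by two methods.
samples = [f"s{i}" for i in range(1, 13)]
method_a = {"A1": set(samples[:6]), "A2": set(samples[6:])}
method_b = {
    "B1": set(samples[:4]),
    "B2": set(samples[4:8]),
    "B3": set(samples[8:]),
}

communities = reconcile(method_a, method_b, n_total=len(samples))
print(communities)
```

In this toy case, A1 and B1 share four samples and A2 and B3 share four, so each pair forms a community, while B2 is split evenly between A1 and A2 and stays on its own. The sketch applies no multiple-testing correction and does not cover the alternative "relative proportion of common samples" measure, whose exact definition is not given in the summary.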