Loading…

Fuzzy Correlated Association Mining: Selecting altered associations among the genes, and some possible marker genes mediating certain cancers

[Display omitted] •Proposed Fuzzy Correlated Association Mining from microarray gene expression data.•Identified altered gene–gene associations from normal to diseased state.•Demonstrated its effectiveness on five human gene expression cancer data sets.•Validated the results biologically as well as...

Full description

Saved in:
Bibliographic Details
Published in:Applied soft computing 2016-01, Vol.38, p.587-605
Main Authors: Ghosh, Anupam, De, Rajat K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •Proposed Fuzzy Correlated Association Mining from microarray gene expression data.•Identified altered gene–gene associations from normal to diseased state.•Demonstrated its effectiveness on five human gene expression cancer data sets.•Validated the results biologically as well as statistically.•Compared its performance with 11 existing association rule mining algorithms. Association mining is a well explored topic applied to various fields. In this article, the associations among the genes have been identified from microarray gene expression data. Here a methodology, called Fuzzy Correlated Association Mining (FCAM), is developed for identifying the associations among the genes that have altered quite significantly from normal state to diseased state with respect to their expression patterns. This idea leads to predict the disease mediating genes along with their altered associations. The proposed methodology involves generation of fuzzy gene sets, construction of fuzzy items, computation of fuzzy support for fuzzy items and fuzzy correlation coefficient of a pair of fuzzy items, generation of associations, and identification of altered associations from normal to diseased state. The concept of finding fuzzy correlation between two groups of items, generation of altered associations among the items (groups of items) and then rank these items (groups of items) according to their importance are the novel contribution of the present article. The effectiveness of the methodology has been demonstrated on five gene expression data sets dealing with human lung cancer, colon cancer, sarcoma, breast cancer and leukemia. As a result, some possible genes, like IGFBP3, ERBB2, TP53, HBB, KRAS, PTEN, CALCA, CDKN2A, has been found as important genes that may mediate the development of various cancers considered here. For comparison, we have considered 11 existing association rule mining algorithms. The results are appropriately validated in terms of gene–gene interactions, functional enrichment, biochemical pathways, and using NCBI database.
ISSN:1568-4946
1872-9681
DOI:10.1016/j.asoc.2015.09.057