Loading…
Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data
PAN-Cancer research aims at characterizing the common genetic alterations across cancer types to identify the set of similar and different biomarkers for many cancer types. Analyzing RNA-Sequencing data could assist in developing predictive models for cancer progression. As clinical data is prone to...
Saved in:
Published in: | Gene reports 2020-06, Vol.19, p.100654, Article 100654 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83 |
---|---|
cites | cdi_FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83 |
container_end_page | |
container_issue | |
container_start_page | 100654 |
container_title | Gene reports |
container_volume | 19 |
creator | Venkataramana, Lokeswari Jacob, Shomona Gracia Saraswathi, S. Venkata Vara Prasad, D. |
description | PAN-Cancer research aims at characterizing the common genetic alterations across cancer types to identify the set of similar and different biomarkers for many cancer types. Analyzing RNA-Sequencing data could assist in developing predictive models for cancer progression. As clinical data is prone to grow exponentially, applying computational methods on such data is very complex.
Parallel computational methods were exploited to weed out the problem of high computational complexity when applied on large clinical data. Parallelized Decremental Feature Selection (DFS) method was introduced to select pre-dominant genes from gene expressions of RNA-sequencing data. These selected genes were evaluated using parallelized classification models and tested using hold-out and 10-fold cross validation methods.
A computational study was performed on five cancer types namely PRAD (Prostate Adenocarcinoma), LUAD (Lung Cancer), BRCA (Breast Cancer), KIRC (Kidney Cancer) and COAD (Colon Cancer). These five cancer samples were segregated into five separate datasets and DFS was performed to unearth the common genes that play a major role in heterogeneous cancer types. Consequently the genes were investigated to identify the inter-cancer similarities and differences. The parallel classification methods yielded the classification accuracy of 93% to 98%.
The current research on investigating cancer lies in gene expressions of RNA- sequences. As RNA-Sequencing data has many features compared to number of instances, vertical partitioning and parallel Decremental Feature Selection method was employed on gene expressions of RNA-Sequencing for the first time to select more relevant genes (features) and classify cancer types accurately. The results prove to turn out higher predication accuracy with very less number of predictive genes. |
doi_str_mv | 10.1016/j.genrep.2020.100654 |
format | article |
fullrecord | <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_genrep_2020_100654</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2452014420300686</els_id><sourcerecordid>S2452014420300686</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83</originalsourceid><addsrcrecordid>eNp9kM1KAzEUhYMoWGrfwEVeYGqS-cnMRijFn0JREF2HTHJTUjvJmIxiffpmGBeuXN3LPXyHcw9C15QsKaHVzX65AxegXzLCxhOpyuIMzVhRsozQojj_s1-iRYx7QhLHacOLGfrZaHCDNVbJwXqHvcHKd13apNNY2xhtZw8y4Nb6ToZ3CBEbH5JiDISEYiWdgoCHYw9JCr7DKQ9g-O4DJNq7OJq-PK2yCB-f4JR1O6zlIK_QhZGHCIvfOUdv93ev68ds-_ywWa-2mcpJNWQNyLauGZG1oYbmnFbUVExLXWje6FpDK3lbtk1JKkkYL2XZcE15rhrKTWXqfI6KyVcFH2MAI_pg0y9HQYkYKxR7MVUoxgrFVGHCbicMUrYvC0FEZVN80DaAGoT29n-DE88ofk0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data</title><source>ScienceDirect Journals</source><creator>Venkataramana, Lokeswari ; Jacob, Shomona Gracia ; Saraswathi, S. ; Venkata Vara Prasad, D.</creator><creatorcontrib>Venkataramana, Lokeswari ; Jacob, Shomona Gracia ; Saraswathi, S. ; Venkata Vara Prasad, D.</creatorcontrib><description>PAN-Cancer research aims at characterizing the common genetic alterations across cancer types to identify the set of similar and different biomarkers for many cancer types. Analyzing RNA-Sequencing data could assist in developing predictive models for cancer progression. As clinical data is prone to grow exponentially, applying computational methods on such data is very complex.
Parallel computational methods were exploited to weed out the problem of high computational complexity when applied on large clinical data. Parallelized Decremental Feature Selection (DFS) method was introduced to select pre-dominant genes from gene expressions of RNA-sequencing data. These selected genes were evaluated using parallelized classification models and tested using hold-out and 10-fold cross validation methods.
A computational study was performed on five cancer types namely PRAD (Prostate Adenocarcinoma), LUAD (Lung Cancer), BRCA (Breast Cancer), KIRC (Kidney Cancer) and COAD (Colon Cancer). These five cancer samples were segregated into five separate datasets and DFS was performed to unearth the common genes that play a major role in heterogeneous cancer types. Consequently the genes were investigated to identify the inter-cancer similarities and differences. The parallel classification methods yielded the classification accuracy of 93% to 98%.
The current research on investigating cancer lies in gene expressions of RNA- sequences. As RNA-Sequencing data has many features compared to number of instances, vertical partitioning and parallel Decremental Feature Selection method was employed on gene expressions of RNA-Sequencing for the first time to select more relevant genes (features) and classify cancer types accurately. The results prove to turn out higher predication accuracy with very less number of predictive genes.</description><identifier>ISSN: 2452-0144</identifier><identifier>EISSN: 2452-0144</identifier><identifier>DOI: 10.1016/j.genrep.2020.100654</identifier><language>eng</language><publisher>Elsevier Inc</publisher><subject>Decremental feature selection ; Gene expression ; PAN-Cancer analysis ; Parallelized classification ; RNA-sequencing data</subject><ispartof>Gene reports, 2020-06, Vol.19, p.100654, Article 100654</ispartof><rights>2020 Elsevier Inc.</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83</citedby><cites>FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83</cites><orcidid>0000-0003-2331-4507</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S2452014420300686$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3547,27922,27923,45778</link.rule.ids></links><search><creatorcontrib>Venkataramana, Lokeswari</creatorcontrib><creatorcontrib>Jacob, Shomona Gracia</creatorcontrib><creatorcontrib>Saraswathi, S.</creatorcontrib><creatorcontrib>Venkata Vara Prasad, D.</creatorcontrib><title>Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data</title><title>Gene reports</title><description>PAN-Cancer research aims at characterizing the common genetic alterations across cancer types to identify the set of similar and different biomarkers for many cancer types. Analyzing RNA-Sequencing data could assist in developing predictive models for cancer progression. As clinical data is prone to grow exponentially, applying computational methods on such data is very complex.
Parallel computational methods were exploited to weed out the problem of high computational complexity when applied on large clinical data. Parallelized Decremental Feature Selection (DFS) method was introduced to select pre-dominant genes from gene expressions of RNA-sequencing data. These selected genes were evaluated using parallelized classification models and tested using hold-out and 10-fold cross validation methods.
A computational study was performed on five cancer types namely PRAD (Prostate Adenocarcinoma), LUAD (Lung Cancer), BRCA (Breast Cancer), KIRC (Kidney Cancer) and COAD (Colon Cancer). These five cancer samples were segregated into five separate datasets and DFS was performed to unearth the common genes that play a major role in heterogeneous cancer types. Consequently the genes were investigated to identify the inter-cancer similarities and differences. The parallel classification methods yielded the classification accuracy of 93% to 98%.
The current research on investigating cancer lies in gene expressions of RNA- sequences. As RNA-Sequencing data has many features compared to number of instances, vertical partitioning and parallel Decremental Feature Selection method was employed on gene expressions of RNA-Sequencing for the first time to select more relevant genes (features) and classify cancer types accurately. The results prove to turn out higher predication accuracy with very less number of predictive genes.</description><subject>Decremental feature selection</subject><subject>Gene expression</subject><subject>PAN-Cancer analysis</subject><subject>Parallelized classification</subject><subject>RNA-sequencing data</subject><issn>2452-0144</issn><issn>2452-0144</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kM1KAzEUhYMoWGrfwEVeYGqS-cnMRijFn0JREF2HTHJTUjvJmIxiffpmGBeuXN3LPXyHcw9C15QsKaHVzX65AxegXzLCxhOpyuIMzVhRsozQojj_s1-iRYx7QhLHacOLGfrZaHCDNVbJwXqHvcHKd13apNNY2xhtZw8y4Nb6ToZ3CBEbH5JiDISEYiWdgoCHYw9JCr7DKQ9g-O4DJNq7OJq-PK2yCB-f4JR1O6zlIK_QhZGHCIvfOUdv93ev68ds-_ywWa-2mcpJNWQNyLauGZG1oYbmnFbUVExLXWje6FpDK3lbtk1JKkkYL2XZcE15rhrKTWXqfI6KyVcFH2MAI_pg0y9HQYkYKxR7MVUoxgrFVGHCbicMUrYvC0FEZVN80DaAGoT29n-DE88ofk0</recordid><startdate>202006</startdate><enddate>202006</enddate><creator>Venkataramana, Lokeswari</creator><creator>Jacob, Shomona Gracia</creator><creator>Saraswathi, S.</creator><creator>Venkata Vara Prasad, D.</creator><general>Elsevier Inc</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-2331-4507</orcidid></search><sort><creationdate>202006</creationdate><title>Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data</title><author>Venkataramana, Lokeswari ; Jacob, Shomona Gracia ; Saraswathi, S. ; Venkata Vara Prasad, D.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Decremental feature selection</topic><topic>Gene expression</topic><topic>PAN-Cancer analysis</topic><topic>Parallelized classification</topic><topic>RNA-sequencing data</topic><toplevel>online_resources</toplevel><creatorcontrib>Venkataramana, Lokeswari</creatorcontrib><creatorcontrib>Jacob, Shomona Gracia</creatorcontrib><creatorcontrib>Saraswathi, S.</creatorcontrib><creatorcontrib>Venkata Vara Prasad, D.</creatorcontrib><collection>CrossRef</collection><jtitle>Gene reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Venkataramana, Lokeswari</au><au>Jacob, Shomona Gracia</au><au>Saraswathi, S.</au><au>Venkata Vara Prasad, D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data</atitle><jtitle>Gene reports</jtitle><date>2020-06</date><risdate>2020</risdate><volume>19</volume><spage>100654</spage><pages>100654-</pages><artnum>100654</artnum><issn>2452-0144</issn><eissn>2452-0144</eissn><abstract>PAN-Cancer research aims at characterizing the common genetic alterations across cancer types to identify the set of similar and different biomarkers for many cancer types. Analyzing RNA-Sequencing data could assist in developing predictive models for cancer progression. As clinical data is prone to grow exponentially, applying computational methods on such data is very complex.
Parallel computational methods were exploited to weed out the problem of high computational complexity when applied on large clinical data. Parallelized Decremental Feature Selection (DFS) method was introduced to select pre-dominant genes from gene expressions of RNA-sequencing data. These selected genes were evaluated using parallelized classification models and tested using hold-out and 10-fold cross validation methods.
A computational study was performed on five cancer types namely PRAD (Prostate Adenocarcinoma), LUAD (Lung Cancer), BRCA (Breast Cancer), KIRC (Kidney Cancer) and COAD (Colon Cancer). These five cancer samples were segregated into five separate datasets and DFS was performed to unearth the common genes that play a major role in heterogeneous cancer types. Consequently the genes were investigated to identify the inter-cancer similarities and differences. The parallel classification methods yielded the classification accuracy of 93% to 98%.
The current research on investigating cancer lies in gene expressions of RNA- sequences. As RNA-Sequencing data has many features compared to number of instances, vertical partitioning and parallel Decremental Feature Selection method was employed on gene expressions of RNA-Sequencing for the first time to select more relevant genes (features) and classify cancer types accurately. The results prove to turn out higher predication accuracy with very less number of predictive genes.</abstract><pub>Elsevier Inc</pub><doi>10.1016/j.genrep.2020.100654</doi><orcidid>https://orcid.org/0000-0003-2331-4507</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2452-0144 |
ispartof | Gene reports, 2020-06, Vol.19, p.100654, Article 100654 |
issn | 2452-0144 2452-0144 |
language | eng |
recordid | cdi_crossref_primary_10_1016_j_genrep_2020_100654 |
source | ScienceDirect Journals |
subjects | Decremental feature selection Gene expression PAN-Cancer analysis Parallelized classification RNA-sequencing data |
title | Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T15%3A14%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identification%20of%20common%20and%20dissimilar%20biomarkers%20for%20different%20cancer%20types%20from%20gene%20expressions%20of%20RNA-sequencing%20data&rft.jtitle=Gene%20reports&rft.au=Venkataramana,%20Lokeswari&rft.date=2020-06&rft.volume=19&rft.spage=100654&rft.pages=100654-&rft.artnum=100654&rft.issn=2452-0144&rft.eissn=2452-0144&rft_id=info:doi/10.1016/j.genrep.2020.100654&rft_dat=%3Celsevier_cross%3ES2452014420300686%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-9eab8820a8f1f137161f62dad4d79d8deba7b5b9506a0275a597d173c917f6f83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |