Loading…

Copy number variation analysis based on AluScan sequences

AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers...

Full description

Saved in:
Bibliographic Details
Published in:Journal of clinical bioinformatics 2014-12, Vol.4 (1), p.15-15, Article 15
Main Authors: Yang, Jian-Feng, Ding, Xiao-Fan, Chen, Lei, Mat, Wai-Kin, Xu, Michelle Zhi, Chen, Jin-Fei, Wang, Jian-Min, Xu, Lin, Poon, Wai-Sang, Kwong, Ava, Leung, Gilberto Ka-Kit, Tan, Tze-Ching, Yu, Chi-Hung, Ke, Yue-Bin, Xu, Xin-Yun, Ke, Xiao-Yan, Ma, Ronald Cw, Chan, Juliana Cn, Wan, Wei-Qing, Zhang, Li-Wei, Kumar, Yogesh, Tsang, Shui-Ying, Li, Shao, Wang, Hong-Yang, Xue, Hong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3
cites cdi_FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3
container_end_page 15
container_issue 1
container_start_page 15
container_title Journal of clinical bioinformatics
container_volume 4
creator Yang, Jian-Feng
Ding, Xiao-Fan
Chen, Lei
Mat, Wai-Kin
Xu, Michelle Zhi
Chen, Jin-Fei
Wang, Jian-Min
Xu, Lin
Poon, Wai-Sang
Kwong, Ava
Leung, Gilberto Ka-Kit
Tan, Tze-Ching
Yu, Chi-Hung
Ke, Yue-Bin
Xu, Xin-Yun
Ke, Xiao-Yan
Ma, Ronald Cw
Chan, Juliana Cn
Wan, Wei-Qing
Zhang, Li-Wei
Kumar, Yogesh
Tsang, Shui-Ying
Li, Shao
Wang, Hong-Yang
Xue, Hong
description AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing. In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained. The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.
doi_str_mv 10.1186/s13336-014-0015-z
format article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4273479</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A541250179</galeid><sourcerecordid>A541250179</sourcerecordid><originalsourceid>FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3</originalsourceid><addsrcrecordid>eNptkc1qGzEUhUVpaUKSB8gmDHTTzTj6H82mYEzaBAJZpFkLSXPlKsxI7shjsJ--cp0GGyotJF2dc3TFh9A1wTNClLzNhDEma0x4jTER9e4DOqeYs7olhH082p-hq5xfcRm8HBv1GZ1RIYRiAp-jdpFW2ypOg4Wx2pgxmHVIsTLR9NsccmVNhq4qlXk_PTsTqwy_J4gO8iX65E2f4eptvUAv3-9-Lu7rx6cfD4v5Y-2YIruaKSkdltCBxy3GrafG-vJ601DWMeudosYp6YEL21pPnSKiwcoaySS3jWcX6NshdzXZAToHcT2aXq_GMJhxq5MJ-vQmhl96mTaa04bxpi0BX98CxlR6z2s9hOyg702ENGVNJKcSt7yVRfrlIF2aHnSIPpVEt5frueCECkz-Bs7-oyqzgyG4FMGHUj8xkIPBjSnnEfx79wTrPUx9gKkLTL2HqXfFc3P87XfHP3TsDxgQmYU</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1642609496</pqid></control><display><type>article</type><title>Copy number variation analysis based on AluScan sequences</title><source>PubMed Central</source><creator>Yang, Jian-Feng ; Ding, Xiao-Fan ; Chen, Lei ; Mat, Wai-Kin ; Xu, Michelle Zhi ; Chen, Jin-Fei ; Wang, Jian-Min ; Xu, Lin ; Poon, Wai-Sang ; Kwong, Ava ; Leung, Gilberto Ka-Kit ; Tan, Tze-Ching ; Yu, Chi-Hung ; Ke, Yue-Bin ; Xu, Xin-Yun ; Ke, Xiao-Yan ; Ma, Ronald Cw ; Chan, Juliana Cn ; Wan, Wei-Qing ; Zhang, Li-Wei ; Kumar, Yogesh ; Tsang, Shui-Ying ; Li, Shao ; Wang, Hong-Yang ; Xue, Hong</creator><creatorcontrib>Yang, Jian-Feng ; Ding, Xiao-Fan ; Chen, Lei ; Mat, Wai-Kin ; Xu, Michelle Zhi ; Chen, Jin-Fei ; Wang, Jian-Min ; Xu, Lin ; Poon, Wai-Sang ; Kwong, Ava ; Leung, Gilberto Ka-Kit ; Tan, Tze-Ching ; Yu, Chi-Hung ; Ke, Yue-Bin ; Xu, Xin-Yun ; Ke, Xiao-Yan ; Ma, Ronald Cw ; Chan, Juliana Cn ; Wan, Wei-Qing ; Zhang, Li-Wei ; Kumar, Yogesh ; Tsang, Shui-Ying ; Li, Shao ; Wang, Hong-Yang ; Xue, Hong</creatorcontrib><description>AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing. In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained. The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.</description><identifier>ISSN: 2043-9113</identifier><identifier>EISSN: 2043-9113</identifier><identifier>DOI: 10.1186/s13336-014-0015-z</identifier><identifier>PMID: 25558350</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Algorithms ; Analysis ; Chromosomes ; DNA ; DNA sequencing ; Genetic research ; Genomes ; Genomics ; Gliomas ; Liver ; Liver cancer ; Machine learning ; Methodology ; Nucleotide sequencing</subject><ispartof>Journal of clinical bioinformatics, 2014-12, Vol.4 (1), p.15-15, Article 15</ispartof><rights>COPYRIGHT 2014 BioMed Central Ltd.</rights><rights>Yang et al.; licensee BioMed Central. 2014</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3</citedby><cites>FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4273479/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4273479/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27923,27924,53790,53792</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25558350$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Yang, Jian-Feng</creatorcontrib><creatorcontrib>Ding, Xiao-Fan</creatorcontrib><creatorcontrib>Chen, Lei</creatorcontrib><creatorcontrib>Mat, Wai-Kin</creatorcontrib><creatorcontrib>Xu, Michelle Zhi</creatorcontrib><creatorcontrib>Chen, Jin-Fei</creatorcontrib><creatorcontrib>Wang, Jian-Min</creatorcontrib><creatorcontrib>Xu, Lin</creatorcontrib><creatorcontrib>Poon, Wai-Sang</creatorcontrib><creatorcontrib>Kwong, Ava</creatorcontrib><creatorcontrib>Leung, Gilberto Ka-Kit</creatorcontrib><creatorcontrib>Tan, Tze-Ching</creatorcontrib><creatorcontrib>Yu, Chi-Hung</creatorcontrib><creatorcontrib>Ke, Yue-Bin</creatorcontrib><creatorcontrib>Xu, Xin-Yun</creatorcontrib><creatorcontrib>Ke, Xiao-Yan</creatorcontrib><creatorcontrib>Ma, Ronald Cw</creatorcontrib><creatorcontrib>Chan, Juliana Cn</creatorcontrib><creatorcontrib>Wan, Wei-Qing</creatorcontrib><creatorcontrib>Zhang, Li-Wei</creatorcontrib><creatorcontrib>Kumar, Yogesh</creatorcontrib><creatorcontrib>Tsang, Shui-Ying</creatorcontrib><creatorcontrib>Li, Shao</creatorcontrib><creatorcontrib>Wang, Hong-Yang</creatorcontrib><creatorcontrib>Xue, Hong</creatorcontrib><title>Copy number variation analysis based on AluScan sequences</title><title>Journal of clinical bioinformatics</title><addtitle>J Clin Bioinforma</addtitle><description>AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing. In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained. The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.</description><subject>Algorithms</subject><subject>Analysis</subject><subject>Chromosomes</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Genetic research</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Gliomas</subject><subject>Liver</subject><subject>Liver cancer</subject><subject>Machine learning</subject><subject>Methodology</subject><subject>Nucleotide sequencing</subject><issn>2043-9113</issn><issn>2043-9113</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNptkc1qGzEUhUVpaUKSB8gmDHTTzTj6H82mYEzaBAJZpFkLSXPlKsxI7shjsJ--cp0GGyotJF2dc3TFh9A1wTNClLzNhDEma0x4jTER9e4DOqeYs7olhH082p-hq5xfcRm8HBv1GZ1RIYRiAp-jdpFW2ypOg4Wx2pgxmHVIsTLR9NsccmVNhq4qlXk_PTsTqwy_J4gO8iX65E2f4eptvUAv3-9-Lu7rx6cfD4v5Y-2YIruaKSkdltCBxy3GrafG-vJ601DWMeudosYp6YEL21pPnSKiwcoaySS3jWcX6NshdzXZAToHcT2aXq_GMJhxq5MJ-vQmhl96mTaa04bxpi0BX98CxlR6z2s9hOyg702ENGVNJKcSt7yVRfrlIF2aHnSIPpVEt5frueCECkz-Bs7-oyqzgyG4FMGHUj8xkIPBjSnnEfx79wTrPUx9gKkLTL2HqXfFc3P87XfHP3TsDxgQmYU</recordid><startdate>20141205</startdate><enddate>20141205</enddate><creator>Yang, Jian-Feng</creator><creator>Ding, Xiao-Fan</creator><creator>Chen, Lei</creator><creator>Mat, Wai-Kin</creator><creator>Xu, Michelle Zhi</creator><creator>Chen, Jin-Fei</creator><creator>Wang, Jian-Min</creator><creator>Xu, Lin</creator><creator>Poon, Wai-Sang</creator><creator>Kwong, Ava</creator><creator>Leung, Gilberto Ka-Kit</creator><creator>Tan, Tze-Ching</creator><creator>Yu, Chi-Hung</creator><creator>Ke, Yue-Bin</creator><creator>Xu, Xin-Yun</creator><creator>Ke, Xiao-Yan</creator><creator>Ma, Ronald Cw</creator><creator>Chan, Juliana Cn</creator><creator>Wan, Wei-Qing</creator><creator>Zhang, Li-Wei</creator><creator>Kumar, Yogesh</creator><creator>Tsang, Shui-Ying</creator><creator>Li, Shao</creator><creator>Wang, Hong-Yang</creator><creator>Xue, Hong</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20141205</creationdate><title>Copy number variation analysis based on AluScan sequences</title><author>Yang, Jian-Feng ; Ding, Xiao-Fan ; Chen, Lei ; Mat, Wai-Kin ; Xu, Michelle Zhi ; Chen, Jin-Fei ; Wang, Jian-Min ; Xu, Lin ; Poon, Wai-Sang ; Kwong, Ava ; Leung, Gilberto Ka-Kit ; Tan, Tze-Ching ; Yu, Chi-Hung ; Ke, Yue-Bin ; Xu, Xin-Yun ; Ke, Xiao-Yan ; Ma, Ronald Cw ; Chan, Juliana Cn ; Wan, Wei-Qing ; Zhang, Li-Wei ; Kumar, Yogesh ; Tsang, Shui-Ying ; Li, Shao ; Wang, Hong-Yang ; Xue, Hong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Algorithms</topic><topic>Analysis</topic><topic>Chromosomes</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Genetic research</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Gliomas</topic><topic>Liver</topic><topic>Liver cancer</topic><topic>Machine learning</topic><topic>Methodology</topic><topic>Nucleotide sequencing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Jian-Feng</creatorcontrib><creatorcontrib>Ding, Xiao-Fan</creatorcontrib><creatorcontrib>Chen, Lei</creatorcontrib><creatorcontrib>Mat, Wai-Kin</creatorcontrib><creatorcontrib>Xu, Michelle Zhi</creatorcontrib><creatorcontrib>Chen, Jin-Fei</creatorcontrib><creatorcontrib>Wang, Jian-Min</creatorcontrib><creatorcontrib>Xu, Lin</creatorcontrib><creatorcontrib>Poon, Wai-Sang</creatorcontrib><creatorcontrib>Kwong, Ava</creatorcontrib><creatorcontrib>Leung, Gilberto Ka-Kit</creatorcontrib><creatorcontrib>Tan, Tze-Ching</creatorcontrib><creatorcontrib>Yu, Chi-Hung</creatorcontrib><creatorcontrib>Ke, Yue-Bin</creatorcontrib><creatorcontrib>Xu, Xin-Yun</creatorcontrib><creatorcontrib>Ke, Xiao-Yan</creatorcontrib><creatorcontrib>Ma, Ronald Cw</creatorcontrib><creatorcontrib>Chan, Juliana Cn</creatorcontrib><creatorcontrib>Wan, Wei-Qing</creatorcontrib><creatorcontrib>Zhang, Li-Wei</creatorcontrib><creatorcontrib>Kumar, Yogesh</creatorcontrib><creatorcontrib>Tsang, Shui-Ying</creatorcontrib><creatorcontrib>Li, Shao</creatorcontrib><creatorcontrib>Wang, Hong-Yang</creatorcontrib><creatorcontrib>Xue, Hong</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of clinical bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yang, Jian-Feng</au><au>Ding, Xiao-Fan</au><au>Chen, Lei</au><au>Mat, Wai-Kin</au><au>Xu, Michelle Zhi</au><au>Chen, Jin-Fei</au><au>Wang, Jian-Min</au><au>Xu, Lin</au><au>Poon, Wai-Sang</au><au>Kwong, Ava</au><au>Leung, Gilberto Ka-Kit</au><au>Tan, Tze-Ching</au><au>Yu, Chi-Hung</au><au>Ke, Yue-Bin</au><au>Xu, Xin-Yun</au><au>Ke, Xiao-Yan</au><au>Ma, Ronald Cw</au><au>Chan, Juliana Cn</au><au>Wan, Wei-Qing</au><au>Zhang, Li-Wei</au><au>Kumar, Yogesh</au><au>Tsang, Shui-Ying</au><au>Li, Shao</au><au>Wang, Hong-Yang</au><au>Xue, Hong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Copy number variation analysis based on AluScan sequences</atitle><jtitle>Journal of clinical bioinformatics</jtitle><addtitle>J Clin Bioinforma</addtitle><date>2014-12-05</date><risdate>2014</risdate><volume>4</volume><issue>1</issue><spage>15</spage><epage>15</epage><pages>15-15</pages><artnum>15</artnum><issn>2043-9113</issn><eissn>2043-9113</eissn><abstract>AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing. In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained. The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>25558350</pmid><doi>10.1186/s13336-014-0015-z</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2043-9113
ispartof Journal of clinical bioinformatics, 2014-12, Vol.4 (1), p.15-15, Article 15
issn 2043-9113
2043-9113
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4273479
source PubMed Central
subjects Algorithms
Analysis
Chromosomes
DNA
DNA sequencing
Genetic research
Genomes
Genomics
Gliomas
Liver
Liver cancer
Machine learning
Methodology
Nucleotide sequencing
title Copy number variation analysis based on AluScan sequences
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T19%3A01%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Copy%20number%20variation%20analysis%20based%20on%20AluScan%20sequences&rft.jtitle=Journal%20of%20clinical%20bioinformatics&rft.au=Yang,%20Jian-Feng&rft.date=2014-12-05&rft.volume=4&rft.issue=1&rft.spage=15&rft.epage=15&rft.pages=15-15&rft.artnum=15&rft.issn=2043-9113&rft.eissn=2043-9113&rft_id=info:doi/10.1186/s13336-014-0015-z&rft_dat=%3Cgale_pubme%3EA541250179%3C/gale_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c381z-3866c06edef09009f2abf5587723d3bfc82ac86fe45b9bf2c815708ba6364b7f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1642609496&rft_id=info:pmid/25558350&rft_galeid=A541250179&rfr_iscdi=true