Loading…
TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites...
Saved in:
Published in: | BMC bioinformatics 2018-04, Vol.19 (1), p.126-126, Article 126 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363 |
---|---|
cites | cdi_FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363 |
container_end_page | 126 |
container_issue | 1 |
container_start_page | 126 |
container_title | BMC bioinformatics |
container_volume | 19 |
creator | Farris, M Heath Scott, Andrew R Texter, Pamela A Bartlett, Marta Coleman, Patricia Masters, David |
description | Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms.
The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands.
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies. |
doi_str_mv | 10.1186/s12859-018-2133-2 |
format | article |
fullrecord | <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_6cc83976c1a843abaee08c3ba2ed790b</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A546841458</galeid><doaj_id>oai_doaj_org_article_6cc83976c1a843abaee08c3ba2ed790b</doaj_id><sourcerecordid>A546841458</sourcerecordid><originalsourceid>FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363</originalsourceid><addsrcrecordid>eNptkktv1DAUhSMEoqXwA9ggS2xgkeJH_AiLSqPyGqkqiJa15cdN6uLEQ5ypmH-PS0rVkZAX8eM7R7lHp6peEnxMiBLvMqGKtzUmqqaEsZo-qg5JI0k5Yf74wf6gepbzNcZEKsyfVge0FQ1VrD2sNpfr1XtkYp-mMF8NGXVpQh5uIKbNAOOMUoeCL5sw7-oYxp_g0cX5NxRyNKNfcDOauMshI7tDg8k5FPUObcxkYoSIPpyvUIZfWxhdGPvn1ZPOxAwv7r5H1Y9PHy9Pv9RnXz-vT1dnteNCzLXlXnAFILnohLJSSEmI9C2TwihrGQHnFaNOQdtaRpnkUjhqjWuwaT0T7KhaL74-mWu9mcJgpp1OJui_F2nqtZnm4CJo4VzJouiJUQ0z1gBg5Zg1FLxssS1eJ4vXZmsH8K7EUWbbM91_GcOV7tON5qoVhLXF4M2dwZRKEHnWQ8gOYskQ0jZrimnTSFqGLejrBe1N-bUwdqk4ultcr3gjVEMargp1_B-qLA9DcGmELpT7PcHbPUFhZvg992abs15ffN9nycK6KeU8QXc_KcH6tnh6KZ4uxdO3xdO0aF49jOhe8a9p7A9TvdNI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2024472756</pqid></control><display><type>article</type><title>TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing</title><source>PubMed (Medline)</source><source>ProQuest Publicly Available Content database</source><creator>Farris, M Heath ; Scott, Andrew R ; Texter, Pamela A ; Bartlett, Marta ; Coleman, Patricia ; Masters, David</creator><creatorcontrib>Farris, M Heath ; Scott, Andrew R ; Texter, Pamela A ; Bartlett, Marta ; Coleman, Patricia ; Masters, David</creatorcontrib><description>Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms.
The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands.
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-018-2133-2</identifier><identifier>PMID: 29642839</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Algorithm ; Analysis ; DNA sequencing ; Gene mutation ; Human identity ; Massively parallel sequencing (MPS) ; Population frequency ; Single nucleotide polymorphism (SNP) ; Single nucleotide polymorphisms</subject><ispartof>BMC bioinformatics, 2018-04, Vol.19 (1), p.126-126, Article 126</ispartof><rights>COPYRIGHT 2018 BioMed Central Ltd.</rights><rights>The Author(s). 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363</citedby><cites>FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896139/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896139/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,37013,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29642839$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Farris, M Heath</creatorcontrib><creatorcontrib>Scott, Andrew R</creatorcontrib><creatorcontrib>Texter, Pamela A</creatorcontrib><creatorcontrib>Bartlett, Marta</creatorcontrib><creatorcontrib>Coleman, Patricia</creatorcontrib><creatorcontrib>Masters, David</creatorcontrib><title>TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms.
The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands.
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.</description><subject>Algorithm</subject><subject>Analysis</subject><subject>DNA sequencing</subject><subject>Gene mutation</subject><subject>Human identity</subject><subject>Massively parallel sequencing (MPS)</subject><subject>Population frequency</subject><subject>Single nucleotide polymorphism (SNP)</subject><subject>Single nucleotide polymorphisms</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNptkktv1DAUhSMEoqXwA9ggS2xgkeJH_AiLSqPyGqkqiJa15cdN6uLEQ5ypmH-PS0rVkZAX8eM7R7lHp6peEnxMiBLvMqGKtzUmqqaEsZo-qg5JI0k5Yf74wf6gepbzNcZEKsyfVge0FQ1VrD2sNpfr1XtkYp-mMF8NGXVpQh5uIKbNAOOMUoeCL5sw7-oYxp_g0cX5NxRyNKNfcDOauMshI7tDg8k5FPUObcxkYoSIPpyvUIZfWxhdGPvn1ZPOxAwv7r5H1Y9PHy9Pv9RnXz-vT1dnteNCzLXlXnAFILnohLJSSEmI9C2TwihrGQHnFaNOQdtaRpnkUjhqjWuwaT0T7KhaL74-mWu9mcJgpp1OJui_F2nqtZnm4CJo4VzJouiJUQ0z1gBg5Zg1FLxssS1eJ4vXZmsH8K7EUWbbM91_GcOV7tON5qoVhLXF4M2dwZRKEHnWQ8gOYskQ0jZrimnTSFqGLejrBe1N-bUwdqk4ultcr3gjVEMargp1_B-qLA9DcGmELpT7PcHbPUFhZvg992abs15ffN9nycK6KeU8QXc_KcH6tnh6KZ4uxdO3xdO0aF49jOhe8a9p7A9TvdNI</recordid><startdate>20180411</startdate><enddate>20180411</enddate><creator>Farris, M Heath</creator><creator>Scott, Andrew R</creator><creator>Texter, Pamela A</creator><creator>Bartlett, Marta</creator><creator>Coleman, Patricia</creator><creator>Masters, David</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20180411</creationdate><title>TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing</title><author>Farris, M Heath ; Scott, Andrew R ; Texter, Pamela A ; Bartlett, Marta ; Coleman, Patricia ; Masters, David</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Algorithm</topic><topic>Analysis</topic><topic>DNA sequencing</topic><topic>Gene mutation</topic><topic>Human identity</topic><topic>Massively parallel sequencing (MPS)</topic><topic>Population frequency</topic><topic>Single nucleotide polymorphism (SNP)</topic><topic>Single nucleotide polymorphisms</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Farris, M Heath</creatorcontrib><creatorcontrib>Scott, Andrew R</creatorcontrib><creatorcontrib>Texter, Pamela A</creatorcontrib><creatorcontrib>Bartlett, Marta</creatorcontrib><creatorcontrib>Coleman, Patricia</creatorcontrib><creatorcontrib>Masters, David</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Farris, M Heath</au><au>Scott, Andrew R</au><au>Texter, Pamela A</au><au>Bartlett, Marta</au><au>Coleman, Patricia</au><au>Masters, David</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2018-04-11</date><risdate>2018</risdate><volume>19</volume><issue>1</issue><spage>126</spage><epage>126</epage><pages>126-126</pages><artnum>126</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms.
The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands.
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>29642839</pmid><doi>10.1186/s12859-018-2133-2</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1471-2105 |
ispartof | BMC bioinformatics, 2018-04, Vol.19 (1), p.126-126, Article 126 |
issn | 1471-2105 1471-2105 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_6cc83976c1a843abaee08c3ba2ed790b |
source | PubMed (Medline); ProQuest Publicly Available Content database |
subjects | Algorithm Analysis DNA sequencing Gene mutation Human identity Massively parallel sequencing (MPS) Population frequency Single nucleotide polymorphism (SNP) Single nucleotide polymorphisms |
title | TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T04%3A50%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=TIA:%20algorithms%20for%20development%20of%20identity-linked%20SNP%20islands%20for%20analysis%20by%20massively%20parallel%20DNA%20sequencing&rft.jtitle=BMC%20bioinformatics&rft.au=Farris,%20M%20Heath&rft.date=2018-04-11&rft.volume=19&rft.issue=1&rft.spage=126&rft.epage=126&rft.pages=126-126&rft.artnum=126&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-018-2133-2&rft_dat=%3Cgale_doaj_%3EA546841458%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c566t-b5d658ee756f68b7677117d9376a8bb31ecd832c8e99b3237576c2bac40a9d363%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2024472756&rft_id=info:pmid/29642839&rft_galeid=A546841458&rfr_iscdi=true |