Loading…
SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size
Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of...
Saved in:
Published in: | Annals of human genetics 2005-11, Vol.69 (6), p.733-746 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713 |
---|---|
cites | cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713 |
container_end_page | 746 |
container_issue | 6 |
container_start_page | 733 |
container_title | Annals of human genetics |
container_volume | 69 |
creator | Pardi, F. Lewis, C. M. Whittaker, J. C. |
description | Summary
Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions. |
doi_str_mv | 10.1111/j.1529-8817.2005.00202.x |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_68765021</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>68765021</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</originalsourceid><addsrcrecordid>eNqNkE1vGjEQhq0qUUNJ_0LlU267mbF3_VHlglALldIEiVTqzXJ2Z1ujhSVrEJBfn11A6rHxxR7reWc0D2McIcXu3C5SzIVNjEGdCoA8BRAg0v0HNsBM2QQN2As2AACZZAbgin2KcQGAwmTyI7tCJZTKEAfs9_xhxudUU7EJzYpXTctHMTZF8Md6vtmWgeJX_tPvwzK8htUfPmt21HJftE2MvI-P_zahIO5X5ZE_8Hl4pWt2Wfk60ufzPWS_vn97Gk-T-8fJj_HoPimk1SIxucxRlLk2loyXKrciyyoCqSVWXqFAqzQIQUJXthQglS2s775MSc-gUQ7Zzanvum1ethQ3bhliQXXtV9Rso1NGqxzE_0G0WTct1x1oTuBxw5Yqt27D0rcHh-B6_W7hev2u1-96_e6o3-276JfzjO3zksp_wbPvDrg7AbtQ0-Hdjd1oOuke8g2xBZBU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>19412157</pqid></control><display><type>article</type><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><source>Wiley-Blackwell Read & Publish Collection</source><creator>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</creator><creatorcontrib>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</creatorcontrib><description>Summary
Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</description><identifier>ISSN: 0003-4800</identifier><identifier>EISSN: 1469-1809</identifier><identifier>DOI: 10.1111/j.1529-8817.2005.00202.x</identifier><identifier>PMID: 16266411</identifier><language>eng</language><publisher>350 Main Street , Malden , MA 02148 , USA , and 9600 Garsington Road , Oxford OX4 2DQ , UK: Blackwell Science Ltd</publisher><subject>Gene Frequency ; Genotype ; Haplotypes - genetics ; Humans ; Linkage Disequilibrium - genetics ; Matrix Metalloproteinase 2 - genetics ; Models, Genetic ; Polymorphism, Single Nucleotide - genetics ; Research Design ; Sample Size</subject><ispartof>Annals of human genetics, 2005-11, Vol.69 (6), p.733-746</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</citedby><cites>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16266411$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Pardi, F.</creatorcontrib><creatorcontrib>Lewis, C. M.</creatorcontrib><creatorcontrib>Whittaker, J. C.</creatorcontrib><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><title>Annals of human genetics</title><addtitle>Ann Hum Genet</addtitle><description>Summary
Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</description><subject>Gene Frequency</subject><subject>Genotype</subject><subject>Haplotypes - genetics</subject><subject>Humans</subject><subject>Linkage Disequilibrium - genetics</subject><subject>Matrix Metalloproteinase 2 - genetics</subject><subject>Models, Genetic</subject><subject>Polymorphism, Single Nucleotide - genetics</subject><subject>Research Design</subject><subject>Sample Size</subject><issn>0003-4800</issn><issn>1469-1809</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqNkE1vGjEQhq0qUUNJ_0LlU267mbF3_VHlglALldIEiVTqzXJ2Z1ujhSVrEJBfn11A6rHxxR7reWc0D2McIcXu3C5SzIVNjEGdCoA8BRAg0v0HNsBM2QQN2As2AACZZAbgin2KcQGAwmTyI7tCJZTKEAfs9_xhxudUU7EJzYpXTctHMTZF8Md6vtmWgeJX_tPvwzK8htUfPmt21HJftE2MvI-P_zahIO5X5ZE_8Hl4pWt2Wfk60ufzPWS_vn97Gk-T-8fJj_HoPimk1SIxucxRlLk2loyXKrciyyoCqSVWXqFAqzQIQUJXthQglS2s775MSc-gUQ7Zzanvum1ethQ3bhliQXXtV9Rso1NGqxzE_0G0WTct1x1oTuBxw5Yqt27D0rcHh-B6_W7hev2u1-96_e6o3-276JfzjO3zksp_wbPvDrg7AbtQ0-Hdjd1oOuke8g2xBZBU</recordid><startdate>200511</startdate><enddate>200511</enddate><creator>Pardi, F.</creator><creator>Lewis, C. M.</creator><creator>Whittaker, J. C.</creator><general>Blackwell Science Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>200511</creationdate><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><author>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Gene Frequency</topic><topic>Genotype</topic><topic>Haplotypes - genetics</topic><topic>Humans</topic><topic>Linkage Disequilibrium - genetics</topic><topic>Matrix Metalloproteinase 2 - genetics</topic><topic>Models, Genetic</topic><topic>Polymorphism, Single Nucleotide - genetics</topic><topic>Research Design</topic><topic>Sample Size</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pardi, F.</creatorcontrib><creatorcontrib>Lewis, C. M.</creatorcontrib><creatorcontrib>Whittaker, J. C.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Annals of human genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pardi, F.</au><au>Lewis, C. M.</au><au>Whittaker, J. C.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</atitle><jtitle>Annals of human genetics</jtitle><addtitle>Ann Hum Genet</addtitle><date>2005-11</date><risdate>2005</risdate><volume>69</volume><issue>6</issue><spage>733</spage><epage>746</epage><pages>733-746</pages><issn>0003-4800</issn><eissn>1469-1809</eissn><abstract>Summary
Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</abstract><cop>350 Main Street , Malden , MA 02148 , USA , and 9600 Garsington Road , Oxford OX4 2DQ , UK</cop><pub>Blackwell Science Ltd</pub><pmid>16266411</pmid><doi>10.1111/j.1529-8817.2005.00202.x</doi><tpages>14</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0003-4800 |
ispartof | Annals of human genetics, 2005-11, Vol.69 (6), p.733-746 |
issn | 0003-4800 1469-1809 |
language | eng |
recordid | cdi_proquest_miscellaneous_68765021 |
source | Wiley-Blackwell Read & Publish Collection |
subjects | Gene Frequency Genotype Haplotypes - genetics Humans Linkage Disequilibrium - genetics Matrix Metalloproteinase 2 - genetics Models, Genetic Polymorphism, Single Nucleotide - genetics Research Design Sample Size |
title | SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T21%3A28%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SNP%20Selection%20for%20Association%20Studies:%20Maximizing%20Power%20across%20SNP%20Choice%20and%20Study%20Size&rft.jtitle=Annals%20of%20human%20genetics&rft.au=Pardi,%20F.&rft.date=2005-11&rft.volume=69&rft.issue=6&rft.spage=733&rft.epage=746&rft.pages=733-746&rft.issn=0003-4800&rft.eissn=1469-1809&rft_id=info:doi/10.1111/j.1529-8817.2005.00202.x&rft_dat=%3Cproquest_cross%3E68765021%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=19412157&rft_id=info:pmid/16266411&rfr_iscdi=true |