Loading…

SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size

Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of...

Full description

Saved in:
Bibliographic Details
Published in:Annals of human genetics 2005-11, Vol.69 (6), p.733-746
Main Authors: Pardi, F., Lewis, C. M., Whittaker, J. C.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713
cites cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713
container_end_page 746
container_issue 6
container_start_page 733
container_title Annals of human genetics
container_volume 69
creator Pardi, F.
Lewis, C. M.
Whittaker, J. C.
description Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.
doi_str_mv 10.1111/j.1529-8817.2005.00202.x
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_68765021</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>68765021</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</originalsourceid><addsrcrecordid>eNqNkE1vGjEQhq0qUUNJ_0LlU267mbF3_VHlglALldIEiVTqzXJ2Z1ujhSVrEJBfn11A6rHxxR7reWc0D2McIcXu3C5SzIVNjEGdCoA8BRAg0v0HNsBM2QQN2As2AACZZAbgin2KcQGAwmTyI7tCJZTKEAfs9_xhxudUU7EJzYpXTctHMTZF8Md6vtmWgeJX_tPvwzK8htUfPmt21HJftE2MvI-P_zahIO5X5ZE_8Hl4pWt2Wfk60ufzPWS_vn97Gk-T-8fJj_HoPimk1SIxucxRlLk2loyXKrciyyoCqSVWXqFAqzQIQUJXthQglS2s775MSc-gUQ7Zzanvum1ethQ3bhliQXXtV9Rso1NGqxzE_0G0WTct1x1oTuBxw5Yqt27D0rcHh-B6_W7hev2u1-96_e6o3-276JfzjO3zksp_wbPvDrg7AbtQ0-Hdjd1oOuke8g2xBZBU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>19412157</pqid></control><display><type>article</type><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><source>Wiley-Blackwell Read &amp; Publish Collection</source><creator>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</creator><creatorcontrib>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</creatorcontrib><description>Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</description><identifier>ISSN: 0003-4800</identifier><identifier>EISSN: 1469-1809</identifier><identifier>DOI: 10.1111/j.1529-8817.2005.00202.x</identifier><identifier>PMID: 16266411</identifier><language>eng</language><publisher>350 Main Street , Malden , MA 02148 , USA , and 9600 Garsington Road , Oxford OX4 2DQ , UK: Blackwell Science Ltd</publisher><subject>Gene Frequency ; Genotype ; Haplotypes - genetics ; Humans ; Linkage Disequilibrium - genetics ; Matrix Metalloproteinase 2 - genetics ; Models, Genetic ; Polymorphism, Single Nucleotide - genetics ; Research Design ; Sample Size</subject><ispartof>Annals of human genetics, 2005-11, Vol.69 (6), p.733-746</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</citedby><cites>FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16266411$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Pardi, F.</creatorcontrib><creatorcontrib>Lewis, C. M.</creatorcontrib><creatorcontrib>Whittaker, J. C.</creatorcontrib><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><title>Annals of human genetics</title><addtitle>Ann Hum Genet</addtitle><description>Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</description><subject>Gene Frequency</subject><subject>Genotype</subject><subject>Haplotypes - genetics</subject><subject>Humans</subject><subject>Linkage Disequilibrium - genetics</subject><subject>Matrix Metalloproteinase 2 - genetics</subject><subject>Models, Genetic</subject><subject>Polymorphism, Single Nucleotide - genetics</subject><subject>Research Design</subject><subject>Sample Size</subject><issn>0003-4800</issn><issn>1469-1809</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqNkE1vGjEQhq0qUUNJ_0LlU267mbF3_VHlglALldIEiVTqzXJ2Z1ujhSVrEJBfn11A6rHxxR7reWc0D2McIcXu3C5SzIVNjEGdCoA8BRAg0v0HNsBM2QQN2As2AACZZAbgin2KcQGAwmTyI7tCJZTKEAfs9_xhxudUU7EJzYpXTctHMTZF8Md6vtmWgeJX_tPvwzK8htUfPmt21HJftE2MvI-P_zahIO5X5ZE_8Hl4pWt2Wfk60ufzPWS_vn97Gk-T-8fJj_HoPimk1SIxucxRlLk2loyXKrciyyoCqSVWXqFAqzQIQUJXthQglS2s775MSc-gUQ7Zzanvum1ethQ3bhliQXXtV9Rso1NGqxzE_0G0WTct1x1oTuBxw5Yqt27D0rcHh-B6_W7hev2u1-96_e6o3-276JfzjO3zksp_wbPvDrg7AbtQ0-Hdjd1oOuke8g2xBZBU</recordid><startdate>200511</startdate><enddate>200511</enddate><creator>Pardi, F.</creator><creator>Lewis, C. M.</creator><creator>Whittaker, J. C.</creator><general>Blackwell Science Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>200511</creationdate><title>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</title><author>Pardi, F. ; Lewis, C. M. ; Whittaker, J. C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Gene Frequency</topic><topic>Genotype</topic><topic>Haplotypes - genetics</topic><topic>Humans</topic><topic>Linkage Disequilibrium - genetics</topic><topic>Matrix Metalloproteinase 2 - genetics</topic><topic>Models, Genetic</topic><topic>Polymorphism, Single Nucleotide - genetics</topic><topic>Research Design</topic><topic>Sample Size</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pardi, F.</creatorcontrib><creatorcontrib>Lewis, C. M.</creatorcontrib><creatorcontrib>Whittaker, J. C.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Annals of human genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pardi, F.</au><au>Lewis, C. M.</au><au>Whittaker, J. C.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size</atitle><jtitle>Annals of human genetics</jtitle><addtitle>Ann Hum Genet</addtitle><date>2005-11</date><risdate>2005</risdate><volume>69</volume><issue>6</issue><spage>733</spage><epage>746</epage><pages>733-746</pages><issn>0003-4800</issn><eissn>1469-1809</eissn><abstract>Summary Selection of single nucleotide polymorphisms (SNPs) is a problem of primary importance in association studies and several approaches have been proposed. However, none provides a satisfying answer to the problem of how many SNPs should be selected, and how this should depend on the pattern of linkage disequilibrium (LD) in the region under consideration. Moreover, SNP selection is usually considered as independent from deciding the sample size of the study. However, when resources are limited there is a tradeoff between the study size and the number of SNPs to genotype. We show that tuning the SNP density to the LD pattern can be achieved by looking for the best solution to this tradeoff. Our approach consists of formulating SNP selection as an optimization problem: the objective is to maximize the power of the final association study, whilst keeping the total costs below a given budget. We also propose two alternative algorithms for the solution of this optimization problem: a genetic algorithm and a hill climbing search. These standard techniques efficiently find good solutions, even when the number of possible SNPs to choose from is large. We compare the performance of these two algorithms on different chromosomal regions and show that, as expected, the selected SNPs reflect the LD pattern: the optimal SNP density varies dramatically between chromosomal regions.</abstract><cop>350 Main Street , Malden , MA 02148 , USA , and 9600 Garsington Road , Oxford OX4 2DQ , UK</cop><pub>Blackwell Science Ltd</pub><pmid>16266411</pmid><doi>10.1111/j.1529-8817.2005.00202.x</doi><tpages>14</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0003-4800
ispartof Annals of human genetics, 2005-11, Vol.69 (6), p.733-746
issn 0003-4800
1469-1809
language eng
recordid cdi_proquest_miscellaneous_68765021
source Wiley-Blackwell Read & Publish Collection
subjects Gene Frequency
Genotype
Haplotypes - genetics
Humans
Linkage Disequilibrium - genetics
Matrix Metalloproteinase 2 - genetics
Models, Genetic
Polymorphism, Single Nucleotide - genetics
Research Design
Sample Size
title SNP Selection for Association Studies: Maximizing Power across SNP Choice and Study Size
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T21%3A28%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SNP%20Selection%20for%20Association%20Studies:%20Maximizing%20Power%20across%20SNP%20Choice%20and%20Study%20Size&rft.jtitle=Annals%20of%20human%20genetics&rft.au=Pardi,%20F.&rft.date=2005-11&rft.volume=69&rft.issue=6&rft.spage=733&rft.epage=746&rft.pages=733-746&rft.issn=0003-4800&rft.eissn=1469-1809&rft_id=info:doi/10.1111/j.1529-8817.2005.00202.x&rft_dat=%3Cproquest_cross%3E68765021%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3972-853512d5789e8a3659244fe03731fa6121967022e27f9d20369c9a6708deb0713%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=19412157&rft_id=info:pmid/16266411&rfr_iscdi=true