Linkage-Disequilibrium-Based Binning Affects the Interpretation of GWASs

Bibliographic Details
Published in: American Journal of Human Genetics, 2012-04, Vol. 90 (4), p. 727-733
Main Authors: Christoforou, Andrea; Dondrup, Michael; Mattingsdal, Morten; Mattheisen, Manuel; Giddaluru, Sudheer; Nöthen, Markus M.; Rietschel, Marcella; Cichon, Sven; Djurovic, Srdjan; Andreassen, Ole A.; Jonassen, Inge; Steen, Vidar M.; Puntervoll, Pål; Le Hellard, Stéphanie
Format: Article
Language: English
Description
Summary: Genome-wide association studies (GWASs) are critically dependent on detailed knowledge of the pattern of linkage disequilibrium (LD) in the human genome. GWASs generate lists of variants, usually SNPs, ranked according to the significance of their association to a trait. Downstream analyses generally focus on the gene or genes that are physically closest to these SNPs and ignore their LD profile with other SNPs. We have developed a flexible R package (LDsnpR) that efficiently assigns SNPs to genes on the basis of both their physical position and their pairwise LD with other SNPs. We used the positional-binning and LD-based-binning approaches to investigate whether including these “LD-based” SNPs would affect the interpretation of three published GWASs on bipolar affective disorder (BP) and of the imputed versions of two of these GWASs. We show how including LD can be important for interpreting and comparing GWASs. In the published, unimputed GWASs, LD-based binning effectively “recovered” 6.1%–8.3% of Ensembl-defined genes. It altered the ranks of the genes and resulted in nonnegligible differences between the lists of the top 2,000 genes emerging from the two binning approaches. It also improved the overall gene-based concordance between independent BP studies. In the imputed datasets, although the increases in coverage (>0.4%) and rank changes were more modest, even greater concordance between the studies was observed, attesting to the potential of LD-based binning on imputed data as well. Thus, ignoring LD can result in the misinterpretation of the GWAS findings and have an impact on subsequent genetic and functional studies.
ISSN: 0002-9297; 1537-6605
DOI: 10.1016/j.ajhg.2012.02.025
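
Illustrative sketch: the summary contrasts positional binning (a SNP is assigned to a gene when it lies within the gene's boundaries) with LD-based binning (a SNP is also assigned when it is in LD with a SNP inside the gene). The Python below is a minimal toy version of that idea and is not the LDsnpR API. The function names, the `flank` parameter, the r² cutoff of 0.8, the reference-only SNP `ref1`, and all coordinates are assumptions made up for illustration.

```python
from collections import defaultdict

# Hypothetical toy data: gene -> (chromosome, start, end).
genes = {"GENE_A": ("1", 1000, 2000), "GENE_B": ("1", 5000, 6000)}

# SNP -> (chromosome, position). "ref1" stands for a reference-panel SNP
# that was not genotyped in the GWAS (an assumption of this sketch).
positions = {
    "rs1": ("1", 1500),
    "rs2": ("1", 3000),
    "rs3": ("1", 9000),
    "ref1": ("1", 5500),
}
gwas_snps = {"rs1", "rs2", "rs3"}

# Pairwise LD as (snp_a, snp_b, r2); values are invented.
ld_pairs = [("rs2", "ref1", 0.92), ("rs3", "ref1", 0.30), ("rs1", "rs2", 0.10)]


def snps_inside(gene_region, candidates, flank=0):
    """Return the candidate SNPs located within the gene (plus optional flank)."""
    chrom, start, end = gene_region
    return {
        snp
        for snp in candidates
        if positions[snp][0] == chrom
        and start - flank <= positions[snp][1] <= end + flank
    }


def positional_bins(flank=0):
    """Assign each GWAS SNP to the genes it physically falls in."""
    return {g: snps_inside(region, gwas_snps, flank) for g, region in genes.items()}


def ld_based_bins(r2_min=0.8, flank=0):
    """Positional bins plus GWAS SNPs in LD with any SNP inside the gene."""
    partners = defaultdict(set)
    for a, b, r2 in ld_pairs:
        if r2 >= r2_min:  # assumed cutoff, not taken from the record
            partners[a].add(b)
            partners[b].add(a)

    bins = positional_bins(flank)
    for gene, region in genes.items():
        inside = snps_inside(region, positions, flank)  # genotyped or not
        for snp in gwas_snps:
            if partners.get(snp, set()) & inside:
                bins[gene].add(snp)
    return bins


pos_bins = positional_bins()
ld_bins = ld_based_bins()

# Genes with no SNP by position that gain at least one SNP through LD.
recovered = sorted(g for g in genes if not pos_bins[g] and ld_bins[g])
print(recovered)
```

In this toy setup GENE_B contains no genotyped SNP, so positional binning leaves it empty, while LD-based binning assigns it rs2 through its LD with ref1. This mirrors, in miniature, the sense in which the summary speaks of genes being "recovered".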