Loading…

1006-LBP

Aim Analyses of highly polymorphic HLA and KIR locus data require specialized tools and methods, but most modern analytical programs are tailored for SNPs. A typical analytical workflow for HLA and KIR data requires trafficking data between several programs, which is time intensive, error prone and...

Full description

Saved in:
Bibliographic Details
Published in:Human immunology 2014-06, Vol.75 (6), p.481-481
Main Authors: Mack, Steven J, Yamaguchi, Ken D, Salamon, Hugh, Pena, Anthony G, Horton, Robert M, Francis-Lyon, Patricia, Maiers, Martin, Hollenbach, Jill A
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aim Analyses of highly polymorphic HLA and KIR locus data require specialized tools and methods, but most modern analytical programs are tailored for SNPs. A typical analytical workflow for HLA and KIR data requires trafficking data between several programs, which is time intensive, error prone and limits reproducibility. The IDAWG has been developing an integrated data-management and analysis system tailored to these important genetic systems. Methods We have developed the Toolkit for Immunogenomic Data Exchange and Storage (TIDES) and Bridging ImmunoGenomic Data-Analysis Workflow Gaps (BIGDAWG) systems to address the unmet need for consistent HLA and KIR data management and analysis. TIDES is a free, open-source system that converts HLA and KIR genotypes derived from widely used genotyping platforms into Genotype List (GL) Strings, which are registered with the NMDP’s GL Service (gl.immunogenomics.org). BIGDAWG is an automated pipeline, scripted in R, that performs common data analyses of multi-locus highly polymorphic genetic data characteristic of HLA and KIR genes. Results TIDES stores GL String-encoded data with project-related data, and exports these to PED and POP data-analysis formats. TIDES can be deployed on an AWS EC2 or equivalent Linux environment, on a local network or on a standalone machine, per user preference. A prototype TIDES implementation is deployed at tides.immunogenomics.org. Starting with unambiguous multi-locus genotype data for case-control groups, BIGDAWG estimates user-specified haplotypes, bins low-frequency haplotypes to enable chi-squared testing, calculates odds-ratios, confidence intervals and p-values for each haplotype, and generates figures and tables for each comparison. Conclusions The integration of TIDES and BIGDAWG will create a workflow that accepts ambiguous genotype data and returns case-control analysis results. This approach streamlines data analysis and allows the consistent storage and exchange of HLA and KIR genotype data.
ISSN:0198-8859
DOI:10.1016/j.humimm.2014.01.018