A haplotype-based framework for group-wise transmission/disequilibrium tests for rare variant association analysis.

Chen R, Wei Q, Zhan X, Zhong X, Sutcliffe JS, Cox NJ, Cook EH, Li C, Chen W, Li B
Bioinformatics. 2015 31 (9): 1452-9

PMID: 25568282 · PMCID: PMC4410665 · DOI:10.1093/bioinformatics/btu860

MOTIVATION - A major focus of current sequencing studies for human genetics is to identify rare variants associated with complex diseases. Aside from reduced power of detecting associated rare variants, controlling for population stratification is particularly challenging for rare variants. Transmission/disequilibrium tests (TDT) based on family designs are robust to population stratification and admixture, and therefore provide an effective approach to rare variant association studies to eliminate spurious associations. To increase power of rare variant association analysis, gene-based collapsing methods become standard approaches for analyzing rare variants. Existing methods that extend this strategy to rare variants in families usually combine TDT statistics at individual variants and therefore lack the flexibility of incorporating other genetic models.

RESULTS - In this study, we describe a haplotype-based framework for group-wise TDT (gTDT) that is flexible to encompass a variety of genetic models such as additive, dominant and compound heterozygous (CH) (i.e. recessive) models as well as other complex interactions. Unlike existing methods, gTDT constructs haplotypes by transmission when possible and inherently takes into account the linkage disequilibrium among variants. Through extensive simulations we showed that type I error was correctly controlled for rare variants under all models investigated, and this remained true in the presence of population stratification. Under a variety of genetic models, gTDT showed increased power compared with the single marker TDT. Application of gTDT to an autism exome sequencing data of 118 trios identified potentially interesting candidate genes with CH rare variants.

AVAILABILITY AND IMPLEMENTATION - We implemented gTDT in C++ and the source code and the detailed usage are available on the authors' website (https://medschool.vanderbilt.edu/cgg).

CONTACT - bingshan.li@vanderbilt.edu or wei.chen@chp.edu

SUPPLEMENTARY INFORMATION - Supplementary data are available at Bioinformatics online.

© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

MeSH Terms (11)

Autistic Disorder Computer Simulation Data Interpretation, Statistical Exome Genetic Association Studies Genetic Variation Haplotypes Humans Linkage Disequilibrium Models, Genetic Sequence Analysis, DNA

Connections (3)

This publication is referenced by other Labnodes entities:

Links