Imputation of coding variants in African Americans: better performance using data from the exome sequencing project.

Duan Q, Liu EY, Auer PL, Zhang G, Lange EM, Jun G, Bizon C, Jiao S, Buyske S, Franceschini N, Carlson CS, Hsu L, Reiner AP, Peters U, Haessler J, Curtis K, Wassel CL, Robinson JG, Martin LW, Haiman CA, Le Marchand L, Matise TC, Hindorff LA, Crawford DC, Assimes TL, Kang HM, Heiss G, Jackson RD, Kooperberg C, Wilson JG, Abecasis GR, North KE, Nickerson DA, Lange LA, Li Y
Bioinformatics. 2013 29 (21): 2744-9

PMID: 23956302 · PMCID: PMC3799474 · DOI:10.1093/bioinformatics/btt477

SUMMARY - Although the 1000 Genomes haplotypes are the most commonly used reference panel for imputation, medical sequencing projects are generating large alternate sets of sequenced samples. Imputation in African Americans using 3384 haplotypes from the Exome Sequencing Project, compared with 2184 haplotypes from 1000 Genomes Project, increased effective sample size by 8.3-11.4% for coding variants with minor allele frequency <1%. No loss of imputation quality was observed using a panel built from phenotypic extremes. We recommend using haplotypes from Exome Sequencing Project alone or concatenation of the two panels over quality score-based post-imputation selection or IMPUTE2's two-panel combination.

CONTACT - yunli@med.unc.edu.

SUPPLEMENTARY INFORMATION - Supplementary data are available at Bioinformatics online.

MeSH Terms (11)

African Americans Exome Gene Frequency Genetic Variation Genome, Human Genome-Wide Association Study Haplotypes Humans Phenotype Polymorphism, Single Nucleotide Sequence Analysis, DNA

Connections (1)

This publication is referenced by other Labnodes entities:

Links