Three-stage quality control strategies for DNA re-sequencing data.

Guo Y, Ye F, Sheng Q, Clark T, Samuels DC
Brief Bioinform. 2014; 15(6): 879-89

PMID: 24067931 · PMCID: PMC4492405 · DOI:10.1093/bib/bbt069

Advances in next-generation sequencing (NGS) technologies have greatly improved our ability to detect genomic variants for biomedical research. In particular, NGS technologies have been recently applied with great success to the discovery of mutations associated with the growth of various tumours and in rare Mendelian diseases. The advance in NGS technologies has also created significant challenges in bioinformatics. One of the major challenges is quality control of the sequencing data. In this review, we discuss the proper quality control procedures and parameters for Illumina technology-based human DNA re-sequencing at three different stages of sequencing: raw data, alignment and variant calling. Monitoring quality control metrics at each of the three stages of NGS data provides unique and independent evaluations of data quality from differing perspectives. Properly conducting quality control protocols at all three stages and correctly interpreting the quality control results are crucial to ensure a successful and meaningful study.

© The Author 2013. Published by Oxford University Press. For Permissions, please email:

MeSH Terms (10)

Computational Biology; DNA; Gene Library; High-Throughput Nucleotide Sequencing; Humans; Neoplasms; Polymorphism, Single Nucleotide; Quality Control; Sequence Alignment; Sequence Analysis, DNA
