Dana Crawford
Last active: 5/23/2014


My primary research interest is how genetic variation impacts common, complex human phenotypes. The Human Genome Project, the International HapMap Project, the 1000 Genomes Project, and other public/private projects have generated an enormous amount of data for common and rare genetic variation across human populations. Currently, greater than 10 million single nucleotide polymorphisms (SNPs), the most common form of DNA variation, are available in public databases, and advances in technology now make it possible to interrogate a million sites in a single assay. Despite the increasing ease of generating genetic data, we are still faced with the challenge of understanding how these genetic variants affect susceptibility to common disease in the context of environmental exposures.

To meet this challenge, my laboratory is applying genetic variation data to clinical trials, large-scale epidemiological studies, and biobanks linked to quantitative traits and extensive clinical data. As an example, the Centers for Disease Control and Prevention has collected >15,000 DNAs with phenotypic information linked to the samples in a population-based survey known as the National Health and Nutrition Examination Surveys (NHANES). We are using NHANES for candidate gene studies as well as for a recently funded U01 to characterize the genetic architecture of genetics variants identified in genome-wide association studies (GWAS) with an emphasis on African Americans and Mexican Americans. In addition to NHANES, my laboratory also accesses BioVU, the Vanderbilt DNA biobank containing >125,000 DNA samples linked to de-identified electronic medical records. These and other projects in my laboratory cut across various phenotypes and traits and will involve gathering genotyping and sequencing data for various cohorts, as well as developing tools to store, retrieve, and ultimately analyze the combined phenotypic and genetic data. For analysis, traditional (e.g., regressions; haplotype inference) and experimental genetic epidemiological methods will be used to identify SNPs associated with phenotypes, and bioinformatic/genomic tools will be used to make informed decisions to target specific genes/genomic regions and to interpret these associations. Finally, many methodological issues such as adjusting for population stratification and identifying interactions can be explored by mining existing data to aid in the design and analysis of future studies.


The following timeline graph is generated from all co-authored publications.

Featured publications are shown below:

  1. Genome-wide study of resistant hypertension identified from electronic health records. Dumitrescu L, Ritchie MD, Denny JC, El Rouby NM, McDonough CW, Bradford Y, Ramirez AH, Bielinski SJ, Basford MA, Chai HS, Peissig P, Carrell D, Pathak J, Rasmussen LV, Wang X, Pacheco JA, Kho AN, Hayes MG, Matsumoto M, Smith ME, Li R, Cooper-DeHoff RM, Kullo IJ, Chute CG, Chisholm RL, Jarvik GP, Larson EB, Carey D, McCarty CA, Williams MS, Roden DM, Bottinger E, Johnson JA, de Andrade M, Crawford DC (2017) PLoS One 12(2): e0171745
    › Primary publication · 28222112 (PubMed) · PMC5319785 (PubMed Central)
    › Primary publication · 27896978 (PubMed) · PMC5147499 (PubMed Central)
  3. Shared Genetic Etiology of Autoimmune Diseases in Patients from a Biorepository Linked to De-identified Electronic Health Records. Restrepo NA, Butkiewicz M, McGrath JA, Crawford DC (2016) Front Genet : 185
    › Primary publication · 27812365 (PubMed) · PMC5071319 (PubMed Central)
  4. Searching in the Dark: Phenotyping Diabetic Retinopathy in a De-Identified Electronic Medical Record Sample of African Americans. Restrepo NA, Farber-Eger E, Crawford DC (2016) AMIA Jt Summits Transl Sci Proc : 221-30
    › Primary publication · 27570675 (PubMed) · PMC5001772 (PubMed Central)
    › Primary publication · 26776201 (PubMed) · PMC4720980 (PubMed Central)
  6. KIDNEY DISEASE GENETICS AND THE IMPORTANCE OF DIVERSITY IN PRECISION MEDICINE. Cooke Bailey JN, Wilson S, Brown-Gentry K, Goodloe R, Crawford DC (2016) Pac Symp Biocomput : 285-96
    › Primary publication · 26776194 (PubMed) · PMC4720994 (PubMed Central)
    › Primary publication · 26776177 (PubMed) · PMC4720978 (PubMed Central)
  8. Towards a phenome-wide catalog of human clinical traits impacted by genetic ancestry. Dumitrescu L, Restrepo NA, Goodloe R, Boston J, Farber-Eger E, Pendergrass SA, Bush WS, Crawford DC (2015) BioData Min : 35
    › Primary publication · 26566401 (PubMed) · PMC4642611 (PubMed Central)
  9. Leveraging Epidemiologic and Clinical Collections for Genomic Studies of Complex Traits. Crawford DC, Goodloe R, Farber-Eger E, Boston J, Pendergrass SA, Haines JL, Ritchie MD, Bush WS (2015) Hum Hered 79(3-4): 137-46
    › Primary publication · 26201699 (PubMed) · PMC4528966 (PubMed Central)
  10. Extracting Primary Open-Angle Glaucoma from Electronic Medical Records for Genetic Association Studies. Restrepo NA, Farber-Eger E, Goodloe R, Haines JL, Crawford DC (2015) PLoS One 10(6): e0127817
    › Primary publication · 26061293 (PubMed) · PMC4465698 (PubMed Central)
  11. Genome-Wide Association Study of Serum Creatinine Levels during Vancomycin Therapy. Van Driest SL, McGregor TL, Velez Edwards DR, Saville BR, Kitchner TE, Hebbring SJ, Brilliant M, Jouni H, Kullo IJ, Creech CB, Kannankeril PJ, Vear SI, Brothers KB, Bowton EA, Shaffer CM, Patel N, Delaney JT, Bradford Y, Wilson S, Olson LM, Crawford DC, Potts AL, Ho RH, Roden DM, Denny JC (2015) PLoS One 10(6): e0127791
    › Primary publication · 26030142 (PubMed) · PMC4452656 (PubMed Central)
  12. The effects of electronic medical record phenotyping details on genetic association studies: HDL-C as a case study. Dumitrescu L, Goodloe R, Bradford Y, Farber-Eger E, Boston J, Crawford DC (2015) BioData Min : 15
    › Primary publication · 25969697 (PubMed) · PMC4428098 (PubMed Central)
  13. Pleiotropic and sex-specific effects of cancer GWAS SNPs on melanoma risk in the population architecture using genomics and epidemiology (PAGE) study. Kocarnik JM, Park SL, Han J, Dumitrescu L, Cheng I, Wilkens LR, Schumacher FR, Kolonel L, Carlson CS, Crawford DC, Goodloe RJ, Dilks HH, Baker P, Richardson D, Matise TC, Ambite JL, Song F, Qureshi AA, Zhang M, Duggan D, Hutter C, Hindorff L, Bush WS, Kooperberg C, Le Marchand L, Peters U (2015) PLoS One 10(3): e0120491
    › Primary publication · 25789475 (PubMed) · PMC4366224 (PubMed Central)
  14. Replication of Associations with Electrocardio-graphic Traits in African Americans from Clinical and Epidemiologic Studies. Jeff JM, Brown-Gentry K, Goodloe R, Ritchie MD, Denny JC, Kho AN, Armstrong LL, McClellan B, Mayo P, Allen M, Jin H, Gillani NB, Schnetz-Boutaud N, Dilks HH, Basford MA, Pacheco JA, Jarvik GP, Chisholm RL, Roden DM, Hayes MG, Crawford DC (2014) Evol Comput Mach Learn Data Min Bioinform : 939-951
    › Primary publication · 25590050 (PubMed) · PMC4290789 (PubMed Central)
  15. Controlling for population structure and genotyping platform bias in the eMERGE multi-institutional biobank linked to electronic health records. Crosslin DR, Tromp G, Burt A, Kim DS, Verma SS, Lucas AM, Bradford Y, Crawford DC, Armasu SM, Heit JA, Hayes MG, Kuivaniemi H, Ritchie MD, Jarvik GP, de Andrade M, electronic Medical Records and Genomics (eMERGE) Network (2014) Front Genet : 352
    › Primary publication · 25414722 (PubMed) · PMC4220165 (PubMed Central)
  16. Extraction of echocardiographic data from the electronic medical record is a rapid and efficient method for study of cardiac structure and function. Wells QS, Farber-Eger E, Crawford DC (2014) J Clin Bioinforma : 12
    › Primary publication · 25276338 (PubMed) · PMC4177384 (PubMed Central)
  17. Phenome-wide association studies demonstrating pleiotropy of genetic variants within FTO with and without adjustment for body mass index. Cronin RM, Field JR, Bradford Y, Shaffer CM, Carroll RJ, Mosley JD, Bastarache L, Edwards TL, Hebbring SJ, Lin S, Hindorff LA, Crane PK, Pendergrass SA, Ritchie MD, Crawford DC, Pathak J, Bielinski SJ, Carrell DS, Crosslin DR, Ledbetter DH, Carey DJ, Tromp G, Williams MS, Larson EB, Jarvik GP, Peissig PL, Brilliant MH, McCarty CA, Chute CG, Kullo IJ, Bottinger E, Chisholm R, Smith ME, Roden DM, Denny JC (2014) Front Genet : 250
    › Primary publication · 25177340 (PubMed) · PMC4134007 (PubMed Central)
  18. Accuracy of administratively-assigned ancestry for diverse populations in an electronic medical record-linked biobank. Hall JB, Dumitrescu L, Dilks HH, Crawford DC, Bush WS (2014) PLoS One 9(6): e99161
    › Primary publication · 24896101 (PubMed) · PMC4045967 (PubMed Central)
  19. Cross-cancer pleiotropic analysis of endometrial cancer: PAGE and E2C2 consortia. Setiawan VW, Schumacher F, Prescott J, Haessler J, Malinowski J, Wentzensen N, Yang H, Chanock S, Brinton L, Hartge P, Lissowska J, Park SL, Cheng I, Bush WS, Crawford DC, Ursin G, Horn-Ross P, Bernstein L, Lu L, Risch H, Yu H, Sakoda LC, Doherty J, Chen C, Jackson R, Yasmeen S, Cote M, Kocarnik JM, Peters U, Kraft P, De Vivo I, Haiman CA, Kooperberg C, Le Marchand L (2014) Carcinogenesis 35(9): 2068-73
    › Primary publication · 24832084 (PubMed) · PMC4146418 (PubMed Central)
  20. Investigating the relationship between mitochondrial genetic variation and cardiovascular-related traits to develop a framework for mitochondrial phenome-wide association studies. Mitchell SL, Hall JB, Goodloe RJ, Boston J, Farber-Eger E, Pendergrass SA, Bush WS, Crawford DC (2014) BioData Min : 6
    › Primary publication · 24731735 (PubMed) · PMC4021623 (PubMed Central)
  21. Pleiotropic associations of risk variants identified for other cancers with lung cancer risk: the PAGE and TRICL consortia. Park SL, Fesinmeyer MD, Timofeeva M, Caberto CP, Kocarnik JM, Han Y, Love SA, Young A, Dumitrescu L, Lin Y, Goodloe R, Wilkens LR, Hindorff L, Fowke JH, Carty C, Buyske S, Schumacher FR, Butler A, Dilks H, Deelman E, Cote ML, Chen W, Pande M, Christiani DC, Field JK, Bickebller H, Risch A, Heinrich J, Brennan P, Wang Y, Eisen T, Houlston RS, Thun M, Albanes D, Caporaso N, Peters U, North KE, Heiss G, Crawford DC, Bush WS, Haiman CA, Landi MT, Hung RJ, Kooperberg C, Amos CI, Le Marchand L, Cheng I (2014) J Natl Cancer Inst 106(4): dju061
    › Primary publication · 24681604 (PubMed) · PMC3982896 (PubMed Central)
  22. Multiancestral analysis of inflammation-related genetic variants and C-reactive protein in the population architecture using genomics and epidemiology study. Kocarnik JM, Pendergrass SA, Carty CL, Pankow JS, Schumacher FR, Cheng I, Durda P, Ambite JL, Deelman E, Cook NR, Liu S, Wactawski-Wende J, Hutter C, Brown-Gentry K, Wilson S, Best LG, Pankratz N, Hong CP, Cole SA, Voruganti VS, Bůžkova P, Jorgensen NW, Jenny NS, Wilkens LR, Haiman CA, Kolonel LN, Lacroix A, North K, Jackson R, Le Marchand L, Hindorff LA, Crawford DC, Gross M, Peters U (2014) Circ Cardiovasc Genet 7(2): 178-88
    › Primary publication · 24622110 (PubMed) · PMC4104750 (PubMed Central)
  23. Admixture mapping and subsequent fine-mapping suggests a biologically relevant and novel association on chromosome 11 for type 2 diabetes in African Americans. Jeff JM, Armstrong LL, Ritchie MD, Denny JC, Kho AN, Basford MA, Wolf WA, Pacheco JA, Li R, Chisholm RL, Roden DM, Hayes MG, Crawford DC (2014) PLoS One 9(3): e86931
    › Primary publication · 24595071 (PubMed) · PMC3940426 (PubMed Central)
  24. Characterization of mitochondrial haplogroups in a large population-based sample from the United States. Mitchell SL, Goodloe R, Brown-Gentry K, Pendergrass SA, Murdock DG, Crawford DC (2014) Hum Genet 133(7): 861-8
    › Primary publication · 24488180 (PubMed) · PMC4113317 (PubMed Central)
  25. Replication of associations between GWAS SNPs and melanoma risk in the Population Architecture Using Genomics and Epidemiology (PAGE) Study. Kocarnik JM, Park SL, Han J, Dumitrescu L, Cheng I, Wilkens LR, Schumacher FR, Kolonel L, Carlson CS, Crawford DC, Goodloe RJ, Dilks H, Baker P, Richardson D, Ambite JL, Song F, Quresh AA, Zhang M, Duggan D, Hutter C, Hindorff LA, Bush WS, Kooperberg C, Le Marchand L, Peters U (2014) J Invest Dermatol 134(7): 2049-2052
    › Primary publication · 24480881 (PubMed) · PMC4057959 (PubMed Central)
  26. Development of a data-mining algorithm to identify ages at reproductive milestones in electronic medical records. Malinowski J, Farber-Eger E, Crawford DC (2014) Pac Symp Biocomput : 376-87
    › Primary publication · 24297563 (PubMed) · PMC3905575 (PubMed Central)
  27. Utilization of an EMR-biorepository to identify the genetic predictors of calcineurin-inhibitor toxicity in heart transplant recipients. Oetjens M, Bush WS, Birdwell KA, Dilks HH, Bowton EA, Denny JC, Wilke RA, Roden DM, Crawford DC (2014) Pac Symp Biocomput : 253-64
    › Primary publication · 24297552 (PubMed) · PMC3923429 (PubMed Central)
  28. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Denny JC, Bastarache L, Ritchie MD, Carroll RJ, Zink R, Mosley JD, Field JR, Pulley JM, Ramirez AH, Bowton E, Basford MA, Carrell DS, Peissig PL, Kho AN, Pacheco JA, Rasmussen LV, Crosslin DR, Crane PK, Pathak J, Bielinski SJ, Pendergrass SA, Xu H, Hindorff LA, Li R, Manolio TA, Chute CG, Chisholm RL, Larson EB, Jarvik GP, Brilliant MH, McCarty CA, Kullo IJ, Haines JL, Crawford DC, Masys DR, Roden DM (2013) Nat Biotechnol 31(12): 1102-10
    › Primary publication · 24270849 (PubMed) · PMC3969265 (PubMed Central)
  29. Fine Mapping and Identification of BMI Loci in African Americans. Gong J, Schumacher F, Lim U, Hindorff LA, Haessler J, Buyske S, Carlson CS, Rosse S, Bůžková P, Fornage M, Gross M, Pankratz N, Pankow JS, Schreiner PJ, Cooper R, Ehret G, Gu CC, Houston D, Irvin MR, Jackson R, Kuller L, Henderson B, Cheng I, Wilkens L, Leppert M, Lewis CE, Li R, Nguyen KD, Goodloe R, Farber-Eger E, Boston J, Dilks HH, Ritchie MD, Fowke J, Pooler L, Graff M, Fernandez-Rhodes L, Cochrane B, Boerwinkle E, Kooperberg C, Matise TC, Le Marchand L, Crawford DC, Haiman CA, North KE, Peters U (2013) Am J Hum Genet 93(4): 661-71
    › Primary publication · 24094743 (PubMed) · PMC3791273 (PubMed Central)
  30. Pleiotropic effects of genetic risk variants for other cancers on colorectal cancer risk: PAGE, GECCO and CCFR consortia. Cheng I, Kocarnik JM, Dumitrescu L, Lindor NM, Chang-Claude J, Avery CL, Caberto CP, Love SA, Slattery ML, Chan AT, Baron JA, Hindorff LA, Park SL, Schumacher FR, Hoffmeister M, Kraft P, Butler AM, Duggan DJ, Hou L, Carlson CS, Monroe KR, Lin Y, Carty CL, Mann S, Ma J, Giovannucci EL, Fuchs CS, Newcomb PA, Jenkins MA, Hopper JL, Haile RW, Conti DV, Campbell PT, Potter JD, Caan BJ, Schoen RE, Hayes RB, Chanock SJ, Berndt SI, Küry S, Bézieau S, Ambite JL, Kumaraguruparan G, Richardson DM, Goodloe RJ, Dilks HH, Baker P, Zanke BW, Lemire M, Gallinger S, Hsu L, Jiao S, Harrison TA, Seminara D, Haiman CA, Kooperberg C, Wilkens LR, Hutter CM, White E, Crawford DC, Heiss G, Hudson TJ, Brenner H, Bush WS, Casey G, Le Marchand L, Peters U (2014) Gut 63(5): 800-7
    › Primary publication · 23935004 (PubMed) · PMC3918490 (PubMed Central)
  31. Phenome-wide association study (PheWAS) for detection of pleiotropy within the Population Architecture using Genomics and Epidemiology (PAGE) Network. Pendergrass SA, Brown-Gentry K, Dudek S, Frase A, Torstenson ES, Goodloe R, Ambite JL, Avery CL, Buyske S, Bůžková P, Deelman E, Fesinmeyer MD, Haiman CA, Heiss G, Hindorff LA, Hsu CN, Jackson RD, Kooperberg C, Le Marchand L, Lin Y, Matise TC, Monroe KR, Moreland L, Park SL, Reiner A, Wallace R, Wilkens LR, Crawford DC, Ritchie MD (2013) PLoS Genet 9(1): e1003087
    › Primary publication · 23382687 (PubMed) · PMC3561060 (PubMed Central)
  32. Visual integration of results from a large DNA biobank (BioVU) using synthesis-view. Pendergrass S, Dudek SM, Roden DM, Crawford DC, Ritchie MD (2011) Pac Symp Biocomput : 265-75
    › Primary publication · 21121054 (PubMed) · PMC3065108 (PubMed Central)
  33. Identification of genomic predictors of atrioventricular conduction: using electronic medical records as a tool for genome science. Denny JC, Ritchie MD, Crawford DC, Schildcrout JS, Ramirez AH, Pulley JM, Basford MA, Masys DR, Haines JL, Roden DM (2010) Circulation 122(20): 2016-21
    › Primary publication · 21041692 (PubMed) · PMC2991609 (PubMed Central)
  34. Assessing the accuracy of observer-reported ancestry in a biorepository linked to electronic medical records. Dumitrescu L, Ritchie MD, Brown-Gentry K, Pulley JM, Basford M, Denny JC, Oksenberg JR, Roden DM, Haines JL, Crawford DC (2010) Genet Med 12(10): 648-50
    › Primary publication · 20733501 (PubMed) · PMC2952033 (PubMed Central)