, a bio/informatics shared resource is still "open for business" - Visit the CDS website


Dan Roden
Faculty Member
Last active: 3/24/2020

A comparative study of different methods for automatic identification of clopidogrel-induced bleedings in electronic health records.

Lee HJ, Jiang M, Wu Y, Shaffer CM, Cleator JH, Friedman EA, Lewis JP, Roden DM, Denny J, Xu H
AMIA Jt Summits Transl Sci Proc. 2017 2017: 185-192

PMID: 28815128 · PMCID: PMC5543340

Electronic health records (EHRs) linked with biobanks have been recognized as valuable data sources for pharmacogenomic studies, which require identification of patients with certain adverse drug reactions (ADRs) from a large population. Since manual chart review is costly and time-consuming, automatic methods to accurately identify patients with ADRs have been called for. In this study, we developed and compared different informatics approaches to identify ADRs from EHRs, using clopidogrel-induced bleeding as our case study. Three different types of methods were investigated: 1) rule-based methods; 2) machine learning-based methods; and 3) scoring function-based methods. Our results show that both machine learning and scoring methods are effective and the scoring method can achieve a high precision with a reasonable recall. We also analyzed the contributions of different types of features and found that the temporality information between clopidogrel and bleeding events, as well as textual evidence from physicians' assertion of the adverse events are helpful. We believe that our findings are valuable in advancing EHR-based pharmacogenomic studies.

MeSH Terms (0)

Connections (1)

This publication is referenced by other Labnodes entities:

Links