Tracking medical students' clinical experiences using natural language processing.

Denny JC, Bastarache L, Sastre EA, Spickard A
J Biomed Inform. 2009 42 (5): 781-9

PMID: 19236956 · PMCID: PMC5490452 · DOI:10.1016/j.jbi.2009.02.004

Graduate medical students must demonstrate competency in clinical skills. Current tracking methods rely either on manual efforts or on simple electronic entry to record clinical experience. We evaluated automated methods to locate 10 institution-defined core clinical problems in three medical students' clinical notes (n=290). Each note was processed with section header identification algorithms and the KnowledgeMap concept identifier to locate Unified Medical Language System (UMLS) concepts. The best-performing automated search strategies accurately classified documents containing primary discussions of the core clinical problems, with an area under the receiver operating characteristic curve of 0.90-0.94. Recall and precision for UMLS concept identification were 0.91 and 0.92, respectively. Of the individual note sections, concepts found within the chief complaint, history of present illness, and assessment and plan were the strongest predictors of relevance. This automated method of tracking can provide detailed, pertinent reports of clinical experience without requiring additional work from medical trainees. The coupling of section header identification and concept identification holds promise for other natural language processing tasks, such as clinical research or phenotype identification.
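The idea of weighting concepts by the note section in which they appear, then judging the resulting ranking by area under the ROC curve, can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not use KnowledgeMap: the section names, the numeric weights, the placeholder concept identifiers (such as `C_CHEST_PAIN`), and the helper names `relevance_score` and `auc` are all assumptions made for illustration. The abstract reports only which sections were the strongest predictors, not any weights.

```python
# Hypothetical section weights (assumed values, not taken from the paper).
SECTION_WEIGHTS = {
    "chief_complaint": 3.0,
    "history_of_present_illness": 2.0,
    "assessment_and_plan": 2.0,
}
DEFAULT_WEIGHT = 0.5  # assumed weight for any other section


def relevance_score(note, problem_concepts):
    """Sum the weights of the sections that mention any target concept.

    `note` maps a section name to the list of concept identifiers found
    in that section; `problem_concepts` is a set of target identifiers.
    """
    score = 0.0
    for section, concepts in note.items():
        if problem_concepts & set(concepts):
            score += SECTION_WEIGHTS.get(section, DEFAULT_WEIGHT)
    return score


def auc(scores, labels):
    """Area under the ROC curve, computed as the probability that a
    relevant note outscores a non-relevant one (ties count as half)."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy notes with placeholder concept identifiers (not real UMLS CUIs).
notes = [
    {"chief_complaint": ["C_CHEST_PAIN"], "assessment_and_plan": ["C_CHEST_PAIN"]},
    {"past_medical_history": ["C_CHEST_PAIN"]},
    {"history_of_present_illness": ["C_COUGH"]},
    {"history_of_present_illness": ["C_CHEST_PAIN"]},
]
labels = [True, False, False, True]  # assumed reviewer judgments of relevance
scores = [relevance_score(n, {"C_CHEST_PAIN"}) for n in notes]
toy_auc = auc(scores, labels)
```

In this toy setup a mention in the chief complaint counts for more than a mention in the past medical history, so a note that merely lists a problem in passing ranks below one that discusses it as a primary issue.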

MeSH Terms (9)

Clinical Competence; Data Interpretation, Statistical; Education, Medical, Graduate; Humans; Medical Informatics; Natural Language Processing; Students, Medical; Unified Medical Language System; User-Computer Interface
