JOURNAL ARTICLE

Deriving comorbidities from medical records using natural language processing

Hojjat SalmasianDaniel E. FreedbergCarol Friedman

Year: 2013 Journal:   Journal of the American Medical Informatics Association Vol: 20 (e2)Pages: e239-e242   Publisher: Oxford University Press

Abstract

Extracting comorbidity information is crucial for phenotypic studies because of the confounding effect of comorbidities. We developed an automated method that accurately determines comorbidities from electronic medical records. Using a modified version of the Charlson comorbidity index (CCI), two physicians created a reference standard of comorbidities by manual review of 100 admission notes. We processed the notes using the MedLEE natural language processing system, and wrote queries to extract comorbidities automatically from its structured output. Interrater agreement for the reference set was very high (97.7%). Our method yielded an F1 score of 0.761 and the summed CCI score was not different from the reference standard (p=0.329, power 80.4%). In comparison, obtaining comorbidities from claims data yielded an F1 score of 0.741, due to lower sensitivity (66.1%). Because CCI has previously been validated as a predictor of mortality and readmission, our method could allow automated prediction of these outcomes.

Keywords:
Comorbidity Confounding Medical record Medicine Data mining Health records Computer science Inter-rater reliability Data set Artificial intelligence Natural language processing Internal medicine Statistics

Metrics

40
Cited By
1.95
FWCI (Field Weighted Citation Impact)
30
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Chronic Disease Management Strategies
Health Sciences →  Medicine →  Epidemiology
Machine Learning in Healthcare
Physical Sciences →  Computer Science →  Artificial Intelligence
Medical Coding and Health Information
Health Sciences →  Health Professions →  Health Information Management
© 2026 ScienceGate Book Chapters — All rights reserved.