Deriving comorbidities from medical records using natural language processing

Hojjat Salmasian; Daniel E. Freedberg; Carol Friedman

doi:10.1136/amiajnl-2013-001889

ScienceGate Book Chapters

JOURNAL ARTICLE

Deriving comorbidities from medical records using natural language processing

Hojjat Salmasian Daniel E. Freedberg Carol Friedman

Year: 2013 Journal: Journal of the American Medical Informatics Association Vol: 20 (e2)Pages: e239-e242 Publisher: Oxford University Press

DOI: 10.1136/amiajnl-2013-001889

Get Full-Text PDF Get Analytical Report

Abstract

Extracting comorbidity information is crucial for phenotypic studies because of the confounding effect of comorbidities. We developed an automated method that accurately determines comorbidities from electronic medical records. Using a modified version of the Charlson comorbidity index (CCI), two physicians created a reference standard of comorbidities by manual review of 100 admission notes. We processed the notes using the MedLEE natural language processing system, and wrote queries to extract comorbidities automatically from its structured output. Interrater agreement for the reference set was very high (97.7%). Our method yielded an F1 score of 0.761 and the summed CCI score was not different from the reference standard (p=0.329, power 80.4%). In comparison, obtaining comorbidities from claims data yielded an F1 score of 0.741, due to lower sensitivity (66.1%). Because CCI has previously been validated as a predictor of mortality and readmission, our method could allow automated prediction of these outcomes.

Keywords:

Comorbidity Confounding Medical record Medicine Data mining Health records Computer science Inter-rater reliability Data set Artificial intelligence Natural language processing Internal medicine Statistics

Metrics

Cited By

1.95

FWCI (Field Weighted Citation Impact)

Refs

0.85

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Chronic Disease Management Strategies

Health Sciences → Medicine → Epidemiology

Machine Learning in Healthcare

Physical Sciences → Computer Science → Artificial Intelligence

Medical Coding and Health Information

Health Sciences → Health Professions → Health Information Management

Deriving comorbidities from medical records using natural language processing

Abstract

Metrics

Citation History

Topics

Related Documents

Information Extraction from Electronic Medical Records using Natural Language Processing Techniques

Natural Language Processing to Extract Information from Portuguese-Language Medical Records

Natural Language Processing and Electronic Medical Records—Reply

Using natural language processing to extract clinically useful information from Chinese electronic medical records

Extracting social determinants of health from inpatient electronic medical records using natural language processing