JOURNAL ARTICLE

Medical entity recognition using conditional random field (CRF)

Abstract

The main objective of this research is to extract the health information, such as diseases, symptoms, treatments and drugs from the health online forum discussion. The task is referred as the medical entity recognition (MER) in which is defined as the Named Entity Recognition (NER) task to extract the information from the unstructured text and transform it into the structured forms in the health field. The approach for the task used in this research is a supervised learning using Conditional Random Field(CRF). We experimented several combinations of features in order to produce the results with the best accuracy. As the final result, this research obtained the best accuracy of precision 70.97%, recall 57.83%, and f-measures 63.69%. The best combination of features resulting the best overall result consists of the word itself, phrase, dictionary, the first preceding word and the word length.

Keywords:
Conditional random field Named-entity recognition Natural language processing Computer science Task (project management) Artificial intelligence Phrase Recall Word (group theory) Field (mathematics) Speech recognition F1 score Pattern recognition (psychology) Mathematics Psychology

Metrics

6
Cited By
0.46
FWCI (Field Weighted Citation Impact)
12
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.