JOURNAL ARTICLE

Data Augmentation with Nearest Neighbor Classifier for Few-Shot Named Entity Recognition

Yao GeMohammed Ali Al-GaradiAbeed Sarker

Year: 2024 Journal:   Studies in health technology and informatics Vol: 310 Pages: 690-694   Publisher: IOS Press

Abstract

Few-shot learning (FSL) is a category of machine learning models that are designed with the intent of solving problems that have small amounts of labeled data available for training. FSL research progress in natural language processing (NLP), particularly within the medical domain, has been notably slow, primarily due to greater difficulties posed by domain-specific characteristics and data sparsity problems. We explored the use of novel methods for text representation and encoding combined with distance-based measures for improving FSL entity detection. In this paper, we propose a data augmentation method to incorporate semantic information from medical texts into the learning process and combine it with a nearest-neighbor classification strategy for predicting entities. Experiments performed on five biomedical text datasets demonstrate that our proposed approach often outperforms other approaches.

Keywords:
Computer science Artificial intelligence Classifier (UML) k-nearest neighbors algorithm Machine learning Domain (mathematical analysis) Natural language processing Representation (politics) Labeled data Pattern recognition (psychology)

Metrics

4
Cited By
5.04
FWCI (Field Weighted Citation Impact)
8
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.