JOURNAL ARTICLE

Bi-encoder-based approach to biomedical document-level entity recognition and relation extraction

Pengyuan NieMengxuan LinJinzhong NingZhihao YangLei Wang

Year: 2025 Journal:   IEEE Transactions on Computational Biology and Bioinformatics Vol: 22 (5)Pages: 1-12

Abstract

Large-scale biomedical entity recognition and relation extraction are essential foundational tasks for downstream text mining tasks and applications, such as knowledge graph construction. Because many relations span sentence boundaries, document-level entity recognition and relation extraction is closely aligned with real-world demands. However, identifying complex and diverse entities and relations within a limited timeframe is challenging. Therefore, we propose an end-to-end approach called BioECR for biomedical document-level named entity recognition, coreference resolution, and relation extraction. This approach utilizes a bi-encoder structure combined with biomedical entity types and descriptions to solve nested biomedical entities in linear time, thereby enhancing the ability to recognise complex entities and relations. Then a composition graph convolutional neural network was proposed to address the noise in conventional graph convolutional networks, thereby reducing time overhead and selectively fusing multiple entities or contextual information. Finally, by combining entity type clustering methods, the problem of coreference errors among multiple types of entities is solved easily and quickly. Experimental results demonstrate that our approach achieves state-of-the-art performance on all subtasks across three biomedical document-level datasets called CDR, GDA, and BioRED, and our approach reduces the inference time by approximately 60%.

Keywords:
Relation (database) Computer science Relationship extraction Encoder Natural language processing Information retrieval Artificial intelligence Extraction (chemistry) Pattern recognition (psychology) Data mining Chemistry Chromatography

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.