JOURNAL ARTICLE

REaMA: Building Biomedical Relation Extraction Specialized Large Language Models Through Instruction Tuning

Yidan ZhangJunlin YuGuo‐Bo LiZhenan HeGary G. Yen

Year: 2025 Journal:   IEEE Transactions on Neural Networks and Learning Systems Vol: 36 (12)Pages: 20258-20272   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Aiming to identify entity pairs with biomedical semantic relations and assign specific relation types, biomedical relation extraction (BioRE) plays a critical role in biomedical text mining and information extraction (IE). Recent studies indicate that general large language models (LLMs) have made some breakthroughs in general relation extraction (RE) tasks. However, even the advanced open-source LLMs struggle with BioRE tasks. For example, WizardLM-70B and LLaMA-2-70B achieve F-scores of 14.05 and 12.21 on the BioRED dataset, respectively, significantly lagging behind the state-of-the-art (SOTA) method which scores 65.17. To address this gap, a multitask instruction-tuning framework is proposed, which can transform general LLMs into BioRE-specialized models with our meticulously curated instruction dataset, REInstruct, comprising 150000 diverse and quality instruction-response pairs. Consequently, we introduce REaMA, a series of open-source LLMs with sizes of 7B and 13B specifically tailored for BioRE tasks. Experimental results on seven representative BioRE datasets show that both REaMA-2-7B and REaMA-2-13B acquire promising performance on all datasets. Remarkably, the larger REaMA-2-13B outperforms the current SOTA method on five out of seven datasets. The result exhibits the effectiveness of instruction-tuning on REInstruct in eliciting strong RE capabilities in LLMs. Furthermore, we show that incorporating chain of thought (CoT) into REInstruct can further enhance the generalization ability of REaMA. The project is available at https://github.com/stzpp/REaMA.

Keywords:
Relation (database) Computer science Relationship extraction Extraction (chemistry) Natural language processing Chemistry Data mining

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.14
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

BOOK-CHAPTER

Instruction Tuning Large Language Models for Multimodal Relation Extraction Using LoRA

Li ZouNing PangXiang Zhao

Lecture notes in computer science Year: 2024 Pages: 364-376
JOURNAL ARTICLE

Benchmarking Large Language Models for Biomedical Relation Extraction

Claudiu CreangăTeodor MarchitanLiviu P. Dinu

Journal:   Procedia Computer Science Year: 2025 Vol: 270 Pages: 592-601
JOURNAL ARTICLE

Instruction Tuning for Developing Large Language Models Specialized in Chemical Domain

Jung‐Min LeeH.Y. KimSungsu LeeYunsoo KimK.S. LeeSeoung-Bum Kim

Journal:   Journal of Korean Institute of Industrial Engineers Year: 2025 Vol: 51 (2)Pages: 150-160
JOURNAL ARTICLE

BioInstruct: instruction tuning of large language models for biomedical natural language processing

Hieu TranZhichao YangZonghai YaoHong Yu

Journal:   Journal of the American Medical Informatics Association Year: 2024 Vol: 31 (9)Pages: 1821-1832
© 2026 ScienceGate Book Chapters — All rights reserved.