中文 English

您当前所在位置:首页 > 学术成果

学术成果

论文丨Yidan Zhang; Junlin Yu; Guobo Li; Zhenan He; Gary G. Yen:REaMA: Building Biomedical Relation Extraction Specialized Large Language Models Through Instruction Tuning

时间:2026-03-30

本文(REaMA: Building Biomedical Relation Extraction Specialized Large Language Models Through Instruction Tuning原载IEEE Transactions on Neural Networks and Learning Systems,四川大学李国菠教授等科研人员创作,系四川大学智慧法治超前部署学科系列学术成果。后续会持续分享四川大学智慧法治超前部署学科系列学术成果,欢迎大家阅读。

Aiming to identify entity pairs with biomedical semantic relations and assign specific relation types, biomedical relation extraction (BioRE) plays a critical role in biomedical text mining and information extraction (IE). Recent studies indicate that general large language models (LLMs) have made some breakthroughs in general relation extraction (RE) tasks. However, even the advanced open-source LLMs struggle with BioRE tasks. For example, WizardLM-70B and LLaMA-2-70B achieve F-scores of 14.05 and 12.21 on the BioRED dataset, respectively, significantly lagging behind the state-of-the-art (SOTA) method which scores 65.17. To address this gap, a multitask instruction-tuning framework is proposed, which can transform general LLMs into BioRE-specialized models with our meticulously curated instruction dataset, REInstruct, comprising 150000 diverse and quality instruction-response pairs. Consequently, we introduce REaMA, a series of open-source LLMs with sizes of 7B and 13B specifically tailored for BioRE tasks. Experimental results on seven representative BioRE datasets show that both REaMA-2-7B and REaMA-2-13B acquire promising performance on all datasets. Remarkably, the larger REaMA-2-13B outperforms the current SOTA method on five out of seven datasets. The result exhibits the effectiveness of instruction-tuning on REInstruct in eliciting strong RE capabilities in LLMs. Furthermore, we show that incorporating chain of thought (CoT) into REInstruct can further enhance the generalization ability of REaMA. The project is available at https://github.



Zhang, Y.; Yu, J.; Li, G.; He, Z.; Yen, G. G. REaMA: Building Biomedical Relation Extraction Specialized Large Language Models Through Instruction Tuning. IEEE Transactions on Neural Networks and Learning Systems 2025, 1-15.(论文下载)