AI4Meder
返回论文列表
论文ICLR 2026 Poster2026 年clinical prediction

MedAraBench:大规模阿拉伯语医学问答数据集与基准

ICLR 2026 Poster accepted paper at ICLR 2026. Arabic remains one of the most underrepresented languages in natural language processing research, particularly in medical applications, due to the limited availability of open-source data and benchmarks. The lack of resources hinders efforts to evaluate and advance the multilingual capabilities of Large Language Models (LLMs). In this paper, we introduce MedAraBench, a large-scale dataset consisting of Arabic multiple-choice question-answer pairs across various medical specialties. We constructed the dataset by manually digitizing a large repository of academic materials created by medical professionals in the Arabic-speaking region.

论文默认配图 - 医学影像计算

论文详情

英文标题
MedAraBench: Large-scale Arabic Medical Question Answering Dataset and Benchmark
作者
Mouath Abu Daoud, Leen Kharouf, Omar El Hajj, Dana El Samad, Mariam Al-Omari, Jihad Mallat, Khaled Saleh, Nizar Habash, Farah E. Shamout
期刊/会议
ICLR 2026 Poster
发表年份
2026 年
研究方向
clinical prediction