AI4Meder
返回论文列表
论文ICLR 2026 Poster2026 年trustworthy medical AI

LiveClin:无泄漏的实时临床基准

ICLR 2026 Poster accepted paper at ICLR 2026. The reliability of medical LLM evaluation is critically undermined by data contamination and knowledge obsolescence, leading to inflated scores on static benchmarks. To address these challenges, we introduce LiveClin, a live benchmark designed for the approximating real-world clinical practice. Built from contemporary, peer-reviewed case reports and updated biannually, LiveClin ensures clinical currency and resists data contamination. Using a verified AI–human workflow involving 239 physicians, we transform authentic patient cases into complex, multimodal evaluation scenarios that span the entire clinical pathway. Code/project link: https://github.com/AQ-MedAI/LiveClin

论文默认配图 - 医学影像计算

论文详情

英文标题
LiveClin: A Live Clinical Benchmark without Leakage
作者
Xidong Wang, Guo shuqi, Yue Shen, Junying Chen, Jian Wang, Jinjie Gu, Ping Zhang, Lei Liu, Benyou Wang
期刊/会议
ICLR 2026 Poster
发表年份
2026 年
研究方向
trustworthy medical AI