Site Search - AI4Meder

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

LiveClin: A Leakage-Free Live Clinical Benchmark

Accepted at ICLR 2026 as a poster. The reliability of medical LLM evaluation is critically undermined by data contamination and knowledge obsolescence, which inflate scores on static benchmarks. To address these challenges, we introduce LiveClin, a live benchmark designed to approximate real-world clinical practice. Built from contemporary, peer-reviewed case reports and updated biannually, LiveClin ensures clinical currency and resists data contamination. Using a verified AI–human workflow involving 239 physicians, we transform authentic patient cases into complex, multimodal evaluation scenarios that span the entire clinical pathway. Code/project link: https://github.com/AQ-MedAI/LiveClin

Medical Image Computing | Medical Multimodal | Clinical Language Intelligence | Paper | MultiModal | Medical Benchmark | ICLR 2026

Search medical AI papers and resources

1 result
