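AI4Meder Site Search

Search medical AI papers and resources

Search community content by papers, data resources, technical competitions, submission deadlines, and course resources, and jump straight to the corresponding detail pages.

9 results

Enter a keyword or click a tag to narrow results by papers, data resources, competition deadlines, calls for papers, and courses. Tag: Dataset. Scope: Papers.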

Paper | ICLR 2026 Poster | 2026 | clinical prediction

Reliable Evaluation of MRI Motion Correction: Dataset and Insights

Accepted as a poster at ICLR 2026. Correcting motion artifacts in scientific and medical imaging is important, as they significantly impact image quality. However, evaluating deep learning-based and classical motion correction methods remains fundamentally difficult due to the lack of accessible ground-truth target data. To address this challenge, we study three evaluation approaches: real-world evaluation based on reference scans, simulated motion, and reference-free evaluation, each with its merits and shortcomings. To enable evaluation with real-world motion artifacts, we release PMoC3D, a dataset consisting of unprocessed $\textbf{P}$aired $\textbf{Mo}$tion-$\textbf{C}$orrupted $\textbf{3D}$ brain MRI data.

Tags: Medical image computing | EHR and clinical prediction | Paper | 3D MRI motion correction | Accelerated MRI | Dataset

Paper | ICLR 2026 Poster | 2026 | clinical prediction

MedAraBench: A Large-Scale Arabic Medical Question Answering Dataset and Benchmark

Accepted as a poster at ICLR 2026. Arabic remains one of the most underrepresented languages in natural language processing research, particularly in medical applications, due to the limited availability of open-source data and benchmarks. The lack of resources hinders efforts to evaluate and advance the multilingual capabilities of Large Language Models (LLMs). In this paper, we introduce MedAraBench, a large-scale dataset consisting of Arabic multiple-choice question-answer pairs across various medical specialties. We constructed the dataset by manually digitizing a large repository of academic materials created by medical professionals in the Arabic-speaking region.

Tags: Medical image computing | Clinical language intelligence | EHR and clinical prediction | Paper | Dataset | Benchmark | Large Language Models

Paper | ICLR 2026 Poster | 2026 | medical LLM agent

AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs

Accepted as a poster at ICLR 2026. The application of large language models (LLMs) in the medical field has garnered significant attention, yet their reasoning capabilities in more specialized domains like anesthesiology remain underexplored. To bridge this gap, we introduce AnesSuite, the first comprehensive dataset suite specifically designed for anesthesiology reasoning in LLMs. The suite features AnesBench, an evaluation benchmark tailored to assess anesthesiology-related reasoning across three levels: factual retrieval (System 1), hybrid reasoning (System 1.x), and complex decision-making (System 2). Alongside this benchmark, the suite includes three training datasets that provide an infrastructure for continued pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning with verifiable rewards (RLVR). Code/project link: https://github.com/MiliLab/AnesSuite

Tags: Medical image computing | Clinical language intelligence | Paper | Large language model | Reasoning | Anesthesiology

Paper | ICLR 2026 Poster | 2026 | clinical NLP

A Structured, Tagged, and Localized VQA Dataset for Chest X-Ray Images with Full-Sentence Answers and Scene Graphs

Accepted as a poster at ICLR 2026. Visual Question Answering (VQA) enables targeted and context-dependent analysis of medical images, such as chest X-rays (CXRs). However, existing VQA datasets for CXRs are typically constrained by simplistic and brief answer formats, lacking localization annotations (e.g., bounding boxes) and structured tags (e.g., region or radiological finding/disease tags). To address these limitations, we introduce MIMIC-Ext-CXR-QBA (abbr. CXR-QBA), a large-scale CXR VQA dataset derived from MIMIC-CXR, comprising 42 million QA-pairs with multi-granular, multi-part answers, detailed bounding boxes, and structured tags. Code/project link: https://github.com/philip-mueller/mimic-ext-cxr-qba/

Tags: Medical image computing | Medical multimodal | Clinical language intelligence | Paper | VQA | Localization

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Beyond Medical Exams: A Clinician-Annotated Fairness Dataset for Real-World Tasks and Ambiguity in Mental Healthcare

Accepted as a poster at ICLR 2026. Current medical language model (LM) benchmarks often over-simplify the complexities of day-to-day clinical practice tasks and instead rely on evaluating LMs on multiple-choice board exam questions. In psychiatry especially, these challenges are worsened by fairness and bias issues, since models can be swayed by patient demographics even when those factors should not influence clinical decisions. Thus, we present an expert-created and annotated dataset spanning five critical domains of decision-making in mental healthcare: treatment, diagnosis, documentation, monitoring, and triage. This U.S.-centric dataset, created without any LM assistance, is designed to capture the nuanced clinical reasoning and daily ambiguities mental health practitioners encounter, reflecting the inherent complexities of care delivery that are missing from existing datasets.

Tags: Medical image computing | Clinical language intelligence | Trustworthiness, safety, fairness, and privacy | Paper | AI for Healthcare | mental health

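Paper | ICLR 2026 Poster | 2026 | clinical prediction

From Medical Records to Diagnostic Dialogues: A Clinically Grounded Approach and Dataset for Psychiatric Comorbidity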

Accepted as a poster at ICLR 2026. Psychiatric comorbidity is clinically significant yet challenging due to the complexity of multiple co-occurring disorders. To address this, we develop a novel approach integrating synthetic patient electronic medical record (EMR) construction and multi-agent diagnostic dialogue generation. We create 502 synthetic EMRs for common comorbid conditions using a pipeline that ensures clinical relevance and diversity. Our multi-agent framework transfers the clinical interview protocol into a hierarchical state machine and context tree, supporting over 130 diagnostic states while maintaining clinical standards.

Tags: Medical image computing | Clinical language intelligence | EHR and clinical prediction | Paper | Psychiatric Comorbidity | Diagnostic Dialogue

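Paper | ICLR 2026 Poster | 2026 | EHR and clinical prediction

Repurposing Foundation Models for Generalizable Medical Time Series Classification

FORMED repurposes a general-purpose time series foundation model for medical time series classification. It adapts the model across different medical time series datasets in a lightweight way, using task-related channel embeddings, label queries, and a shared decoding attention layer.

Tags: EHR and clinical prediction | Medical AI paper | Conference paper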

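Paper | ICLR 2026 Poster | 2026 | Medical multimodal

How Do Medical MLLMs Fail? A Study of Visual Grounding in Medical Images

A systematic study of the failure modes of medical MLLMs in visual grounding on medical images. It proposes the VGMED evaluation dataset and the VGRefine inference-time method, targeting medical visual question answering and medical image interpretation.

Tags: Medical multimodal | Medical AI paper | Conference paper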

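Paper | ICLR 2026 Poster | 2026 | Trustworthiness, safety, fairness, and privacy

Beyond Medical Exams: A Clinician-Annotated Fairness Dataset for Real-World Tasks and Ambiguity in Mental Healthcare

This ICLR 2026 Poster paper introduces MENTAT, a fairness evaluation dataset created and annotated by clinical experts that targets real-world tasks and ambiguity in mental healthcare. It is used to evaluate the performance and bias of language models on clinical decision-making tasks.

Tags: Trustworthiness, safety, fairness, and privacy | Medical AI paper | Conference paper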