AI4Meder

AI4Meder 站内搜索

搜索医学 AI 论文与资源

按论文、数据资源、技术竞赛、投稿截止日期和课程资源检索社区内容,快速进入对应详情页。

24 条结果

输入关键词或点击标签,按论文、数据资源、竞赛截止日期、征稿与课程缩小范围。 标签:deployment

清空筛选
论文ICLR 2026 Poster2026 年clinical prediction

MedAraBench:大规模阿拉伯语医学问答数据集与基准

ICLR 2026 Poster accepted paper at ICLR 2026. Arabic remains one of the most underrepresented languages in natural language processing research, particularly in medical applications, due to the limited availability of open-source data and benchmarks. The lack of resources hinders efforts to evaluate and advance the multilingual capabilities of Large Language Models (LLMs). In this paper, we introduce MedAraBench, a large-scale dataset consisting of Arabic multiple-choice question-answer pairs across various medical specialties. We constructed the dataset by manually digitizing a large repository of academic materials created by medical professionals in the Arabic-speaking region.

论文ICLR 2026 Poster2026 年trustworthy medical AI

随机锚点与低秩去相关学习:类增量医学图像分类的极简流程

ICLR 2026 Poster accepted paper at ICLR 2026. Class-incremental learning (CIL) in medical image-guided diagnosis requires models to preserve knowledge of historical disease classes while adapting to emerging categories. Pre-trained models (PTMs) with well-generalized features provide a strong foundation, yet most PTM-based CIL strategies, such as prompt tuning, task-specific adapters and model mixtures, rely on increasingly complex designs. While effective in general-domain benchmarks, these methods falter in medical imaging, where low intra-class variability and high inter-domain shifts (from scanners, protocols and institutions) make CIL particularly prone to representation collapse and domain misalignment. Under such conditions, we find that lightweight representation calibration strategies, often dismissed in general-domain CIL for their modest gains, can be remarkably effective for adapting PTMs in medical settings.

论文ICLR 2026 Poster2026 年trustworthy medical AI

从对话到查询执行:EHR 数据库 Agent 的用户与工具交互基准

ICLR 2026 Poster accepted paper at ICLR 2026. Despite the impressive performance of LLM-powered agents, their adoption for Electronic Health Record (EHR) data access remains limited by the absence of benchmarks that adequately capture real-world clinical data access flows. In practice, two core challenges hinder deployment: query ambiguity from vague user questions and value mismatch between user terminology and database entries. To address this, we introduce EHR-ChatQA, an interactive database question answering benchmark that evaluates the end-to-end workflow of database agents: clarifying user questions, using tools to resolve value mismatches, and generating correct SQL to deliver accurate answers. To cover diverse patterns of query ambiguity and value mismatch, EHR-ChatQA assesses agents in a simulated environment with an LLM-based user across two interaction flows: Incremental Query Refinement (IncreQA), where users add constraints to existing queries, and Adaptive Query Refinement (AdaptQA), where users adjust their search goals mid-conversation. Code/project link: https://github.com/glee4810/EHR-ChatQA

论文ICLR 2026 Poster2026 年clinical prediction

重用基础模型实现可泛化医学时间序列分类

ICLR 2026 Poster accepted paper at ICLR 2026. Medical time series (MedTS) classification suffers from poor generalizability in real-world deployment due to inter- and intra-dataset heterogeneity, such as varying numbers of channels, signal lengths, task definitions, and patient characteristics. % implicit patient characteristics, variable channel configurations, time series lengths, and diagnostic tasks. To address this, we propose FORMED, a novel framework for repurposing a backbone foundation model, pre-trained on generic time series, to enable highly generalizable MedTS classification on unseen datasets. FORMED combines the backbone with a novel classifier comprising two components: (1) task-specific channel embeddings and label queries, dynamically sized to match any number of channels and target classes, and (2) a shared decoding attention layer, jointly trained across datasets to capture medical domain knowledge through task-agnostic feature-query interactions.

论文ICLR 2026 Poster2026 年trustworthy medical AI

MedVR:通过 Agent 强化学习实现无标注医学视觉推理

ICLR 2026 Poster accepted paper at ICLR 2026. Medical Vision-Language Models (VLMs) hold immense promise for complex clinical tasks, but their reasoning capabilities are often constrained by text-only paradigms that fail to ground inferences in visual evidence. This limitation not only curtails performance on tasks requiring fine-grained visual analysis but also introduces risks of visual hallucination in safety-critical applications. Thus, we introduce MedVR, a novel reinforcement learning framework that enables annotation-free visual reasoning for medical VLMs. Its core innovation lies in two synergistic mechanisms: Entropy-guided Visual Regrounding (EVR) uses model uncertainty to direct exploration, while Consensus-based Credit Assignment (CCA) distills pseudo-supervision from rollout agreement.

论文ICLR 2026 Poster2026 年trustworthy medical AI

多中心队列中有创机械通气需求预测的自适应测试时训练

ICLR 2026 Poster accepted paper at ICLR 2026. Accurate prediction of the need for invasive mechanical ventilation (IMV) in intensive care units (ICUs) patients is crucial for timely interventions and resource allocation. However, variability in patient populations, clinical practices, and electronic health record (EHR) systems across institutions introduces domain shifts that degrade the generalization performance of predictive models during deployment. Test-Time Training (TTT) has emerged as a promising approach to mitigate such shifts by adapting models dynamically during inference without requiring labeled target-domain data. In this work, we introduce Adaptive Test-Time Training (AdaTTT), an enhanced TTT framework tailored for EHR-based IMV prediction in ICU settings.

论文ICLR 2026 Poster2026 年trustworthy medical AI

NurValues:临床情境中大语言模型的真实护理价值观评测

ICLR 2026 Poster accepted paper at ICLR 2026. While LLMs have demonstrated medical knowledge and conversational ability, their deployment in clinical practice raises new risks: patients may place greater trust in LLM-generated responses than in nurses' professional judgments, potentially intensifying nurse–patient conflicts. Such risks highlight the urgent need of evaluating whether LLMs align with the core nursing values upheld by human nurses. This work introduces the first benchmark for nursing value alignment, consisting of five core value dimensions distilled from international nursing codes: _Altruism_, _Human Dignity_, _Integrity_, _Justice_, and _Professionalism_. We define two-level tasks on the benchmark, considering the two characteristics of emerging nurse–patient conflicts.

论文ICLR 2026 Poster2026 年trustworthy medical AI

能否用 LLM 为临床时间序列数据生成可迁移表征?

ICLR 2026 Poster accepted paper at ICLR 2026. Deploying clinical ML is slow and brittle: models that work at one hospital often degrade under distribution shifts at the next. In this work, we study a simple question -- can large language models (LLMs) create portable patient embeddings i.e. representations of patients enable a downstream predictor built on one hospital to be used elsewhere with minimal-to-no retraining and fine-tuning. To do so, we map from irregular ICU time series onto concise natural language summaries using a frozen LLM, then embed each summary with a frozen text embedding model to obtain a fixed length vector capable of serving as input to a variety of downstream predictors.

论文npj Digital Medicine2025 年医学影像计算

基于智能手机视频深度学习准确评估帕金森病步态障碍

步态障碍是帕金森病(PD)中最常见且最具致残性的症状之一,其表现复杂且高度异质。在此,我们提出了一种基于深度学习的框架,利用智能手机录制的视频评估步态障碍。该框架在预测 PD 严重程度方面表现出色,微平均受试者工作特征曲线下面积(AUC)为 0.87,F1 分数为 0.806,与三位临床专家的平均表现相当。此外,它以 73.68%的精度有效区分了药物对步态障碍的整体疗效。特别是,它能够区分统一帕金森病评分量表(UPDRS)分辨率之外的药物诱导的细微粒度步态变化。此外,我们的可解释框架能够提取传统临床使用的运动指标,并发现对疾病进展和药物反应敏感的新数字生物标志物。 这些发现强调了其在临床和家庭环境中高效评估疾病进展的巨大潜力,以及在临床试验中评估疾病修饰效果的潜力,以促进个性化治疗。

征稿与合作npj Digital Medicine截止 北京时间 2026-06-03期刊专刊

npj Digital Medicine 专辑:AI 真实世界临床表现评估

This Nature Portfolio / npj Digital Medicine collection is open for submissions until 2026-06-03. It invites research on real-world clinical performance of AI, including clinical utility, safety, reliability, equity, generalizability, workflow integration, post-deployment monitoring, transparency, patient-centered outcomes, and clinician-AI interaction.

征稿与合作npj Digital Medicine截止 北京时间 2026-07-12期刊专刊

npj Digital Medicine 专辑:Agentic AI 对照护交付的影响

This Nature Portfolio / npj Digital Medicine collection is open for submissions until 2026-07-12. It calls for work on agentic AI in care delivery, including real-time evidence-based decision support, virtual and remote patient care, multimodal and longitudinal clinical data, EHRs, medical imaging, genomics, resource-limited deployment, ethics, regulation, quality, and patient safety.

征稿与合作npj Cardiovascular Health截止 北京时间 2026-06-12期刊专刊

npj Cardiovascular Health 专辑:使用 AI 与 ML 的临床照护和以患者为中心的互动

This Nature Portfolio / npj Cardiovascular Health collection is open for submissions until 2026-06-12. It calls for AI and machine learning work in cardiovascular care, including diagnostic evaluation, CVD risk prediction, cardiac imaging and ECG deep learning, EHR tooling, multi-omics, wearables, remote monitoring, deployment, transparency, ethics, and regulation.

征稿与合作Applied Sciences截止 北京时间 2026-08-30期刊专刊

MDPI Applied Sciences 专刊:AI 驱动医疗

This Applied Sciences special issue calls for AI-driven healthcare work, including explainable and trustworthy AI, EHR and clinical text, bias and fairness, public health and epidemiology, translational AI, real-world clinical deployment, benchmarks, and reproducibility. The page lists a manuscript submission deadline of 2026-08-30.

征稿与合作INFORMS Journal on Data Science截止 北京时间 2026-09-15期刊专刊

INFORMS Journal on Data Science 专刊:医疗人工智能与数据科学

The INFORMS Journal on Data Science CFP focuses on artificial intelligence and data science for healthcare. It is relevant to medical AI work on clinical prediction, decision support, operations, fairness, reliability, and deployment of data-driven healthcare systems. The CFP page lists a 2026 special issue call and submission timeline.