
AI4Meder Site Search

Search Medical AI Papers and Resources

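Search community content by paper, data resource, technical competition, submission deadline, and course resource, and jump straight to the matching detail page.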


12 results

Tag: Robustness | Scope: Papers

Paper | ICLR 2026 Poster | 2026 | Medical Imaging

CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis

Accepted as a poster at ICLR 2026. Spectral imaging offers promising applications across diverse domains, including medicine and urban scene understanding, and is already established as a critical modality in remote sensing. However, variability in channel dimensionality and captured wavelengths among spectral cameras impedes the development of AI-driven methodologies, leading to camera-specific models with limited generalizability and inadequate cross-camera applicability. To address this bottleneck, we introduce CARL, a model for Camera-Agnostic Representation Learning across RGB, multispectral, and hyperspectral imaging modalities. To enable the conversion of a spectral image with any channel dimensionality to a camera-agnostic representation, we introduce a novel spectral encoder, featuring a self-attention-cross-attention mechanism, to distill salient spectral information into learned spectral representations. Code/project link: https://github.com/IMSY-DKFZ/CARL

Medical Image Computing | Paper | Representation Learning Self-Supervised Learning Spectral Imaging ICLR 2026

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Automatic Structure-Aware Sparsification of Hybrid Neural ODEs for Glucose Prediction

Accepted as a poster at ICLR 2026. Hybrid neural ordinary differential equations (neural ODEs) integrate mechanistic models with neural ODEs, offering strong inductive bias and flexibility, and are particularly advantageous in data-scarce healthcare settings. However, excessive latent states and interactions from mechanistic models can lead to training inefficiency and overfitting, limiting the practical effectiveness of hybrid neural ODEs. In response, we propose a new hybrid pipeline for automatic state selection and structure optimization in mechanistic neural ODEs, combining domain-informed graph modifications with data-driven regularization to sparsify the model, improving predictive performance and stability while retaining mechanistic plausibility. Experiments on synthetic and real-world data show improved predictive performance and robustness with the desired sparsity, establishing an effective solution for hybrid model reduction in healthcare applications.

EHR and Clinical Prediction | Trustworthiness, Safety, Fairness, and Privacy | Paper | Predictive Sparsity Hybrid Neural ODE Group LASSO

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Sequential Information Bottleneck Fusion: Towards Robust and Generalizable Multi-Modal Brain Tumor Segmentation

Accepted as a poster at ICLR 2026. Brain tumor segmentation in multi-modal MRIs poses significant challenges when one or more modalities are missing. Recent approaches commonly employ parallel fusion strategies; however, these methods risk losing crucial shared information across modalities, which can degrade segmentation performance. In this paper, we advocate leveraging sequential information bottleneck fusion to effectively preserve shared information across modalities. From an information-theoretic perspective, sequential fusion not only produces more robust fused representations in missing-data scenarios but also achieves a tighter generalization upper bound compared to parallel fusion approaches.

Medical Image Computing | Medical Multimodality | Trustworthiness, Safety, Fairness, and Privacy | Paper | Brain Tumor Segmentation Missing Modality

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

ODEBrain: Continuous-Time EEG Graphs for Modeling Dynamic Brain Networks

Accepted as a poster at ICLR 2026. Modeling neural population dynamics is crucial for foundational neuroscientific research and various clinical applications. Conventional latent variable methods typically model continuous brain dynamics by discretizing time with recurrent architectures, which necessarily results in compounding prediction errors and a failure to capture the instantaneous, nonlinear characteristics of EEGs. We propose ODEBrain, a Neural ODE latent dynamic forecasting framework that overcomes these challenges by integrating spatio-temporal-frequency features into spectral graph nodes, followed by a Neural ODE modeling the continuous latent dynamics. Our design ensures that the latent representations can capture stochastic variations of complex brain states at any given time point.

Medical Image Computing | EHR and Clinical Prediction | Trustworthiness, Safety, Fairness, and Privacy | Paper | EEG ODE

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Johnson-Lindenstrauss Lemma-Guided Network for Efficient 3D Medical Segmentation

Accepted as a poster at ICLR 2026. Lightweight 3D medical image segmentation remains constrained by a fundamental "efficiency / robustness conflict", particularly when processing complex anatomical structures and heterogeneous modalities. In this paper, we study how to redesign the framework based on the characteristics of high-dimensional 3D images, and explore data synergy to overcome the fragile representations of lightweight methods. Our approach, VeloxSeg, begins with a deployable and extensible dual-stream CNN-Transformer architecture composed of Paired Window Attention (PWA) and Johnson-Lindenstrauss lemma-guided convolution (JLC). For each 3D image, we invoke a "glance-and-focus" principle, where PWA rapidly retrieves multi-scale information, and JLC ensures robust local feature extraction with minimal parameters, significantly enhancing the model's ability to operate under a low computational budget. Code/project link: https://github.com/JinPLu/VeloxSeg

Medical Image Computing | Medical Multimodality | EHR and Clinical Prediction | Paper | Efficient Medical segmentation multimodal learning

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

PathChat-SegR1: Reasoning Segmentation in Pathology via SO-GRPO

Accepted as a poster at ICLR 2026. Segmentation in pathology images requires handling out-of-domain tissue morphologies and new pathologies beyond the training distribution, where traditional closed-set segmentation approaches fail to generalize. Reasoning segmentation enables zero-shot generalization via prompting with text queries. However, existing reasoning segmentation models face three barriers when applied to pathology: (1) the vision encoder lacks pathology-specific knowledge and robustness to staining variations, (2) the large language model (LLM) backbone for reasoning fails to identify whether it has gathered sufficient semantic context to trigger the segmentation output, and (3) no reasoning segmentation benchmarks or datasets exist for pathology analysis. Consequently, we introduce PathChat-SegR1, a reasoning segmentation model built upon pathology-specific vision encoders trained with a novel stain-invariant self-distillation for robust pathology image representations.

Medical Image Computing | Clinical Language Intelligence | Paper | Clinical Reasoning Reinforcement Learning Reasoning Segmentation

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Nef-Net v2: Adapting Electrocardio Panorama in the Wild

Accepted as a poster at ICLR 2026. Conventional multi-lead electrocardiogram (ECG) systems capture cardiac signals from a fixed set of anatomical viewpoints defined by lead placement. However, certain cardiac conditions (e.g., Brugada syndrome) require additional, non-standard viewpoints to reveal diagnostically critical patterns that may be absent in standard leads. To systematically overcome this limitation, Nef-Net was recently introduced to reconstruct a continuous electrocardiac field, enabling virtual observation of ECG signals from arbitrary views (termed Electrocardio Panorama). Despite its promise, Nef-Net operates under idealized assumptions and faces in-the-wild challenges, such as long-duration ECG modeling, robustness to device-specific signal artifacts, and suboptimal lead placement calibration. Code/project link: https://github.com/HKUSTGZ-ML4Health-Lab/NEFNET-v2

Medical Image Computing | EHR and Clinical Prediction | Trustworthiness, Safety, Fairness, and Privacy | Paper | ECG representation Cardiac Diagnosis

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Cross-Timestep: A 3D Diffusion Model for Medical Segmentation with Trans-Temporal Memory LSTM and Adaptive Priori Decoding

Accepted as a poster at ICLR 2026. Diffusion models have recently demonstrated significant robustness in medical image segmentation, effectively accommodating variations across different imaging styles. However, their applications remain limited due to: (i) current successes being primarily confined to 2D segmentation tasks, as we observe that diffusion models tend to collapse at an early stage when applied to 3D medical tasks; and (ii) the inherently isolated iteration along timesteps during training and inference. To tackle these limitations, we propose a novel framework named Cross-Timestep, which incorporates two key innovations: an Adaptive Priori Decoding Strategy (APDS) and a trans-temporal memory LSTM (tLSTM) mechanism. (i) The APDS provides prior guidance during the diffusion process by employing a Priori Decoder (PD) that focuses solely on the conditional branch, successfully stabilizing the reverse diffusion process.

Medical Image Computing | Trustworthiness, Safety, Fairness, and Privacy | Paper | Diffusion Models Medical Image Segmentation LSTM

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Resp-Agent: An Agent-Based System for Multimodal Respiratory Sound Generation and Disease Diagnosis

Accepted as a poster at ICLR 2026. Deep learning-based respiratory auscultation is currently hindered by two fundamental challenges: (i) inherent information loss, as converting signals into spectrograms discards transient acoustic events and clinical context; (ii) limited data availability, exacerbated by severe class imbalance. To bridge these gaps, we present Resp-Agent, an autonomous multimodal system orchestrated by a novel Active Adversarial Curriculum Agent (Thinker-A²CA). Unlike static pipelines, Thinker-A²CA serves as a central controller that actively identifies diagnostic weaknesses and schedules targeted synthesis in a closed loop. To address the representation gap, we introduce a modality-weaving Diagnoser that weaves clinical text with audio tokens via strategic global attention and sparse audio anchors, capturing both long-range clinical context and millisecond-level transients. Code/project link: https://github.com/zpforlove/Resp-Agent

Medical Image Computing | Medical Multimodality | Clinical Language Intelligence | Paper | Respiratory sounds Multimodal learning

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning

Accepted as a poster at ICLR 2026. Medical Vision-Language Models (VLMs) hold immense promise for complex clinical tasks, but their reasoning capabilities are often constrained by text-only paradigms that fail to ground inferences in visual evidence. This limitation not only curtails performance on tasks requiring fine-grained visual analysis but also introduces risks of visual hallucination in safety-critical applications. Thus, we introduce MedVR, a novel reinforcement learning framework that enables annotation-free visual reasoning for medical VLMs. Its core innovation lies in two synergistic mechanisms: Entropy-guided Visual Regrounding (EVR) uses model uncertainty to direct exploration, while Consensus-based Credit Assignment (CCA) distills pseudo-supervision from rollout agreement.

Medical Image Computing | Medical Multimodality | Clinical Language Intelligence | Paper | Think with images Medical visual reasoning

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

Adaptive Test-Time Training for Predicting the Need for Invasive Mechanical Ventilation in Multi-Center Cohorts

Accepted as a poster at ICLR 2026. Accurate prediction of the need for invasive mechanical ventilation (IMV) in intensive care unit (ICU) patients is crucial for timely interventions and resource allocation. However, variability in patient populations, clinical practices, and electronic health record (EHR) systems across institutions introduces domain shifts that degrade the generalization performance of predictive models during deployment. Test-Time Training (TTT) has emerged as a promising approach to mitigate such shifts by adapting models dynamically during inference without requiring labeled target-domain data. In this work, we introduce Adaptive Test-Time Training (AdaTTT), an enhanced TTT framework tailored for EHR-based IMV prediction in ICU settings.

Medical Image Computing | Clinical Language Intelligence | EHR and Clinical Prediction | Paper | Test-Time Training Domain Adaptation

Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

MedAgent-Pro: Towards Evidence-Based Multi-Modal Medical Diagnosis via a Reasoning Agentic Workflow

Accepted as a poster at ICLR 2026. Modern clinical diagnosis relies on the comprehensive analysis of multi-modal patient data, drawing on medical expertise to ensure systematic and rigorous reasoning. Recent advances in Vision–Language Models (VLMs) and agent-based methods are reshaping medical diagnosis by effectively integrating multi-modal information. However, they often output direct answers and empirically driven conclusions without clinical evidence supported by quantitative analysis, which compromises their reliability and hinders clinical usability. Here we propose MedAgent-Pro, an agentic reasoning paradigm that mirrors modern diagnosis principles via a hierarchical diagnostic workflow, consisting of disease-level standardized plan generation and patient-level personalized step-by-step reasoning.

Medical Image Computing | Medical Multimodality | Clinical Language Intelligence | Paper | Medical AI Agentic AI