论文ICLR 2026 Poster2026 年trustworthy medical AI

特征归因解释中的缺失偏倚校准

ICLR 2026 Poster accepted paper at ICLR 2026. Popular explanation methods often produce unreliable feature importance scores due to missingness bias, a systematic distortion that arises when models are probed with ablated, out-of-distribution inputs. Existing solutions treat this as a deep representational flaw that requires expensive retraining or architectural modifications. In this work, we challenge this assumption and show that missingness bias can be effectively treated as a superficial artifact of the model's output space. We introduce MCal, a lightweight post-hoc method that corrects this bias by fine-tuning a simple linear head on the outputs of a frozen base model.

医学影像计算临床语言智能可信、安全、公平与隐私论文 explainability feature attribution calibration missingness bias medical medicine

论文详情

英文标题: Missingness Bias Calibration in Feature Attribution Explanations
作者: Shailesh Sridhar, Anton Xue, Eric Wong
期刊/会议: ICLR 2026 Poster
发表年份: 2026 年
研究方向: trustworthy medical AI