论文详情
- 英文标题
- Missingness Bias Calibration in Feature Attribution Explanations
- 作者
- Shailesh Sridhar, Anton Xue, Eric Wong
- 期刊/会议
- ICLR 2026 Poster
- 发表年份
- 2026 年
- 研究方向
- trustworthy medical AI
ICLR 2026 Poster accepted paper at ICLR 2026. Popular explanation methods often produce unreliable feature importance scores due to missingness bias, a systematic distortion that arises when models are probed with ablated, out-of-distribution inputs. Existing solutions treat this as a deep representational flaw that requires expensive retraining or architectural modifications. In this work, we challenge this assumption and show that missingness bias can be effectively treated as a superficial artifact of the model's output space. We introduce MCal, a lightweight post-hoc method that corrects this bias by fine-tuning a simple linear head on the outputs of a frozen base model.
