AI4Meder
返回论文列表
论文ICLR 2026 Poster2026 年clinical prediction

视频理解中的人脑:动态专家混合模型

ICLR 2026 Poster accepted paper at ICLR 2026. The human brain is the most efficient and versatile system for processing dynamic visual input. By comparing representations from deep video models to brain activity, we can gain insights into mechanistic solutions for effective video processing, important to better understand the brain and to build better models. Current works in model-brain alignment primarily focus on fMRI measurements, leaving open questions about fine-grained dynamic processing. Here, we introduce the first large-scale model benchmarking on alignment to dynamic electroencephalography (EEG) recordings of short natural videos. We analyze 100+ models across the axes of temporal integration, classification task, architecture, and pretraining, using our proposed Cross-Temporal Representational Similarity Analysis (CT-RSA) which matches the best time-unfolded model features to dynamically evolving brain responses, distilling $10^7$ alignment scores.

论文默认配图 - 医学影像计算

论文详情

英文标题
The Human Brain as a Dynamic Mixture of Expert Models in Video Understanding
作者
Christina Sartzetaki, Anne W. Zonneveld, Pablo Oyarzo, Alessandro Thomas Gifford, Radoslaw Martin Cichy, Pascal Mettes, Iris Groen
期刊/会议
ICLR 2026 Poster
发表年份
2026 年
研究方向
clinical prediction