论文ICLR 2026 Poster2026 年clinical prediction

视频理解中的人脑：动态专家混合模型

ICLR 2026 Poster accepted paper at ICLR 2026. The human brain is the most efficient and versatile system for processing dynamic visual input. By comparing representations from deep video models to brain activity, we can gain insights into mechanistic solutions for effective video processing, important to better understand the brain and to build better models. Current works in model-brain alignment primarily focus on fMRI measurements, leaving open questions about fine-grained dynamic processing. Here, we introduce the first large-scale model benchmarking on alignment to dynamic electroencephalography (EEG) recordings of short natural videos. We analyze 100+ models across the axes of temporal integration, classification task, architecture, and pretraining, using our proposed Cross-Temporal Representational Similarity Analysis (CT-RSA) which matches the best time-unfolded model features to dynamically evolving brain responses, distilling $10^7$ alignment scores.

医学影像计算 EHR 与临床预测论文 representational alignment Representational Similarity Analysis RSA benchmarking neuro-AI video AI neuroscience EEG ICLR 2026

论文详情

英文标题: The Human Brain as a Dynamic Mixture of Expert Models in Video Understanding
作者: Christina Sartzetaki, Anne W. Zonneveld, Pablo Oyarzo, Alessandro Thomas Gifford, Radoslaw Martin Cichy, Pascal Mettes, Iris Groen
期刊/会议: ICLR 2026 Poster
发表年份: 2026 年
研究方向: clinical prediction