站内搜索 - AI4Meder

论文ICLR 2026 Poster2026 年clinical prediction

Pixel-Level Residual Diffusion Transformer：可扩展 3D CT 体数据生成

ICLR 2026 Poster accepted paper at ICLR 2026. Generating high-resolution 3D CT volumes with fine details remains challenging due to substantial computational demands and optimization difficulties inherent to existing generative models. In this paper, we propose the Pixel-Level Residual Diffusion Transformer (PRDiT), a scalable generative framework that synthesizes high-quality 3D medical volumes directly at voxel-level. PRDiT introduces a two-stage training architecture comprising 1) a local denoiser in the form of an MLP-based blind estimator operating on overlapping 3D patches to separate low-frequency structures efficiently, and 2) a global residual diffusion transformer employing memory-efficient attention to model and refine high-frequency residuals across entire volumes. This coarse-to-fine modeling strategy simplifies optimization, enhances training stability, and effectively preserves subtle structures without the limitations of an autoencoder bottleneck.

医学影像计算 EHR 与临床预测论文 Medical Imaging 3D Diffusion Model Diffusion Transformer 查看论文详情

搜索医学 AI 论文与资源

1 条结果

Pixel-Level Residual Diffusion Transformer：可扩展 3D CT 体数据生成