数据资源abdominal CT and MRI with multi-organ annotationsabdominal multi-organ segmentation benchmarkAMOS 2022 challenge benchmark; see official Grand Challenge page申请访问 AMOS 腹部多器官分割基准
AMOS is an abdominal multi-organ segmentation benchmark with CT and MRI cases for evaluating versatile medical image segmentation models. It supports abdominal organ segmentation, modality-general segmentation, and benchmarking of robust 3D segmentation methods.
数据资源retinal fundus photographs with glaucoma and structure annotationsophthalmology fundus image challenge datasetREFUGE challenge dataset; official splits described on Grand Challenge申请访问 REFUGE 视网膜眼底青光眼挑战数据集
REFUGE is a retinal fundus imaging challenge dataset for glaucoma assessment. It supports glaucoma classification, optic disc and cup segmentation, fovea localization, and fair comparison of ophthalmology AI methods on color fundus photographs.
数据资源chest radiographs with pneumonia/lung opacity annotationschest X-ray pneumonia detection challenge datasetRSNA 2018 AI image challenge dataset开放访问 RSNA 肺炎检测挑战数据集
The RSNA Pneumonia Detection Challenge dataset is a chest radiograph benchmark for detecting pneumonia-related lung opacities. It supports object detection, chest X-ray classification, localization, and radiology AI evaluation under a competition framework.
数据资源upper extremity radiographs with abnormality labelsmusculoskeletal X-ray datasetLarge Stanford musculoskeletal radiograph dataset申请访问 MURA 肌骨 X 光数据集
MURA is a musculoskeletal radiograph dataset from Stanford for abnormality detection in upper extremity X-rays. It is used for radiology classification, fracture-related screening, musculoskeletal imaging AI, and human-AI comparison studies.
数据资源cine cardiac MRI with segmentation labelscardiac MRI segmentation datasetACDC challenge dataset; see official database page申请访问 ACDC 自动心脏诊断挑战数据集
ACDC is a cardiac MRI dataset for automated cardiac diagnosis and segmentation. It supports left and right ventricular segmentation, myocardium segmentation, cardiac function quantification, and evaluation of robust cardiac image analysis methods.
数据资源MRI, DXA, ultrasound, retinal imaging, genetics, and health recordspopulation-scale multimodal imaging cohortPopulation-scale UK Biobank imaging cohort; application required申请访问 UK Biobank 影像数据
UK Biobank Imaging provides large-scale imaging phenotypes linked to genetic, lifestyle, and health outcome data. It is used for population-scale medical imaging AI, disease risk prediction, representation learning, multimodal biomedical modeling, and epidemiological AI studies.
数据资源brain MRI with demographic and clinical variablesbrain MRI and neuroimaging dataset collectionOASIS cross-sectional and longitudinal releases; see official site开放访问 OASIS 脑 MRI 与神经影像数据集
OASIS provides open-access neuroimaging datasets for studying normal aging, dementia, and brain structure. It is useful for brain MRI segmentation, age prediction, dementia classification, longitudinal modeling, and neuroimaging method benchmarking.
数据资源MRI, PET, biomarkers, clinical and cognitive assessmentslongitudinal neuroimaging and clinical datasetLongitudinal ADNI cohort data; access through ADNI/LONI申请访问 ADNI 阿尔茨海默病神经影像倡议数据集
ADNI provides longitudinal neuroimaging, biomarker, clinical, and cognitive data for Alzheimer disease research. It supports disease progression modeling, dementia diagnosis, multimodal prediction, biomarker discovery, and clinical translation studies.
数据资源cardiac ultrasound videos with functional annotationsechocardiography video datasetLarge echocardiography video dataset; see official site申请访问 EchoNet-Dynamic 心脏超声视频数据集
EchoNet-Dynamic is a cardiac ultrasound video dataset with expert annotations for left ventricular function. It is used for echocardiography video understanding, ejection fraction estimation, cardiac segmentation, and clinical video AI research.
数据资源histopathology whole-slide imagesdigital pathology whole-slide image datasetCAMELYON17 challenge dataset; see Grand Challenge page申请访问 CAMELYON17 组织病理淋巴结转移数据集
CAMELYON17 is a digital pathology dataset for detecting breast cancer metastases in lymph node whole-slide images across multiple centers. It supports pathology classification, metastasis detection, weakly supervised learning, and domain generalization in histopathology AI.
数据资源dermoscopic and clinical skin lesion imagesdermatology image archiveLarge public ISIC dermatology image archive开放访问 ISIC Archive 皮肤病学图像数据集
The ISIC Archive is a large public dermatology image repository for skin lesion analysis. It is widely used for melanoma classification, lesion segmentation, dermoscopic image retrieval, bias and domain shift analysis, and clinical imaging benchmark development.
数据资源raw MRI k-space and reconstructed MRI dataMRI reconstruction datasetLarge raw MRI reconstruction dataset; see official site申请访问 fastMRI 原始 MRI 重建数据集
fastMRI is a raw MRI dataset for accelerated magnetic resonance image reconstruction, originally released by NYU Langone Health and Meta AI. It is used for MRI reconstruction, compressed sensing replacement, generative reconstruction, and robustness evaluation.
数据资源2D and 3D biomedical imagesstandardized biomedical image benchmark12 2D datasets and 6 3D datasets in MedMNIST v2开放访问 MedMNIST v2 生物医学图像基准
MedMNIST v2 is a standardized collection of lightweight biomedical image classification datasets, including 2D and 3D tasks. It is useful for quick benchmarking, AutoML, foundation model sanity checks, and reproducible evaluation across multiple medical imaging domains.
数据资源multimodal brain MRI with tumor annotationsbrain tumor MRI segmentation challenge datasetBraTS 2024 challenge dataset; see Synapse project申请访问 BraTS 2024 脑肿瘤分割挑战数据集
BraTS 2024 provides multimodal brain MRI data and expert annotations for brain tumor segmentation and related tumor subregion analysis. It is a major benchmark for glioma segmentation, radiology AI, and robust multimodal MRI segmentation methods.
数据资源abdominal CT with kidney and tumor annotationskidney tumor CT segmentation datasetTCIA C4KC-KiTS collection; see collection page开放访问 C4KC-KiTS 肾肿瘤分割集合
C4KC-KiTS is a TCIA imaging collection associated with kidney and kidney tumor segmentation benchmarks. It supports kidney segmentation, renal tumor segmentation, surgical planning research, and evaluation of abdominal CT segmentation models.
数据资源thoracic CT images with nodule annotationslung CT nodule datasetTCIA LIDC-IDRI collection开放访问 LIDC-IDRI 肺部 CT 结节数据集
LIDC-IDRI is a lung CT dataset with thoracic CT scans and expert nodule annotations. It is a classic benchmark for lung nodule detection, segmentation, malignancy characterization, radiomics, and computer-aided diagnosis research.
数据资源chest radiographs with radiologist annotationschest X-ray detection and classification datasetVinDr-CXR release on PhysioNet; version 1.0.0开放访问 VinDr-CXR:越南胸部 X 光数据集
VinDr-CXR is a chest X-ray dataset with radiologist annotations from Vietnamese hospitals. It supports abnormality classification, lesion localization, radiology object detection, and robustness studies across clinical sites and populations.
数据资源frontal chest radiographs with image-level labelschest X-ray classification datasetNIH public ChestX-ray14 release开放访问 NIH ChestX-ray14 数据集
NIH ChestX-ray14 is a public chest radiograph dataset with image-level labels for thoracic disease findings mined from reports. It is commonly used for chest X-ray classification, weak supervision, thoracic disease detection, and radiology benchmark comparisons.
数据资源chest radiographs with multi-label findingschest X-ray classification datasetLarge-scale Stanford chest X-ray dataset申请访问 CheXpert 胸部 X 光数据集
CheXpert is a large chest radiograph dataset from Stanford with uncertainty-aware labels for common chest X-ray findings. It is widely used for radiology classification, label uncertainty modeling, chest X-ray representation learning, and clinical imaging benchmarks.
数据资源EEG and polysomnography biosignalssleep physiology signal datasetExpanded Sleep-EDF PhysioNet dataset; version 1.0.0开放访问 Sleep-EDF Expanded 多导睡眠图数据集
Sleep-EDF Expanded contains polysomnographic sleep recordings with EEG and related physiological signals. It is used for sleep stage classification, biosignal time-series modeling, self-supervised learning on physiological signals, and clinical sleep research benchmarks.
数据资源chest radiographs with radiology reportschest X-ray image-report datasetLarge-scale CXR image-report dataset; version 2.1.0申请访问 MIMIC-CXR v2.1.0 胸部 X 光数据集
MIMIC-CXR is a large deidentified chest radiograph dataset with associated free-text radiology reports. It is widely used for chest X-ray classification, report generation, image-text representation learning, radiology retrieval, and medical multimodal foundation model evaluation.
数据资源medical images with bilingual visual questions and answersmedical visual question answering datasetBilingual medical VQA dataset; see official project page开放访问 SLAKE:语义标注、知识增强医学 VQA 数据集
SLAKE is a semantically labeled medical visual question answering dataset with bilingual English-Chinese questions, medical images, and knowledge-enhanced annotations. It is useful for medical multimodal learning, image-grounded QA, and radiology VQA evaluation.
数据资源CT癌症影像TCIA collection申请访问 National Lung Screening Trial 数据集合
Low-dose CT imaging collection from the National Lung Screening Trial, distributed by The Cancer Imaging Archive.
数据资源CT/MRI分割基准10 segmentation tasks开放访问 Medical Segmentation Decathlon 医学分割十项全能
Legacy multi-task biomedical image segmentation benchmark retained as a reference; newer segmentation benchmarks are listed above it.
数据资源胸部 X 光放射影像112,120 frontal-view X-ray images开放访问 NIH ChestX-ray14 数据集
NIH Clinical Center chest X-ray dataset released for computer-aided detection and radiology machine learning research.
数据资源胸部 X 光放射影像224,316 chest radiographs申请访问 CheXpert
Stanford chest radiograph dataset for automated chest X-ray interpretation and uncertainty-aware label evaluation.
数据资源胸部 X 光放射影像PhysioNet v2.1.0受限访问 MIMIC-CXR-JPG v2.1.0
JPG-formatted chest radiographs with labels derived from free-text reports, hosted by PhysioNet.
数据资源Biomedical imagesTool/modelFoundation model and code开放访问 BiomedParse 生物医学图像解析基础模型
Foundation model and toolkit for all-in-one biomedical image parsing across recognition, detection, and segmentation tasks.
数据资源Text and medical imagesModelMedGemma / MedSigLIP model family开放访问 MedGemma / MedSigLIP 医学 AI 模型
Google Health AI Developer Foundations open model resources for medical text and medical image understanding, including MedGemma 1.5 resources.
数据资源医学影像分割基准IMed-361M / IMIS-Bench开放访问 IMed-361M / IMIS-Bench 交互式医学图像分割基准
Interactive medical image segmentation benchmark and baseline from CVPR 2025, covering multiple modalities, organs, and target structures.
数据资源Multimodal clinical dataBenchmarkICML 2025 benchmark开放访问 CLIMB 临床基础模型基准
Multimodal clinical data foundation and benchmark introduced at ICML 2025 for clinical foundation model research.