数据资源critical care time-series variables and outcomesICU time-series benchmark datasetPhysioNet Challenge 2012 dataset; version 1.0.0开放访问 PhysioNet/CinC 2012 ICU 时间序列数据集
The PhysioNet/CinC Challenge 2012 dataset contains ICU time-series records used for mortality prediction and patient-specific outcome modeling. It remains a useful benchmark for clinical time-series modeling, missingness-aware learning, and early warning model development.
数据资源Chinese community medical questions and answersChinese medical QA datasetUpdated cMedQA dataset; see official repository开放访问 cMedQA2:中文社区医学问答数据集
cMedQA2 is an updated Chinese community medical question answering dataset for question-answer matching and medical QA research. It is useful for training and evaluating Chinese medical retrieval, ranking, and answer selection models.
数据资源chest radiographs with pneumonia/lung opacity annotationschest X-ray pneumonia detection challenge datasetRSNA 2018 AI image challenge dataset开放访问 RSNA 肺炎检测挑战数据集
The RSNA Pneumonia Detection Challenge dataset is a chest radiograph benchmark for detecting pneumonia-related lung opacities. It supports object detection, chest X-ray classification, localization, and radiology AI evaluation under a competition framework.
数据资源genomics, transcriptomics, clinical metadata, and pathology-related datacancer genomics and clinical datasetLarge multi-cancer TCGA program dataset开放访问 TCGA 癌症基因组数据集
The Cancer Genome Atlas is a large cancer genomics resource with molecular, clinical, and pathology-related data across many cancer types. It is a foundation dataset for oncology AI, survival prediction, subtype discovery, multimodal cancer modeling, and translational biomarker research.
数据资源brain MRI with demographic and clinical variablesbrain MRI and neuroimaging dataset collectionOASIS cross-sectional and longitudinal releases; see official site开放访问 OASIS 脑 MRI 与神经影像数据集
OASIS provides open-access neuroimaging datasets for studying normal aging, dementia, and brain structure. It is useful for brain MRI segmentation, age prediction, dementia classification, longitudinal modeling, and neuroimaging method benchmarking.
数据资源dermoscopic and clinical skin lesion imagesdermatology image archiveLarge public ISIC dermatology image archive开放访问 ISIC Archive 皮肤病学图像数据集
The ISIC Archive is a large public dermatology image repository for skin lesion analysis. It is widely used for melanoma classification, lesion segmentation, dermoscopic image retrieval, bias and domain shift analysis, and clinical imaging benchmark development.
数据资源2D and 3D biomedical imagesstandardized biomedical image benchmark12 2D datasets and 6 3D datasets in MedMNIST v2开放访问 MedMNIST v2 生物医学图像基准
MedMNIST v2 is a standardized collection of lightweight biomedical image classification datasets, including 2D and 3D tasks. It is useful for quick benchmarking, AutoML, foundation model sanity checks, and reproducible evaluation across multiple medical imaging domains.
数据资源abdominal CT with kidney and tumor annotationskidney tumor CT segmentation datasetTCIA C4KC-KiTS collection; see collection page开放访问 C4KC-KiTS 肾肿瘤分割集合
C4KC-KiTS is a TCIA imaging collection associated with kidney and kidney tumor segmentation benchmarks. It supports kidney segmentation, renal tumor segmentation, surgical planning research, and evaluation of abdominal CT segmentation models.
数据资源thoracic CT images with nodule annotationslung CT nodule datasetTCIA LIDC-IDRI collection开放访问 LIDC-IDRI 肺部 CT 结节数据集
LIDC-IDRI is a lung CT dataset with thoracic CT scans and expert nodule annotations. It is a classic benchmark for lung nodule detection, segmentation, malignancy characterization, radiomics, and computer-aided diagnosis research.
数据资源chest radiographs with radiologist annotationschest X-ray detection and classification datasetVinDr-CXR release on PhysioNet; version 1.0.0开放访问 VinDr-CXR:越南胸部 X 光数据集
VinDr-CXR is a chest X-ray dataset with radiologist annotations from Vietnamese hospitals. It supports abnormality classification, lesion localization, radiology object detection, and robustness studies across clinical sites and populations.
数据资源frontal chest radiographs with image-level labelschest X-ray classification datasetNIH public ChestX-ray14 release开放访问 NIH ChestX-ray14 数据集
NIH ChestX-ray14 is a public chest radiograph dataset with image-level labels for thoracic disease findings mined from reports. It is commonly used for chest X-ray classification, weak supervision, thoracic disease detection, and radiology benchmark comparisons.
数据资源EEG and polysomnography biosignalssleep physiology signal datasetExpanded Sleep-EDF PhysioNet dataset; version 1.0.0开放访问 Sleep-EDF Expanded 多导睡眠图数据集
Sleep-EDF Expanded contains polysomnographic sleep recordings with EEG and related physiological signals. It is used for sleep stage classification, biosignal time-series modeling, self-supervised learning on physiological signals, and clinical sleep research benchmarks.
数据资源12-lead ECG waveforms with diagnostic labelsECG waveform benchmarkLarge public ECG dataset; version 1.0.3开放访问 PTB-XL:大型开放 12 导联 ECG 数据集
PTB-XL is a large public 12-lead electrocardiography dataset with diagnostic statements and waveform records. It is a standard benchmark for ECG classification, cardiac abnormality detection, clinical signal representation learning, and robust evaluation of biosignal models.
数据资源medical images with bilingual visual questions and answersmedical visual question answering datasetBilingual medical VQA dataset; see official project page开放访问 SLAKE:语义标注、知识增强医学 VQA 数据集
SLAKE is a semantically labeled medical visual question answering dataset with bilingual English-Chinese questions, medical images, and knowledge-enhanced annotations. It is useful for medical multimodal learning, image-grounded QA, and radiology VQA evaluation.
数据资源Chinese conversational medical QA textChinese medical conversational QA datasetLarge-scale Chinese medical CQA dataset; see official repository开放访问 CMCQA:中文医学会话问答数据集
CMCQA is a large Chinese medical conversational question-answering dataset released with knowledge-grounded medical dialogue research. It supports medical conversation QA, knowledge-grounded response generation, and evaluation of Chinese medical dialogue systems.
数据资源Chinese medical instruction and dialogue textChinese medical instruction-tuning datasetAbout 140K medical SFT examples; see Hugging Face card开放访问 HuatuoGPT2-SFT-GPT4-140K 医学指令数据集
HuatuoGPT2-SFT-GPT4-140K is a Chinese medical supervised fine-tuning dataset containing medical instruction-style conversations and GPT-4-assisted responses. It is useful for Chinese medical assistant alignment and medical LLM instruction tuning.
数据资源Chinese medical question-answer textChinese medical QA corpusAbout 26 million medical QA pairs开放访问 Huatuo-26M:大规模中文医学问答数据集
Huatuo-26M is a large-scale Chinese medical question-answering dataset with about 26 million QA pairs collected for medical language modeling and medical dialogue research. It is suitable for Chinese medical LLM pretraining, fine-tuning, and QA system development.
数据资源medical exam question-answer textmedical exam QA benchmarkUSMLE, Mainland China, and Taiwan exam-style QA splits; see repository开放访问 MedQA:含美国、中国大陆与台湾拆分的医学考试问答数据集
MedQA is a medical examination question answering benchmark with English and Chinese medical licensing-style question sets, including mainland China and Taiwan variants. It is widely used for medical QA and medical reasoning evaluation.
数据资源Chinese consultation dialogue text with medical entity annotationsChinese medical dialogue generation datasetEntity-annotated dialogue dataset; see official repository开放访问 MedDG:实体中心中文医学对话生成数据集
MedDG is an entity-centric Chinese medical consultation dataset with domain entity annotations for medical dialogue generation. It supports entity-aware response generation, medical consultation modeling, and dialogue systems that ground responses in clinical concepts.
数据资源Chinese medical exam and QA textChinese medical LLM evaluation benchmarkMultiple Chinese medical exam and benchmark splits; see Hugging Face card开放访问 CMB:中文医学基准
CMB is a comprehensive Chinese medical benchmark for evaluating medical large language models on medical exams, reasoning, and clinical knowledge questions. It is suited for Chinese medical QA, LLM evaluation, and instruction-following assessment.
数据资源Chinese biomedical and clinical textChinese biomedical NLP benchmark8 biomedical NLU tasks; see official repository开放访问 CBLUE:中文生物医学语言理解评测基准
CBLUE is a Chinese biomedical language understanding benchmark covering real-world biomedical NLP tasks such as named entity recognition, relation extraction, term normalization, clinical trial classification, sentence similarity, and medical question answering. It is useful for evaluating Chinese clinical NLP models and medical language models.
数据资源CT/MRI分割基准10 segmentation tasks开放访问 Medical Segmentation Decathlon 医学分割十项全能
Legacy multi-task biomedical image segmentation benchmark retained as a reference; newer segmentation benchmarks are listed above it.
数据资源胸部 X 光放射影像112,120 frontal-view X-ray images开放访问 NIH ChestX-ray14 数据集
NIH Clinical Center chest X-ray dataset released for computer-aided detection and radiology machine learning research.
数据资源ECG 心电生理信号21,837 clinical 12-lead ECG records开放访问 PTB-XL ECG 数据库 v1.0.3
Large publicly available 12-lead ECG waveform dataset with diagnostic labels, hosted on PhysioNet.
数据资源Biomedical imagesTool/modelFoundation model and code开放访问 BiomedParse 生物医学图像解析基础模型
Foundation model and toolkit for all-in-one biomedical image parsing across recognition, detection, and segmentation tasks.
数据资源TextLLM benchmarkBenchmark and leaderboard开放访问 MedHELM 医学 LLM 评测基准
Medical LLM benchmark and leaderboard intended to broaden coverage beyond single medical QA datasets.
数据资源TextLLM evaluation benchmarkHealth AI evaluation benchmark开放访问 HealthBench 健康 AI 评测基准
Benchmark for evaluating health AI model safety, helpfulness, and clinical-relevance judgments with physician-reviewed rubrics.
数据资源Text and medical imagesModelMedGemma / MedSigLIP model family开放访问 MedGemma / MedSigLIP 医学 AI 模型
Google Health AI Developer Foundations open model resources for medical text and medical image understanding, including MedGemma 1.5 resources.
数据资源医学影像分割基准IMed-361M / IMIS-Bench开放访问 IMed-361M / IMIS-Bench 交互式医学图像分割基准
Interactive medical image segmentation benchmark and baseline from CVPR 2025, covering multiple modalities, organs, and target structures.
数据资源Multimodal clinical dataBenchmarkICML 2025 benchmark开放访问 CLIMB 临床基础模型基准
Multimodal clinical data foundation and benchmark introduced at ICML 2025 for clinical foundation model research.