AI4Meder 站内搜索

搜索医学 AI 论文与资源

按论文、数据资源、技术竞赛、投稿截止日期和课程资源检索社区内容，快速进入对应详情页。

48 条结果

输入关键词或点击标签，按论文、数据资源、竞赛截止日期、征稿与课程缩小范围。标签：数据集

论文ICLR 2026 Poster2026 年clinical prediction

MRI 运动校正的可靠评测：数据集与洞见

ICLR 2026 Poster accepted paper at ICLR 2026. Correcting motion artifacts in scientific and medical imaging is important, as they significantly impact image quality. However, evaluating deep learning-based and classical motion correction methods remains fundamentally difficult due to the lack of accessible ground-truth target data. To address this challenge, we study three evaluation approaches: real-world evaluation based on reference scans, simulated motion, and reference-free evaluation, each with its merits and shortcomings. To enable evaluation with real-world motion artifacts, we release PMoC3D, a dataset consisting of unprocessed $\textbf{P}$aired $\textbf{Mo}$tion-$\textbf{C}$orrupted $\textbf{3D}$ brain MRI data.

医学影像计算 EHR 与临床预测论文 3D MRI motion correction Accelerated MRI Dataset 查看论文详情

论文ICLR 2026 Poster2026 年clinical prediction

MedAraBench：大规模阿拉伯语医学问答数据集与基准

ICLR 2026 Poster accepted paper at ICLR 2026. Arabic remains one of the most underrepresented languages in natural language processing research, particularly in medical applications, due to the limited availability of open-source data and benchmarks. The lack of resources hinders efforts to evaluate and advance the multilingual capabilities of Large Language Models (LLMs). In this paper, we introduce MedAraBench, a large-scale dataset consisting of Arabic multiple-choice question-answer pairs across various medical specialties. We constructed the dataset by manually digitizing a large repository of academic materials created by medical professionals in the Arabic-speaking region.

医学影像计算临床语言智能 EHR 与临床预测论文 Dataset Benchmark Large Language Models 查看论文详情

论文ICLR 2026 Poster2026 年medical LLM agent

AnesSuite：面向 LLM 麻醉学推理的综合基准与数据集套件

ICLR 2026 Poster accepted paper at ICLR 2026. The application of large language models (LLMs) in the medical field has garnered significant attention, yet their reasoning capabilities in more specialized domains like anesthesiology remain underexplored. To bridge this gap, we introduce AnesSuite, the first comprehensive dataset suite specifically designed for anesthesiology reasoning in LLMs. The suite features AnesBench, an evaluation benchmark tailored to assess anesthesiology-related reasoning across three levels: factual retrieval (System 1), hybrid reasoning (System 1.x), and complex decision-making (System 2). Alongside this benchmark, the suite includes three training datasets that provide an infrastructure for continued pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning with verifiable rewards (RLVR). Code/project link: https://github.com/MiliLab/AnesSuite

医学影像计算临床语言智能论文 Large language model Reasoning Anesthesiology 查看论文详情

论文ICLR 2026 Poster2026 年clinical NLP

用于胸部 X 光图像的结构化、标注式、定位化 VQA 数据集：含完整句答案与场景图

ICLR 2026 Poster accepted paper at ICLR 2026. Visual Question Answering (VQA) enables targeted and context-dependent analysis of medical images, such as chest X-rays (CXRs). However, existing VQA datasets for CXRs are typically constrained by simplistic and brief answer formats, lacking localization annotations (e.g., bounding boxes) and structured tags (e.g., region or radiological finding/disease tags). To address these limitations, we introduce MIMIC-Ext-CXR-QBA (abbr. CXR-QBA), a large-scale CXR VQA dataset derived from MIMIC-CXR, comprising 42 million QA-pairs with multi-granular, multi-part answers, detailed bounding boxes, and structured tags. Code/project link: https://github.com/philip-mueller/mimic-ext-cxr-qba/

医学影像计算医疗多模态临床语言智能论文 VQA Localization 查看论文详情

论文ICLR 2026 Poster2026 年trustworthy medical AI

超越医学考试：面向心理健康真实任务与模糊性的临床医生标注公平性数据集

ICLR 2026 Poster accepted paper at ICLR 2026. Current medical language model (LM) benchmarks often over-simplify the complexities of day-to-day clinical practice tasks and instead rely on evaluating LMs on multiple-choice board exam questions. In psychiatry especially, these challenges are worsened by fairness and bias issues, since models can be swayed by patient demographics even when those factors should not influence clinical decisions. Thus, we present an expert-created and annotated dataset spanning five critical domains of decision-making in mental healthcare: treatment, diagnosis, documentation, monitoring, and triage. This U.S. centric dataset — created without any LM assistance — is designed to capture the nuanced clinical reasoning and daily ambiguities mental health practitioners encounter, reflecting the inherent complexities of care delivery that are missing from existing datasets.

医学影像计算临床语言智能可信、安全、公平与隐私论文 AI for Healthcare mental health 查看论文详情

论文ICLR 2026 Poster2026 年clinical prediction

从病历到诊断对话：面向精神共病的临床扎根方法与数据集

ICLR 2026 Poster accepted paper at ICLR 2026. Psychiatric comorbidity is clinically significant yet challenging due to the complexity of multiple co-occurring disorders. To address this, we develop a novel approach integrating synthetic patient electronic medical record (EMR) construction and multi-agent diagnostic dialogue generation. We create 502 synthetic EMRs for common comorbid conditions using a pipeline that ensures clinical relevance and diversity. Our multi-agent framework transfers the clinical interview protocol into a hierarchical state machine and context tree, supporting over 130 diagnostic states while maintaining clinical standards.

医学影像计算临床语言智能 EHR 与临床预测论文 Psychiatric Comorbidity Diagnostic Dialogue 查看论文详情

论文ICLR 2026 Poster2026 年EHR 与临床预测

重用基础模型实现可泛化医学时间序列分类

FORMED 将通用时间序列基础模型重用于医学时间序列分类，并通过任务相关通道嵌入、标签查询和共享解码注意力层，在不同医学时间序列数据集上进行轻量适配。

EHR 与临床预测医疗 AI 论文会议论文查看论文详情

论文ICLR 2026 Poster2026 年医疗多模态

医学 MLLM 如何失效？医学图像视觉定位研究

系统研究医学 MLLM 在医学图像视觉定位中的失效模式，提出 VGMED 评估数据集与 VGRefine 推理时方法，面向医学视觉问答与医学图像解释场景。

医疗多模态医疗 AI 论文会议论文查看论文详情

论文ICLR 2026 Poster2026 年可信、安全、公平与隐私

超越医学考试：面向心理健康真实任务与模糊性的临床医生标注公平性数据集

ICLR 2026 Poster 论文提出 MENTAT：一个由临床专家创建和标注、面向心理健康真实任务与模糊性的公平性评测数据集，用于评估语言模型在临床决策任务中的表现与偏差。

可信、安全、公平与隐私医疗 AI 论文会议论文查看论文详情

数据资源critical care time-series variables and outcomesICU time-series benchmark datasetPhysioNet Challenge 2012 dataset; version 1.0.0开放访问

PhysioNet/CinC 2012 ICU 时间序列数据集

The PhysioNet/CinC Challenge 2012 dataset contains ICU time-series records used for mortality prediction and patient-specific outcome modeling. It remains a useful benchmark for clinical time-series modeling, missingness-aware learning, and early warning model development.

EHR 与临床预测数据集重症监护 time series mortality prediction PhysioNet Challenge 查看数据资源

数据资源Chinese community medical questions and answersChinese medical QA datasetUpdated cMedQA dataset; see official repository开放访问

cMedQA2：中文社区医学问答数据集

cMedQA2 is an updated Chinese community medical question answering dataset for question-answer matching and medical QA research. It is useful for training and evaluating Chinese medical retrieval, ranking, and answer selection models.

临床语言智能数据集 Chinese medical QA answer selection community QA medical_llm_agent 查看数据资源

数据资源abdominal CT and MRI with multi-organ annotationsabdominal multi-organ segmentation benchmarkAMOS 2022 challenge benchmark; see official Grand Challenge page申请访问

AMOS 腹部多器官分割基准

AMOS is an abdominal multi-organ segmentation benchmark with CT and MRI cases for evaluating versatile medical image segmentation models. It supports abdominal organ segmentation, modality-general segmentation, and benchmarking of robust 3D segmentation methods.

医学影像计算数据集 abdominal imaging multi-organ segmentation CT MRI 查看数据资源

数据资源retinal fundus photographs with glaucoma and structure annotationsophthalmology fundus image challenge datasetREFUGE challenge dataset; official splits described on Grand Challenge申请访问

REFUGE 视网膜眼底青光眼挑战数据集

REFUGE is a retinal fundus imaging challenge dataset for glaucoma assessment. It supports glaucoma classification, optic disc and cup segmentation, fovea localization, and fair comparison of ophthalmology AI methods on color fundus photographs.

医学影像计算数据集 ophthalmology fundus glaucoma 分割查看数据资源

数据资源chest radiographs with pneumonia/lung opacity annotationschest X-ray pneumonia detection challenge datasetRSNA 2018 AI image challenge dataset开放访问

RSNA 肺炎检测挑战数据集

The RSNA Pneumonia Detection Challenge dataset is a chest radiograph benchmark for detecting pneumonia-related lung opacities. It supports object detection, chest X-ray classification, localization, and radiology AI evaluation under a competition framework.

医学影像计算数据集 CXR pneumonia object detection RSNA 查看数据资源

数据资源upper extremity radiographs with abnormality labelsmusculoskeletal X-ray datasetLarge Stanford musculoskeletal radiograph dataset申请访问

MURA 肌骨 X 光数据集

MURA is a musculoskeletal radiograph dataset from Stanford for abnormality detection in upper extremity X-rays. It is used for radiology classification, fracture-related screening, musculoskeletal imaging AI, and human-AI comparison studies.

医学影像计算数据集 X-ray musculoskeletal abnormality detection Stanford AIMI 查看数据资源

数据资源cine cardiac MRI with segmentation labelscardiac MRI segmentation datasetACDC challenge dataset; see official database page申请访问

ACDC 自动心脏诊断挑战数据集

ACDC is a cardiac MRI dataset for automated cardiac diagnosis and segmentation. It supports left and right ventricular segmentation, myocardium segmentation, cardiac function quantification, and evaluation of robust cardiac image analysis methods.

医学影像计算数据集 cardiac MRI 分割 ventricle challenge 查看数据资源

数据资源MRI, DXA, ultrasound, retinal imaging, genetics, and health recordspopulation-scale multimodal imaging cohortPopulation-scale UK Biobank imaging cohort; application required申请访问

UK Biobank 影像数据

UK Biobank Imaging provides large-scale imaging phenotypes linked to genetic, lifestyle, and health outcome data. It is used for population-scale medical imaging AI, disease risk prediction, representation learning, multimodal biomedical modeling, and epidemiological AI studies.

医学影像计算 EHR 与临床预测医疗多模态数据集 population cohort imaging 查看数据资源

数据资源genomics, transcriptomics, clinical metadata, and pathology-related datacancer genomics and clinical datasetLarge multi-cancer TCGA program dataset开放访问

TCGA 癌症基因组数据集

The Cancer Genome Atlas is a large cancer genomics resource with molecular, clinical, and pathology-related data across many cancer types. It is a foundation dataset for oncology AI, survival prediction, subtype discovery, multimodal cancer modeling, and translational biomarker research.

EHR 与临床预测医疗多模态数据集 cancer genomics 肿瘤学 multi-omics 查看数据资源

数据资源brain MRI with demographic and clinical variablesbrain MRI and neuroimaging dataset collectionOASIS cross-sectional and longitudinal releases; see official site开放访问

OASIS 脑 MRI 与神经影像数据集

OASIS provides open-access neuroimaging datasets for studying normal aging, dementia, and brain structure. It is useful for brain MRI segmentation, age prediction, dementia classification, longitudinal modeling, and neuroimaging method benchmarking.

医学影像计算 EHR 与临床预测数据集 brain MRI dementia aging 查看数据资源

数据资源MRI, PET, biomarkers, clinical and cognitive assessmentslongitudinal neuroimaging and clinical datasetLongitudinal ADNI cohort data; access through ADNI/LONI申请访问

ADNI 阿尔茨海默病神经影像倡议数据集

ADNI provides longitudinal neuroimaging, biomarker, clinical, and cognitive data for Alzheimer disease research. It supports disease progression modeling, dementia diagnosis, multimodal prediction, biomarker discovery, and clinical translation studies.

医学影像计算 EHR 与临床预测医疗多模态数据集 Alzheimer disease neuroimaging 查看数据资源

数据资源cardiac ultrasound videos with functional annotationsechocardiography video datasetLarge echocardiography video dataset; see official site申请访问

EchoNet-Dynamic 心脏超声视频数据集

EchoNet-Dynamic is a cardiac ultrasound video dataset with expert annotations for left ventricular function. It is used for echocardiography video understanding, ejection fraction estimation, cardiac segmentation, and clinical video AI research.

医学影像计算数据集 echocardiography ultrasound video cardiology 查看数据资源

数据资源histopathology whole-slide imagesdigital pathology whole-slide image datasetCAMELYON17 challenge dataset; see Grand Challenge page申请访问

CAMELYON17 组织病理淋巴结转移数据集

CAMELYON17 is a digital pathology dataset for detecting breast cancer metastases in lymph node whole-slide images across multiple centers. It supports pathology classification, metastasis detection, weakly supervised learning, and domain generalization in histopathology AI.

医学影像计算数据集 pathology whole-slide imaging breast cancer domain generalization 查看数据资源

数据资源dermoscopic and clinical skin lesion imagesdermatology image archiveLarge public ISIC dermatology image archive开放访问

ISIC Archive 皮肤病学图像数据集

The ISIC Archive is a large public dermatology image repository for skin lesion analysis. It is widely used for melanoma classification, lesion segmentation, dermoscopic image retrieval, bias and domain shift analysis, and clinical imaging benchmark development.

医学影像计算数据集皮肤病学 skin lesion melanoma ISIC 查看数据资源

数据资源raw MRI k-space and reconstructed MRI dataMRI reconstruction datasetLarge raw MRI reconstruction dataset; see official site申请访问

fastMRI 原始 MRI 重建数据集

fastMRI is a raw MRI dataset for accelerated magnetic resonance image reconstruction, originally released by NYU Langone Health and Meta AI. It is used for MRI reconstruction, compressed sensing replacement, generative reconstruction, and robustness evaluation.

医学影像计算数据集 MRI 重建 k-space accelerated imaging 查看数据资源

数据资源2D and 3D biomedical imagesstandardized biomedical image benchmark12 2D datasets and 6 3D datasets in MedMNIST v2开放访问

MedMNIST v2 生物医学图像基准

MedMNIST v2 is a standardized collection of lightweight biomedical image classification datasets, including 2D and 3D tasks. It is useful for quick benchmarking, AutoML, foundation model sanity checks, and reproducible evaluation across multiple medical imaging domains.

医学影像计算数据集 MedMNIST 分类 benchmark 2D 3D imaging 查看数据资源

数据资源multimodal brain MRI with tumor annotationsbrain tumor MRI segmentation challenge datasetBraTS 2024 challenge dataset; see Synapse project申请访问

BraTS 2024 脑肿瘤分割挑战数据集

BraTS 2024 provides multimodal brain MRI data and expert annotations for brain tumor segmentation and related tumor subregion analysis. It is a major benchmark for glioma segmentation, radiology AI, and robust multimodal MRI segmentation methods.

医学影像计算数据集 brain MRI glioma 分割 BraTS 查看数据资源

数据资源abdominal CT with kidney and tumor annotationskidney tumor CT segmentation datasetTCIA C4KC-KiTS collection; see collection page开放访问

C4KC-KiTS 肾肿瘤分割集合

C4KC-KiTS is a TCIA imaging collection associated with kidney and kidney tumor segmentation benchmarks. It supports kidney segmentation, renal tumor segmentation, surgical planning research, and evaluation of abdominal CT segmentation models.

医学影像计算数据集 kidney tumor CT segmentation TCIA KiTS 查看数据资源

数据资源thoracic CT images with nodule annotationslung CT nodule datasetTCIA LIDC-IDRI collection开放访问

LIDC-IDRI 肺部 CT 结节数据集

LIDC-IDRI is a lung CT dataset with thoracic CT scans and expert nodule annotations. It is a classic benchmark for lung nodule detection, segmentation, malignancy characterization, radiomics, and computer-aided diagnosis research.

医学影像计算数据集 lung CT nodules 分割 TCIA 查看数据资源

数据资源chest radiographs with radiologist annotationschest X-ray detection and classification datasetVinDr-CXR release on PhysioNet; version 1.0.0开放访问

VinDr-CXR：越南胸部 X 光数据集

VinDr-CXR is a chest X-ray dataset with radiologist annotations from Vietnamese hospitals. It supports abnormality classification, lesion localization, radiology object detection, and robustness studies across clinical sites and populations.

医学影像计算数据集 CXR radiologist labels object detection Vietnam 查看数据资源

数据资源frontal chest radiographs with image-level labelschest X-ray classification datasetNIH public ChestX-ray14 release开放访问

NIH ChestX-ray14 数据集

NIH ChestX-ray14 is a public chest radiograph dataset with image-level labels for thoracic disease findings mined from reports. It is commonly used for chest X-ray classification, weak supervision, thoracic disease detection, and radiology benchmark comparisons.

医学影像计算数据集 CXR thoracic disease weak labels NIH 查看数据资源

数据资源chest radiographs with multi-label findingschest X-ray classification datasetLarge-scale Stanford chest X-ray dataset申请访问

CheXpert 胸部 X 光数据集

CheXpert is a large chest radiograph dataset from Stanford with uncertainty-aware labels for common chest X-ray findings. It is widely used for radiology classification, label uncertainty modeling, chest X-ray representation learning, and clinical imaging benchmarks.

医学影像计算数据集 CXR 放射影像分类 Stanford AIMI 查看数据资源

数据资源EEG and polysomnography biosignalssleep physiology signal datasetExpanded Sleep-EDF PhysioNet dataset; version 1.0.0开放访问

Sleep-EDF Expanded 多导睡眠图数据集

Sleep-EDF Expanded contains polysomnographic sleep recordings with EEG and related physiological signals. It is used for sleep stage classification, biosignal time-series modeling, self-supervised learning on physiological signals, and clinical sleep research benchmarks.

EHR 与临床预测数据集 sleep staging EEG biosignal PhysioNet 查看数据资源

数据资源12-lead ECG waveforms with diagnostic labelsECG waveform benchmarkLarge public ECG dataset; version 1.0.3开放访问

PTB-XL：大型开放 12 导联 ECG 数据集

PTB-XL is a large public 12-lead electrocardiography dataset with diagnostic statements and waveform records. It is a standard benchmark for ECG classification, cardiac abnormality detection, clinical signal representation learning, and robust evaluation of biosignal models.

EHR 与临床预测数据集 ECG 心电 cardiology biosignal 分类查看数据资源

数据资源structured critical care EHR tablesmulticenter ICU EHR datasetMulticenter ICU database; version 2.0申请访问

eICU 协作研究数据库

The eICU Collaborative Research Database is a multicenter critical care database containing deidentified ICU data from many hospitals. It is commonly used for external validation, ICU outcome prediction, temporal modeling, and cross-site generalization studies in clinical AI.

EHR 与临床预测数据集重症监护电子病历 multicenter external validation 查看数据资源

数据资源12-lead ECG waveforms and diagnostic metadataECG waveform datasetLarge-scale diagnostic ECG dataset; version 1.0申请访问

MIMIC-IV-ECG 诊断心电图数据集

MIMIC-IV-ECG is a large deidentified electrocardiogram dataset linked to the MIMIC-IV clinical data ecosystem. It supports ECG classification, arrhythmia detection, representation learning, and multimodal modeling with structured EHR context.

EHR 与临床预测数据集 ECG 心电 biosignal clinical prediction PhysioNet 查看数据资源

数据资源chest radiographs with radiology reportschest X-ray image-report datasetLarge-scale CXR image-report dataset; version 2.1.0申请访问

MIMIC-CXR v2.1.0 胸部 X 光数据集

MIMIC-CXR is a large deidentified chest radiograph dataset with associated free-text radiology reports. It is widely used for chest X-ray classification, report generation, image-text representation learning, radiology retrieval, and medical multimodal foundation model evaluation.

医学影像计算医疗多模态临床语言智能数据集 CXR radiology reports 查看数据资源

数据资源deidentified clinical free textclinical notes datasetClinical note extension for MIMIC-IV; version 2.2申请访问

MIMIC-IV-Note v2.2 临床笔记数据集

MIMIC-IV-Note provides deidentified clinical notes linked to MIMIC-IV hospital data. It supports clinical NLP tasks such as note representation learning, discharge summary modeling, information extraction, summarization, and multimodal EHR-text modeling.

临床语言智能 EHR 与临床预测数据集 clinical NLP notes summarization 查看数据资源

数据资源deidentified structured EHR tablescritical care EHR datasetLarge-scale hospital and ICU EHR dataset; version 3.1申请访问

MIMIC-IV v3.1 重症监护与住院 EHR 数据集

MIMIC-IV is a large deidentified electronic health record dataset from Beth Israel Deaconess Medical Center, covering hospital and ICU data for critical care research. It is a core benchmark source for clinical prediction, temporal EHR modeling, phenotyping, and healthcare AI method development.

EHR 与临床预测数据集电子病历重症监护 clinical prediction PhysioNet 查看数据资源

数据资源medical images with bilingual visual questions and answersmedical visual question answering datasetBilingual medical VQA dataset; see official project page开放访问

SLAKE：语义标注、知识增强医学 VQA 数据集

SLAKE is a semantically labeled medical visual question answering dataset with bilingual English-Chinese questions, medical images, and knowledge-enhanced annotations. It is useful for medical multimodal learning, image-grounded QA, and radiology VQA evaluation.

医学影像计算医疗多模态数据集 medical VQA bilingual dataset medical multimodal 查看数据资源

数据资源Chinese conversational medical QA textChinese medical conversational QA datasetLarge-scale Chinese medical CQA dataset; see official repository开放访问

CMCQA：中文医学会话问答数据集

CMCQA is a large Chinese medical conversational question-answering dataset released with knowledge-grounded medical dialogue research. It supports medical conversation QA, knowledge-grounded response generation, and evaluation of Chinese medical dialogue systems.

临床语言智能数据集 Chinese medical CQA knowledge-grounded dialogue medical chatbot medical_llm_agent 查看数据资源

数据资源Chinese medical instruction and dialogue textChinese medical instruction-tuning datasetAbout 140K medical SFT examples; see Hugging Face card开放访问

HuatuoGPT2-SFT-GPT4-140K 医学指令数据集

HuatuoGPT2-SFT-GPT4-140K is a Chinese medical supervised fine-tuning dataset containing medical instruction-style conversations and GPT-4-assisted responses. It is useful for Chinese medical assistant alignment and medical LLM instruction tuning.

临床语言智能数据集 medical SFT Chinese medical LLM instruction tuning medical_llm_agent 查看数据资源

数据资源Chinese medical question-answer textChinese medical QA corpusAbout 26 million medical QA pairs开放访问

Huatuo-26M：大规模中文医学问答数据集

Huatuo-26M is a large-scale Chinese medical question-answering dataset with about 26 million QA pairs collected for medical language modeling and medical dialogue research. It is suitable for Chinese medical LLM pretraining, fine-tuning, and QA system development.

临床语言智能数据集 Chinese medical QA large-scale corpus LLM training medical_llm_agent 查看数据资源

数据资源medical exam question-answer textmedical exam QA benchmarkUSMLE, Mainland China, and Taiwan exam-style QA splits; see repository开放访问

MedQA：含美国、中国大陆与台湾拆分的医学考试问答数据集

MedQA is a medical examination question answering benchmark with English and Chinese medical licensing-style question sets, including mainland China and Taiwan variants. It is widely used for medical QA and medical reasoning evaluation.

临床语言智能数据集 medical QA Chinese exam QA USMLE LLM evaluation 查看数据资源

数据资源Chinese consultation dialogue text with medical entity annotationsChinese medical dialogue generation datasetEntity-annotated dialogue dataset; see official repository开放访问

MedDG：实体中心中文医学对话生成数据集

MedDG is an entity-centric Chinese medical consultation dataset with domain entity annotations for medical dialogue generation. It supports entity-aware response generation, medical consultation modeling, and dialogue systems that ground responses in clinical concepts.

临床语言智能数据集 Chinese medical dialogue entity annotation generation medical_llm_agent 查看数据资源

数据资源Chinese medical exam and QA textChinese medical LLM evaluation benchmarkMultiple Chinese medical exam and benchmark splits; see Hugging Face card开放访问

CMB：中文医学基准

CMB is a comprehensive Chinese medical benchmark for evaluating medical large language models on medical exams, reasoning, and clinical knowledge questions. It is suited for Chinese medical QA, LLM evaluation, and instruction-following assessment.

临床语言智能数据集 Chinese medical benchmark medical LLM exam QA medical_llm_agent 查看数据资源

数据资源Chinese biomedical and clinical textChinese biomedical NLP benchmark8 biomedical NLU tasks; see official repository开放访问

CBLUE：中文生物医学语言理解评测基准

CBLUE is a Chinese biomedical language understanding benchmark covering real-world biomedical NLP tasks such as named entity recognition, relation extraction, term normalization, clinical trial classification, sentence similarity, and medical question answering. It is useful for evaluating Chinese clinical NLP models and medical language models.

临床语言智能数据集 Chinese medical NLP benchmark information extraction QA 查看数据资源

数据资源CT癌症影像TCIA collection申请访问

National Lung Screening Trial 数据集合

Low-dose CT imaging collection from the National Lung Screening Trial, distributed by The Cancer Imaging Archive.

CT 肺癌 TCIA 查看数据资源

数据资源胸部 X 光放射影像112,120 frontal-view X-ray images开放访问

NIH ChestX-ray14 数据集

NIH Clinical Center chest X-ray dataset released for computer-aided detection and radiology machine learning research.

放射影像 NIH 胸部 X 光查看数据资源