TextMedical LLM benchmark and leaderboard intended to broaden coverage beyond single medical QA datasets.
查看数据集
TextMedical LLM benchmark and leaderboard intended to broaden coverage beyond single medical QA datasets.
查看数据集
TextBenchmark for evaluating health AI model safety, helpfulness, and clinical-relevance judgments with physician-reviewed rubrics.
查看数据集
Text and medical imagesGoogle Health AI Developer Foundations open model resources for medical text and medical image understanding, including MedGemma 1.5 resources.