| Database Name/Link | Data Description | Category | Subcategory |
| --- | --- | --- | --- |
| American Community Survey (ACS) | The American Community Survey (ACS) helps local officials, community leaders, and businesses understand the changes taking place in their communities. It is the premier source for detailed population and housing information about our nation. | General Research Repositories | Population Trends |
| American Gut Project | Crowdsourced study of the human microbiome. | Immunology | Microbiome |
| An Open-Source Dataset on Dietary Behaviors and DASH Eating Plan Optimization Constraints | A dataset based on dietary behaviors, demographics, and pre-existing conditions, suitable for input to linear optimization models. | Public Health & Epidemiology | Diet and Health |
| Area Deprivation Index (ADI) | Neighborhood-level measure of socioeconomic disadvantage linked to health outcomes. | Public Health & Epidemiology | Health Disparities |
| Area Deprivation Index Datasets | Measures socio-economic disadvantage at the neighborhood level, widely used in health disparities research. | Public Health & Epidemiology | Health Disparities |
| ArrayExpress | A repository for functional genomics experiments. | Genomics & Multi-Omics | Human Genomics |
| Atlas of Genetics and Cytogenetics in Oncology and Haematology | An online journal and database covering chromosomes, genes, and cancers, integrating various types of knowledge in a single resource. | Genomics & Multi-Omics | Cancer |
| Australian Antarctic Data Centre (AADC) | A repository for Antarctic research data. | General Research Repositories | Large Dataset Distribution |
| Awesome Healthcare Datasets | Curated list of healthcare datasets for ML in clinical data, imaging, and genomics. | Clinical & Cohort Data | Large Dataset Distribution |
| BRENDA (The Comprehensive Enzyme Information System) | The comprehensive enzyme information system, providing data on enzyme functions, structures, and properties, supporting research in metabolism, drug development, and biotechnology. | Biological Data | Enzyme Function |
| Behavioral Risk Factor Surveillance System (BRFSS) | State-based surveillance system tracking chronic diseases and health behaviors. | Public Health & Epidemiology | Large Dataset Distribution |
| Berkeley Single Cell Computational Microscopy (BSCCM) Dataset | Contains over 12 million images of individual white blood cells, captured with multiple illumination patterns on an LED array microscope, aimed at advancing computational microscopy and computer vision applications. | Immunology | White Blood Cells |
| Bgee | Offers gene expression data across species and conditions, aiding AI-driven research in developmental biology and disease modeling. | Genomics & Multi-Omics | Human Genomics |
| BigBrain Atlas | Ultra-high-resolution human brain atlas for neuroscience research. | Neuro Data | Brain Atlas |
| BigQuery Public Datasets (Healthcare & Life Sciences) | Google Cloud's BigQuery provides large-scale, AI-ready datasets for healthcare and clinical trial analytics. | Clinical & Cohort Data | Multidisciplinary Data |

 

© 2025 by Center of Excellence – Consortium of Educators.