top of page
Database Name/Link
Data Description
Category
Subcategory
1000 Genomes Project
Deep catalog of human genetic variation.
Genomics & Multi-Omics
Human Genomics
ADNI (Alzheimer Disease Neuroimaging Initiative)
Neuroimaging and biomarker database with Parkinson AD sub-studies.
Disease-Specific Data
Alzheimers Disease
AGRIS (FAO International System for Agricultural Science and Technology)
A database providing information on food systems, agriculture, and related nutritional outcomes.
Agriculture & Food Systems
Food Systems
AHRQ's SDOH Data
Provides details across five key SDOH domains: Social context, such as age, race/ethnicity, veteran status. Economic context, such as income, unemployment rate. Education Physical infrastructure, such as housing, crime, transportation. Healthcare context, such as health insurance.
Public Health & Epidemiology
Health Disparities
AIMI Dataset Index:
Managed by the Stanford Center for Artificial Intelligence in Medicine and Imaging, this repository features curated and annotated clinical imaging data across various modalities, including echocardiograms, brain CT scans, MRIs, radiographs, and ultrasounds.
Imaging & AI Datasets
REVIEW
AMP PD (Parkinson's Disease)
Public-private partnership providing large-scale Parkinson disease omics data.
Disease-Specific Data
Parkinsons Disease
AMP-AD (Alzheimer Disease)
Precision medicine partnership integrating multi-omics Alzheimer data.
Disease-Specific Data
Alzheimers Disease
AYUSH Research Portal
A governmental database offering information on research in Ayurveda, Yoga & Naturopathy, Unani, Siddha, and Homeopathy, including details on medicinal plants and their therapeutic uses.
Traditional & Integrative Medicine
Ayurveda / AYUSH
Academic Torrents
A distributed system for sharing enormous datasets, including those used in machine learning research
General Research Repositories
Large Dataset Distribution
All of Us Research Program
A national project collecting diverse health and genetic data.
Clinical & Cohort Data
REVIEW
Allen Brain Atlas
A comprehensive mapping of gene expression in the human and mouse brain.
Neuro Data
Brain Gene Expression
AlphaFold Protein Structure Database
Provides AI-predicted 3D structures of proteins, facilitating advancements in drug discovery and synthetic biology.
Genomics & Multi-Omics
Proteomics
Alzheimer's Disease Neuroimaging Initiative (ADNI)
Longitudinal study tracking Alzheimer's disease progression and risk factors.
Disease-Specific Data
Alzheimers Disease
Alzheimer's Disease Sequencing Project (ADSP)
NIH initiative sequencing genomes of Alzheimer? patients and controls.
Disease-Specific Data
Alzheimers Disease
America's Health Rankings
Provides comprehensive data on various health measures, including behaviors, community and environment, policy, clinical care, and outcomes, offering insights into how dietary habits influence health across different populations.
Public Health & Epidemiology
Nutrition Data
American Community Survey (ACS)
The American Community Survey (ACS) helps local officials, community leaders, and businesses understand the changes taking place in their communities. It is the premier source for detailed population and housing information about our nation.
Public Health & Epidemiology
Population Health
American Gut Project
Crowdsourced study of the human microbiome.
Imaging & AI Datasets
Multimodal Imaging
An Open-Source Dataset on Dietary Behaviors and DASH Eating Plan Optimization Constraints
A dataset based on dietary behaviors, demographics, and pre-existing conditions, suitable for input to linear optimization models.
Public Health & Epidemiology
Nutrition Data
Area Deprivation Index (ADI)
Neighborhood-level measure of socioeconomic disadvantage linked to health outcomes.
Public Health & Epidemiology
Health Disparities
Area Deprivation Index Datasets
Measures socio-economic disadvantage at the neighborhood level, widely used in health disparities research.
Public Health & Epidemiology
Health Disparities
ArrayExpress
A repository for functional genomics experiments.
Imaging & AI Datasets
Multimodal Imaging
Atlas of Genetics and Cytogenetics in Oncology and Haematology
An online journal and database covering chromosomes, genes, and cancers, integrating various types of knowledge in a single resource.
Disease-Specific Data
Cancer
Australian Antarctic Data Centre (AADC)
A repository for Antarctic research data.
Imaging & AI Datasets
Multimodal Imaging
Awesome Healthcare Datasets
Curated list of healthcare datasets for ML in clinical data, imaging, and genomics.
Imaging & AI Datasets
Multimodal Imaging
BRENDA (The Comprehensive Enzyme Information System)
The comprehensive enzyme information system, providing data on enzyme functions, structures, and properties, supporting research in metabolism, drug development, and biotechnology.
Imaging & AI Datasets
Multimodal Imaging
Behavioral Risk Factor Surveillance System (BRFSS)
State-based surveillance system tracking chronic diseases and health behaviors.
Public Health & Epidemiology
Population Health
Berkeley Single Cell Computational Microscopy (BSCCM) Dataset
Contains over 12 million images of individual white blood cells, captured with multiple illumination patterns on an LED array microscope, aimed at advancing computational microscopy and computer vision applications.
REVIEW
REVIEW
Bgee
Offers gene expression data across species and conditions, aiding AI-driven research in developmental biology and disease modeling.
Genomics & Multi-Omics
Transcriptomics
BigBrain Atlas
Ultra-high-resolution human brain atlas for neuroscience research.
Neuro Data
Brain Atlas
BigQuery Public Datasets (Healthcare & Life Sciences)
Google Cloud's BigQuery provides large-scale, AI-ready datasets for healthcare and clinical trial analytics.
General Research Repositories
Multidisciplinary Data
BindingDB
A public, web-accessible database of measured binding affinities, focusing on the interactions of proteins considered to be drug-targets with small, drug-like molecules.
Imaging & AI Datasets
Multimodal Imaging
BioCyc Database Collection
A collection of Pathway/Genome Databases (PGDBs) that provide reference to genome and metabolic pathway information for thousands of organisms, supporting multiomics analyses.
Imaging & AI Datasets
Multimodal Imaging
BioGRID
A biomedical interaction repository with data compiled through comprehensive curation efforts, encompassing protein-protein interactions, genetic interactions, chemical interactions, and post-translational modifications across multiple species.
Imaging & AI Datasets
Multimodal Imaging
BioStudies
A repository for biological study data.
General Research Repositories
Multidisciplinary Data
Biological General Repository for Interaction Datasets (BioGRID)
A protein-protein interaction database.
Imaging & AI Datasets
Multimodal Imaging
BiomedCLIP Dataset
A multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs, supporting various biomedical imaging tasks and applications.
Imaging & AI Datasets
Multimodal Imaging
Black Women's Health Study (BWHS)
A long-term observational study initiated in 1995, following 59,000 Black women to investigate health issues, including maternal health disparities, with the goal of improving health outcomes.
Public Health & Epidemiology
Health Disparities
Brain Genomics Superstruct Project (GSP)
Brain imaging and cognitive data for genetic and neuroscience research.
Imaging & AI Datasets
Multimodal Imaging
CARDIA (Coronary Artery Risk Development in Young Adults) - Obesity Data
Study tracking obesity and heart disease risk from young adulthood.
Clinical & Cohort Data
Longitudinal Cohort
CARDIOGRAMplusC4D Consortium
Large-scale genetic database of cardiovascular diseases.
Clinical & Cohort Data
Longitudinal Cohort
CDC Environmental Health Tracking Network (EPHT)
National environmental public health tracking program providing health and exposure data.
REVIEW
REVIEW
CDC Social Determinants of Health Database
Links between social factors (income, housing, transport) and disease.
Imaging & AI Datasets
Multimodal Imaging
CDC Social Vulnerability Index (SVI)
Measures community vulnerability to disasters and pandemics using 15 U.S. Census social factors (poverty, housing, minority status, access to transportation, etc.).
Public Health & Epidemiology
Population Health
CDC's Behavioral Risk Factor Surveillance System (BRFSS)
Large dataset for public health and disease risk modeling.
Public Health & Epidemiology
Population Health
CFDE DATA PORTAL Search Common Fund Programs' Metadata and Processed Datasets.
CFDE DATA PORTAL Search Common Fund Programs' Metadata and Processed Datasets.
REVIEW
REVIEW
CKAN
An open-source data portal platform used by governments and organizations to manage and publish collections of data, powering numerous data portals worldwide
Imaging & AI Datasets
Multimodal Imaging
CLSA (Canadian Longitudinal Study on Aging)
Tracks 50,000+ Canadians over 20 years to analyze aging, genetics, lifestyle, and environmental factors.
Imaging & AI Datasets
Multimodal Imaging
CMS Medicare & Medicaid Data
Includes Medicare and Medicaid claims data, utilization, and provider information.
REVIEW
REVIEW
COSMIC (Catalogue of Somatic Mutations in Cancer)
Catalog of somatic mutations in cancer from sequencing studies.
Disease-Specific Data
Cancer
COVID-19 Community Vulnerability Index (CCVI)
Social and healthcare factors contributing to COVID-19 disparities.
Imaging & AI Datasets
Multimodal Imaging
CRISPR Screen Data from Broad Institute
Gene knockout & perturbation datasets for understanding disease pathways and drug discovery.
Systems Biology & Modeling
Pathway Databases
CXR8 Chest X-ray Dataset
112,000+ labeled chest X-ray images with 14 different pathologies, ideal for AI-based disease detection models.
Imaging & AI Datasets
Multimodal Imaging
CalorieKing Food Database
A trusted food database offering nutrition facts for favorite brands and fast-food restaurants, along with tools like a free online calorie counter to assist with dietary tracking.
Public Health & Epidemiology
Nutrition Data
CalorieNinjas - Nutrition Facts and Recipe API
Provides an easy-to-use nutrition facts and recipe API, offering nutritional information for a vast array of foods, including fast-food items, to support dietary tracking and analysis.
Public Health & Epidemiology
Nutrition Data
Cambridge Centre for Ageing and Neuroscience (Cam-CAN)
Brain imaging dataset for aging and cognitive neuroscience research.
Imaging & AI Datasets
Multimodal Imaging
Canadian Longitudinal Study on Aging (CLSA)
A national, longitudinal study following approximately 50,000 men and women aged 45 to 85 at recruitment for at least 20 years to collect information on aging.
Imaging & AI Datasets
Multimodal Imaging
Cancer Cell Line Encyclopedia (CCLE)
Large-scale database of cancer cell lines for drug response and genomic profiling.
Disease-Specific Data
Cancer
Cancer Dependency Map (DepMap)
Systematic mapping of cancer cell dependencies and drug targets.
Disease-Specific Data
Cancer
Cancer Imaging Archive (TCIA)
A massive public repository of medical imaging data for training AI-driven cancer detection models.
Disease-Specific Data
Cancer
Centers for Medicare & Medicaid Services (CMS)
Healthcare utilization data, including disparities in service use among Medicaid and Medicare populations
REVIEW
REVIEW
ChEMBL
A manually curated chemical database of bioactive molecules with drug-like properties, maintained by the European Bioinformatics Institute (EBI), providing information on compound bioactivity data against drug targets.
Imaging & AI Datasets
Multimodal Imaging
ChEMBL
Manually curated database of bioactive molecules with drug-like properties.
Imaging & AI Datasets
Multimodal Imaging
Chan Zuckerberg Biohub Cell Atlas
Comprehensive cell atlas mapping human cells to understand disease biology.
REVIEW
REVIEW
ChemSpider
A free chemical structure database providing fast access to over 100 million structures, properties, and associated information.
Imaging & AI Datasets
Multimodal Imaging
Chinese Health and Retirement Longitudinal Study (CHARLS)
A nationally representative longitudinal study of Chinese residents aged 45 and older, collecting a wide range of information on their health and economic status.
Imaging & AI Datasets
Multimodal Imaging
ClinVar
A freely available resource on clinically relevant genetic variants.
REVIEW
REVIEW
Clinical Cohort at BioLINCC
Clinical and genetic disease datasets hosted by NHLBI BioLINCC.
REVIEW
REVIEW
Clinical Pharmacogenetics Implementation Consortium (CPIC) Database
A curated pharmacogenomic database providing guidelines for drug-gene interactions in personalized medicine.
Imaging & AI Datasets
Multimodal Imaging
ClinicalTrials.gov
A registry of publicly and privately funded clinical trials worldwide.
REVIEW
REVIEW
ClinicalTrials.gov Diabetes Studies
Searchable database of diabetes-focused clinical trials.
Disease-Specific Data
Diabetes
Collaborative Drug Discovery (CDD) Vault
A web-based database solution for managing drug discovery data, focusing on small molecules and associated bio-assay data, facilitating collaboration among research teams.
REVIEW
REVIEW
Common Crawl
A nonprofit organization that crawls the web and freely provides its archives and datasets to the public, consisting of petabytes of data collected since 2008
Imaging & AI Datasets
Multimodal Imaging
Comparative Toxicogenomics Database (CTD)
A publicly available resource that curates scientific data describing relationships between chemicals, genes, and diseases, including information on environmental exposures linked to neurodegenerative diseases like Alzheimer's and Parkinson's.
Disease-Specific Data
Parkinsons Disease
ConnectomeDB
Repository of structural and functional connectivity MRI studies.
Imaging & AI Datasets
Multimodal Imaging
County Health Rankings & Roadmaps
Database ranking health disparities and social determinants at the county level.
Public Health & Epidemiology
Health Disparities
DGV (Database of Genomic Variants)
A catalog of genomic structural variations in humans.
Imaging & AI Datasets
Multimodal Imaging
DIAAS Dataset (Digestible Indispensable Amino Acid Score)
Global reference database for protein quality assessment based on amino acid digestibility.
Public Health & Epidemiology
Nutrition Data
DNA DataBank of Japan (DDBJ)
A DNA sequence repository.
General Research Repositories
Multidisciplinary Data
Data Commons
An open-source platform created by Google that provides an open knowledge graph, combining economic, scientific, and other public datasets into a unified view
REVIEW
REVIEW
Data Sharing for Demographic Research (DSDR)
Advances research on maternal and child health by making demographic data discoverable and accessible for secondary analysis, adhering to FAIR principles.
REVIEW
REVIEW
Data for Global Health Equity Repository
Provides over 70 datasets on SDOH across multiple countries, aiming to inform policies and programs to reduce health inequities.
Public Health & Epidemiology
Health Disparities
Data.gov
The U.S. government's open data site, providing access to datasets from various federal agencies.
REVIEW
REVIEW
DataONE
A network of interoperable data repositories facilitating data sharing, discovery, and open science, particularly in the Earth and environmental sciences
REVIEW
REVIEW
Database of Genomic Variants Archive (DGVa)
A repository for genomic structural variation.
Imaging & AI Datasets
Multimodal Imaging
Database of Interacting Proteins (DIP)
A database of experimentally validated protein interactions.
Imaging & AI Datasets
Multimodal Imaging
DeepChem
An open-source toolkit integrating deep learning with chemistry, providing datasets and models to accelerate drug discovery using AI.
REVIEW
REVIEW
DeepChem AI for Drug Discovery
Open-source AI/ML models and datasets for automated drug discovery and predictive modeling.
Imaging & AI Datasets
Multimodal Imaging
Demographic and Health Surveys (DHS)
Data on maternal/child health, nutrition, and infectious diseases in developing countries.
Public Health & Epidemiology
Nutrition Data
Diabetes Genes Database (T2D-Genes)
Genetic database linking Type 2 diabetes to genetic variations.
Disease-Specific Data
Diabetes
Diabetes Prevention Program (DPP)
Longitudinal study tracking the effectiveness of diabetes prevention programs.
Disease-Specific Data
Diabetes
Dietary Supplement Ingredient Database (DSID)
Developed by the U.S. Department of Agriculture in collaboration with the National Institutes of Health, the DSID provides estimated levels of ingredients in dietary supplement products sold in the United States, aiding in the assessment of nutrient intake from supplements.
Public Health & Epidemiology
Nutrition Data
DisGeNET
A discovery platform integrating information on gene-disease associations, aiding in the exploration of genetic and environmental factors contributing to chronic diseases.
Imaging & AI Datasets
Multimodal Imaging
Dr. Duke's Phytochemical and Ethnobotanical Databases
Developed by Dr. James A. Duke at the USDA, this database provides detailed information on the phytochemical constituents of plants, their ethnobotanical uses, and associated biological activities. It serves as a valuable resource for exploring the chemical compounds in plants and their traditional medicinal applications.
Imaging & AI Datasets
Multimodal Imaging
Drug-Induced Liver Injury Network (DILIN)
Research network focused on drug-induced liver injury.
REVIEW
REVIEW
DrugBank
Comprehensive resource for in silico drug discovery and exploration.
REVIEW
REVIEW
DrugBank
A unique bioinformatics and cheminformatics resource that combines detailed drug data with comprehensive drug target information, supporting pharmaceutical research and drug development.
REVIEW
REVIEW
Dryad
An international open-access repository of research data, particularly data underlying scientific and medical publications, making data discoverable, freely reusable, and citable.
General Research Repositories
Multidisciplinary Data
Dryad Digital Repository
An open data repository for scientific and medical research data.
General Research Repositories
Multidisciplinary Data
ECHO Normal Database
Normal echocardiography database for reference values.
Clinical & Cohort Data
Longitudinal Cohort
ECHO-NET Dynamic
Deep learning database for echocardiography analysis.
Clinical & Cohort Data
Longitudinal Cohort
EMBL Nucleotide Sequence Database (ENA)
A nucleotide sequence database.
REVIEW
REVIEW
EMory BrEast imaging Dataset (EMBED)
A racially diverse dataset of 3.5 million screening and diagnostic mammograms from 116,000 women, including annotated lesions linked to imaging descriptors and pathologic outcomes.
Imaging & AI Datasets
Multimodal Imaging
ENCODE
Encyclopedia of DNA elements, providing functional genomic data.
Imaging & AI Datasets
Multimodal Imaging
ENCODE (Encyclopedia of DNA Elements)
A public research project that aims to build a comprehensive parts list of functional elements in the human genome, providing data on genome-wide mapping of regulatory elements, transcription factor binding sites, histone modifications, chromatin accessibility, and RNA transcripts.
Imaging & AI Datasets
Multimodal Imaging
EPA Integrated Risk Information System (IRIS)
A database of risk assessments for environmental substances, including their immunotoxicological effects.
Imaging & AI Datasets
Multimodal Imaging
Edamam Food Database API
Offers a food database and nutrition data API, providing detailed nutritional information for various foods, including fast-food items, to support health and wellness applications.
Public Health & Epidemiology
Nutrition Data
Electronic Medical Records and Genomics (eMERGE) Network
Genomic and electronic health record integration for cardiovascular studies.
Clinical & Cohort Data
Longitudinal Cohort
English Longitudinal Study of Ageing (ELSA)
A multidisciplinary study collecting data on the health, social, wellbeing, and economic aspects of aging in England.
Imaging & AI Datasets
Multimodal Imaging
English Longitudinal Study of Ageing (ELSA)
A longitudinal study collecting multidisciplinary data from a representative sample of the English population aged 50+, focusing on aging, health trajectories, and socioeconomic factors.
Imaging & AI Datasets
Multimodal Imaging
Ensembl Genome Browser
Genome browser for exploring annotated genes and variants.
REVIEW
REVIEW
Ensembl Parkinson Disease Genomics
Genomic database integrating Parkinson? disease-associated mutations.
Disease-Specific Data
Parkinsons Disease
Environmental Data Initiative Repository
A repository for environmental research data.
General Research Repositories
Multidisciplinary Data
Environmental Genome Project (EGP)
Focuses on understanding the impact of environmental exposures on human disease by studying genetic susceptibility, including genes involved in immune responses.
Imaging & AI Datasets
Multimodal Imaging
Environmental Influences on Child Health Outcomes (ECHO) Program
ECHO investigates how environmental exposures in early development?from conception through early childhood?influence child health outcomes, including pregnancy outcomes.
Imaging & AI Datasets
Ultrasound
Environmental Justice Screening and Mapping Tool (EJSCREEN)
EPA tool mapping environmental and social justice health disparities.
Public Health & Epidemiology
Health Disparities
Environmental Public Health Tracking Network (EPHTN)
A system by the CDC providing data on environmental exposures and health outcomes, facilitating research on environmental factors in chronic disease prevention.
Imaging & AI Datasets
Multimodal Imaging
Environmental Risk Factors for Alzheimer's and Parkinson's Diseases Database
A database compiling information on environmental risk factors, such as air pollution and pesticide exposure, associated with the incidence and progression of Alzheimer's and Parkinson's diseases, supporting research into environmental determinants of these neurodegenerative conditions.
Disease-Specific Data
Parkinsons Disease
Eukaryotic Pathogen Database Resources (EuPathDB)
A genomic database for eukaryotic pathogens.
REVIEW
REVIEW
European Genome-Phenome Archive (EGA)
A European repository for genotype and phenotype data.
General Research Repositories
Multidisciplinary Data
European Longitudinal Study of Pregnancy and Childhood (ELSPAC)
ELSPAC is a longitudinal study that investigates the health and development of children in relation to environmental factors during pregnancy and early childhood across several European countries.
Imaging & AI Datasets
Multimodal Imaging
European Prospective Investigation into Cancer and Nutrition (EPIC)
A large cohort study investigating the relationships between diet, nutritional status, lifestyle, environmental factors, and the incidence of chronic diseases.
Disease-Specific Data
Cancer
European Union Open Data Portal
European Union institutions and bodies.
REVIEW
REVIEW
ExRNA Atlas for Parkinson? Disease
Atlas of extracellular RNA biomarkers for neurodegenerative diseases.
Disease-Specific Data
Parkinsons Disease
Exposome Explorer
A database of biomarkers related to environmental exposures and their potential immunological impacts.
Imaging & AI Datasets
Multimodal Imaging
FDA Adverse Event Reporting System (FAERS)
Database containing information on adverse event and medication error reports submitted to the FDA.
REVIEW
REVIEW
FDA OTC Database
The FDA Over-the-Counter (OTC) Database provides information on approved OTC drugs, including active ingredients, formulations, labeling, and regulatory status.
Imaging & AI Datasets
Multimodal Imaging
FDA Open Data
A collection of publicly available datasets from the U.S. FDA.
Imaging & AI Datasets
Multimodal Imaging
FRAILOMIC (Frailty & Aging Biomarkers)
A multi-omics dataset designed to predict frailty, cognitive decline, and healthy aging biomarkers.
Imaging & AI Datasets
Multimodal Imaging
Fast Food Nutrition Dataset - Kaggle
A comprehensive dataset providing nutritional information for various fast food products from popular chains, including calorie counts, macronutrients, and micronutrients.
Public Health & Epidemiology
Nutrition Data
Fast Food Nutrition Facts
Provides nutritional facts, Weight Watchers points, allergens, and ingredients for menu items from various fast food restaurants, allowing users to make informed dietary choices.
Public Health & Epidemiology
Nutrition Data
Fast Food Nutritional Database - GitHub
An ETL project compiling nutritional information from several national U.S. fast-food chains into a relational database, facilitating analysis and comparison of nutritional content across different restaurants.
Public Health & Epidemiology
Nutrition Data
FatSecret Platform API
Provides access to a vast dataset of global food nutrition information, including data from fast-food franchises, supporting applications in meal planning and dietary analysis.
Public Health & Epidemiology
Nutrition Data
Figshare
An open-access repository where researchers can preserve and share their research outputs, including datasets, images, and videos
General Research Repositories
Multidisciplinary Data
FinnGen Diabetes Data
Finnish biobank with genetic and clinical diabetes data.
Disease-Specific Data
Diabetes
FinnGen Parkinson? Disease Data
Finnish biobank study linking Parkinson? to genetic and clinical data.
Disease-Specific Data
Parkinsons Disease
Firebrowse (Broad Institute TCGA Data Access)
TCGA dataset access tool with curated genomic and clinical data.
REVIEW
REVIEW
Florida Alzheimer Disease Research Center (ADRC)
The National Alzheimer�s Coordinating Center (NACC) functions as the centralized data repository, and collaboration and communication hub, for the National Institute on Aging�s (NIA) Alzheimer�s Disease Research Centers (ADRC) Program.
Disease-Specific Data
Alzheimers Disease
Florida Behavioral Risk Factor Surveillance System (BRFSS)
Health risk behavior data from Florida's adult population.
Public Health & Epidemiology
Population Health
Florida Birth Defects Registry (FBDR)
Registry tracking birth defects and congenital anomalies in Florida.
Imaging & AI Datasets
Multimodal Imaging
Florida CHARTS (Community Health Assessment Resource Tool Set)
Statewide public health data including mortality, disease, and demographics.
REVIEW
REVIEW
Florida COVID-19 Data and Surveillance
COVID-19 surveillance and case reporting in Florida.
REVIEW
REVIEW
Florida Cancer Data System (FCDS)
Statewide cancer registry collecting incidence and survival data.
Disease-Specific Data
Cancer
Florida Department of Health - Public Health Statistics
Comprehensive health statistics and disease surveillance for Florida.
REVIEW
REVIEW
Florida Environmental Public Health Tracking
Tracking environmental factors and their effects on public health in Florida.
Imaging & AI Datasets
Multimodal Imaging
Florida HIV/AIDS Surveillance Program
HIV/AIDS case surveillance and epidemiological tracking in Florida.
REVIEW
REVIEW
Florida Health Data Warehouse
Centralized warehouse for Florida public health datasets.
REVIEW
REVIEW
Florida Health Equity and Disparities Data
Health equity data tracking disparities among Florida communities.
REVIEW
REVIEW
Florida Injury Surveillance System (FL-ISS)
Statewide injury surveillance and prevention data.
REVIEW
REVIEW
Florida Medicaid Data
Medicaid health data for policy and healthcare analysis.
REVIEW
REVIEW
Florida Prescription Drug Monitoring Program (PDMP)
Prescription drug monitoring program to reduce opioid misuse.
REVIEW
REVIEW
Florida Rural Health Research Data
Health data focused on rural populations in Florida.
REVIEW
REVIEW
Florida Trauma Registry
Statewide trauma registry collecting injury and emergency response data.
Imaging & AI Datasets
Multimodal Imaging
Florida Vital Statistics
Florida birth, death, and marriage records for health research.
REVIEW
REVIEW
FlyBase
A database for Drosophila genetics and research.
REVIEW
REVIEW
FooDB
A database containing detailed compositional data on food and its metabolites.
REVIEW
REVIEW
FooDB
The world's largest resource on food constituents, chemistry, and biology, providing detailed information about chemical compounds found in food.
REVIEW
REVIEW
FooDB (The Food Database)
A comprehensive resource detailing the chemical composition of foods, including information on macronutrients and micronutrients such as vitamins and minerals, as well as their known health effects.
Imaging & AI Datasets
Multimodal Imaging
Food Access Research Atlas (USDA)
Provides data on food deserts and food insecurity by census tract.
Public Health & Epidemiology
Population Health
Food Frequency Questionnaire (FFQ) Data
FFQs are standardized questionnaires used to assess habitual dietary intake over a specified period, providing valuable data for studying associations between dietary patterns and health outcomes.
Disease-Specific Data
Cancer
Food Metabolome Repository
A repository for food metabolome data obtained using liquid chromatography-mass spectrometry (LC-MS).
Imaging & AI Datasets
Multimodal Imaging
Food and Microbiome Longitudinal Investigation
A dataset from NYU researchers focusing on the longitudinal study of diet and its impact on the human microbiome.
Public Health & Epidemiology
Nutrition Data
FoodRepo: An Open Food Repository
An open food repository of barcoded food items, programmatically accessible through an API, suitable for large-scale studies in digital nutrition.
Public Health & Epidemiology
Nutrition Data
Framingham Heart Study
Longitudinal cardiovascular study tracking risk factors for heart disease.
Clinical & Cohort Data
Longitudinal Cohort
Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (FCP/INDI)
A neuroimaging data-sharing platform.
Imaging & AI Datasets
Multimodal Imaging
GEO (Gene Expression Omnibus)
A repository of functional genomics datasets.
Imaging & AI Datasets
Multimodal Imaging
GISAID
Global Initiative on Sharing Avian Influenza Data, focused on SARS-CoV-2 genome sequences.
REVIEW
REVIEW
GISAID (Genomic Epidemiology of Viruses)
Public Health & Epidemiology Data
REVIEW
REVIEW
GMrepo
A curated human gut microbiome database with a focus on disease markers and cross-dataset comparisons.
REVIEW
REVIEW
GPM DB
A proteomics database for protein identification.
REVIEW
REVIEW
GTEx (Genotype-Tissue Expression Project)
A resource linking genetic variation with gene expression in multiple tissues.
Imaging & AI Datasets
Multimodal Imaging
GenBank
A comprehensive sequence database.
REVIEW
REVIEW
Gene Expression Omnibus (GEO)
A gene expression repository.
Genomics & Multi-Omics
Transcriptomics
Gene Expression Omnibus (GEO)
Repository of high-throughput gene expression and hybridization array data.
Genomics & Multi-Omics
Transcriptomics
Genetic Perturbation Platform (GPP)
Provides resources for analyzing CRISPR and RNAi screening data.
REVIEW
REVIEW
Genetics of Alzheimer's Disease Data (NIAGADS)
Genetic repository for Alzheimer? disease GWAS and sequencing data.
Disease-Specific Data
Alzheimers Disease
Genetics of Type 2 Diabetes (GoT2D)
Comprehensive genetic study of Type 2 diabetes risk factors.
Disease-Specific Data
Diabetes
Genome Aggregation Database (gnomAD)
A large-scale reference database of human genetic variation.
REVIEW
REVIEW
Genome Aggregation Database (gnomAD)
A resource for aggregated human genetic variation data from multiple sequencing projects.
Imaging & AI Datasets
Multimodal Imaging
GenomeRNAi
A database for RNAi screening data.
REVIEW
REVIEW
Genomic Data Commons (GDC)
Centralized repository for cancer genomics data.
Disease-Specific Data
Cancer
Genomic Data Commons (GDC) Data Portal
NCI's unified data repository for cancer genomic research.
Disease-Specific Data
Cancer
Genomics of Parkinson? Disease (GP2)
Global Parkinson? genomics project analyzing genetic risk factors.
Disease-Specific Data
Parkinsons Disease
German Neuroinformatics Node/G-Node (GIN)
A neuroscience research data repository.
General Research Repositories
Multidisciplinary Data
GeroSense Wearable Data
AI-based biological aging clocks from real-world wearable devices (Fitbit, Apple, Garmin).
REVIEW
REVIEW
GigaDB
A database for large-scale biological data.
REVIEW
REVIEW
Global Biodiversity Information Facility (GBIF)
A global biodiversity database.
REVIEW
REVIEW
Global Burden of Disease Study (GBD)
A comprehensive regional and global research program assessing mortality and disability from major diseases, injuries, and risk factors, providing insights into environmental determinants of chronic diseases.
Imaging & AI Datasets
Multimodal Imaging
Global Diabetes Footprint Dataset
Global study tracking the impact of diabetes worldwide.
Disease-Specific Data
Diabetes
Global Health Observatory (WHO)
Global health inequalities across regions.
REVIEW
REVIEW
Global Microbiome Dataset (GMrepo)
Multi-continent human microbiome sequencing dataset for gut health research and immune system studies.
REVIEW
REVIEW
Global Network Maternal and Newborn Health Registry
Registry tracking maternal and newborn health outcomes in low-resource settings.
REVIEW
REVIEW
Global Nutrition Report Dataset
Contains data for all indicators used in country profiles, compiled from sources like UNICEF, WHO, and the World Bank.
Public Health & Epidemiology
Nutrition Data
Global Virus Network (GVN)
Early-warning system for new pandemics & viral outbreaks, providing surveillance of emerging infectious diseases.
Imaging & AI Datasets
Multimodal Imaging
Glycemic Index Research Database
Database on the glycemic index of foods and their impact on blood sugar.
Imaging & AI Datasets
Multimodal Imaging
Google BigQuery Public Datasets
A collection of open datasets optimized for cloud-based big data analysis.
Imaging & AI Datasets
Multimodal Imaging
Google Dataset Search
A tool that enables users to discover datasets stored across the web, covering various disciplines and topics.
REVIEW
REVIEW
Google DeepMind?s AlphaFold Protein Structure Database
AI-driven protein structure predictions, accelerating drug discovery and protein engineering.
Imaging & AI Datasets
Multimodal Imaging
Google Genomics
A cloud-based platform for storing and analyzing genomic data.
REVIEW
REVIEW
Gut Microbiome-Metabolome Dataset Collection
A curated collection of unified data tables from 14 different human gut microbiome-metabolome studies.
Imaging & AI Datasets
Multimodal Imaging
HDPulse Data Portal The Data Portal characterizes the burden of disparities across the United States and within communities
HDPulse Data Portal The Data Portal characterizes the burden of disparities across the United States and within communities
Imaging & AI Datasets
Multimodal Imaging

 

© 2025 by Center of Excellence – Consortium of Educators.

 

bottom of page