Human Genome Database
A consolidated, analytics-oriented database for conducting large-scale research on published genomics data
Data Pipeline
The HGD data pipeline extracts and standardizes data from several public sources of published, validated genomics data. Currently the pipeline supports extraction from:
Database
The overarching goal of this database is to standardize the connections between public genomics databases in order to support advanced analytics, machine learning, and research-based discovery. The entity-relationship diagram (ERD) below shows how data from the different sources is currently connected.
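As a rough illustration of what standardizing connections between sources can look like, the following minimal sketch links records from two sources through one shared, normalized gene table. It is not the HGD schema: the table names (`gene`, `source_record`), the source names, the external IDs, and the normalization rule are all hypothetical and chosen only for this example, which uses Python's built-in `sqlite3` module.

```python
import sqlite3


def normalize_symbol(symbol: str) -> str:
    """Hypothetical standardization rule: trim whitespace and upper-case."""
    return symbol.strip().upper()


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with an illustrative (not HGD) schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE gene (
            gene_id INTEGER PRIMARY KEY,
            symbol  TEXT NOT NULL UNIQUE
        );
        CREATE TABLE source_record (
            record_id   INTEGER PRIMARY KEY,
            source_name TEXT NOT NULL,
            external_id TEXT NOT NULL,
            gene_id     INTEGER NOT NULL REFERENCES gene(gene_id),
            UNIQUE (source_name, external_id)
        );
        """
    )
    return conn


def add_record(conn: sqlite3.Connection, source_name: str,
               external_id: str, raw_symbol: str) -> None:
    """Attach a source-specific record to the shared gene row."""
    symbol = normalize_symbol(raw_symbol)
    # Create the shared gene row only if it does not exist yet.
    conn.execute("INSERT OR IGNORE INTO gene (symbol) VALUES (?)", (symbol,))
    gene_id = conn.execute(
        "SELECT gene_id FROM gene WHERE symbol = ?", (symbol,)
    ).fetchone()[0]
    conn.execute(
        "INSERT INTO source_record (source_name, external_id, gene_id) "
        "VALUES (?, ?, ?)",
        (source_name, external_id, gene_id),
    )


def sources_for_gene(conn: sqlite3.Connection, raw_symbol: str) -> list:
    """Return (source_name, external_id) pairs linked to one gene."""
    return conn.execute(
        """
        SELECT s.source_name, s.external_id
        FROM source_record AS s
        JOIN gene AS g ON g.gene_id = s.gene_id
        WHERE g.symbol = ?
        ORDER BY s.source_name, s.external_id
        """,
        (normalize_symbol(raw_symbol),),
    ).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    # Made-up source names and IDs; the symbol is spelled differently in each.
    add_record(conn, "source_a", "A-001", " brca1 ")
    add_record(conn, "source_b", "B-42", "BRCA1")
    print(sources_for_gene(conn, "Brca1"))
```

Because both inserts normalize to the same symbol, they resolve to a single `gene` row, so one join returns the records from both sources. A real schema would likely key on stable accession identifiers instead of symbols; the symbol is used here only to keep the sketch short.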
Source Code: Human Genome Database Repository