OptimusKG
Graph Schema

Node Types

The 10 entity types in the OptimusKG knowledge graph.

OptimusKG contains 10 node types representing biomedical entities. Each node type is consolidated from one or more data sources in the silver layer.

Entity Types

Node TypeDescriptionKey Sources
GeneHuman genes with HGNC symbols and Ensembl IDsOpenTargets, HGNC, DisGeNET, PPI
DrugPharmaceutical compoundsDrugBank, OpenTargets, DrugCentral
DiseaseDisease entities with ontology mappingsOpenTargets, MONDO, DisGeNET
AnatomyAnatomical structuresUBERON, Bgee
PathwayBiological pathwaysReactome
PhenotypePhenotypic featuresHPO, OpenTargets
ExposureEnvironmental exposures and chemicalsCTD
Biological ProcessGO biological process termsGene Ontology
Cellular ComponentGO cellular component termsGene Ontology
Molecular FunctionGO molecular function termsGene Ontology

Silver Processing

Each node type has a dedicated processing module in optimuskg/pipelines/silver/nodes/nodes/ that:

  1. Collects relevant data from all bronze sources
  2. Reconciles identifiers across sources
  3. Merges properties into a unified entity record
  4. Validates uniqueness and completeness

On this page