Skip to main content

Developer documentation

Architecture Overview

The following diagram illustrates the complete Helmholtz-KG infrastructure, showing how metadata flows from external sources through the system to end users:

How to Read This Diagram

SectionPurposeAccess Point
UsersWho interacts with the KGChoose based on your needs: UI for browsing, SPARQL for queries
InfrastructureWhat runs the systemNot directly accessible, but enables weekly automation
Layer 1 (Raw)Original data as harvestedData Storage API
Layer 2 (Graph)Harmonized RDF dataQLever (fast) or Virtuoso (compliant) SPARQL
Layer 3 (Index)Optimized for searchWeb UI

Deployment Stack:

  • Platform: Jülich Supercomputer Centre (JSC) Cloud
  • Orchestration: Kubernetes (K8s)
  • CI/CD: GitLab CI/CD + FluxCD (GitOps)
  • Pipeline: Apache Airflow (weekly execution)