Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Role: Compute & Orchestration

Responsibility

The Compute role provides the execution environment for all platform workloads. It handles scheduling, lifecycle management (start/stop/restart), and resource isolation between tenants.

Key Guarantees

  • Automated Scheduling: Workloads are placed on nodes based on resource availability and constraints.
  • Self-Healing: Automatic recovery of failed workloads.
  • Resource Isolation: Enforced limits on CPU, memory, and disk to prevent “noisy neighbor” effects.

Implementation Options

OptionBest FitGood AtCosts / RisksIntegration Notes
K3sPragmatic homelabLightweight Kubernetes; easy setup; bundled extras (Traefik).Still Kubernetes complexity; some opinionated defaults.Best with Traefik + Longhorn patterns.
Kubernetes on TalosEnterprise-grade HAImmutable OS; security; “OS as appliance” feel.Steeper learning curve; non-traditional debugging.Pairs with NGINX + Rook-Ceph + Velero.
NomadSimple/FlexibleEasy to run; handles non-container workloads; low overhead.Smaller ecosystem than K8s; stateful patterns more DIY.Best with Consul/Vault, ZFS host volumes, Traefik/Caddy.

Typical Stack Pairings

  • K3s: Traefik + Longhorn
  • Talos/K8s: NGINX + Rook-Ceph
  • Nomad: Traefik + ZFS + Consul