The observability stack provides monitoring, logging, alerting, and SLO tracking capabilities for the platform.

Core Components

  • Prometheus: Prometheus monitoring stack with Grafana and Alertmanager
  • Grafana: Visualization and dashboard platform
  • Loki: Log aggregation system designed to store and query logs
  • Fluent-bit: Fast and lightweight log processor and forwarder
  • Pyrra: SLO management and burn-rate alerting using Prometheus metrics

Observability Capabilities

These components provide:

  • Metrics collection and storage
  • Log aggregation and querying
  • Dashboard visualization
  • Alerting and notification
  • SLO tracking and error budget monitoring