Trama

Observability

Metrics Catalog

MetricTypeDescription
saga_enqueue_totalCounterQueue ingress
saga_dequeue_totalCounterQueue claims
saga_processed_totalCounterProcessed outcomes
saga_failed_totalCounterFailures by reason
saga_retried_totalCounterRetry scheduling rate
saga_rate_limited_totalCounterRate-limited executions
saga_redis_claim_scans_totalCounterRedis shard claim scans performed by queue claimers.
saga_redis_active_podsGaugeHealthy worker pods seen in Redis membership.
saga_redis_owned_shardsGaugeVirtual shards currently assigned to this worker.
saga_redis_membership_refresh_age_msGaugeAge of the last successful membership refresh.
saga_duration_secondsHistogramEnd-to-end duration
saga_step_duration_success_secondsHistogramSuccessful step duration

The Redis cluster rollout adds worker-coordination metrics so you can observe shard ownership, claim pressure, and membership freshness without inspecting Redis directly.

Definition-Level Labels

Grafana Dashboard

Import grafana/trama-saga-dashboard.json.

Suggested Alerts

Tracing

OpenTelemetry spans cover request handling and saga processing when telemetry is enabled.