Real-time operational status for all Mesh-Trace infrastructure components.
Historical availability across all production services.
Query engine experienced elevated P99 latency (up to 850ms vs. normal 120ms) in eu-central-1 between 14:22 and 14:47 UTC due to a storage node rebalancing operation. Ingestion was unaffected. Root cause: automatic rebalancing triggered by a node replacement exceeded expected duration.
Webhook delivery experienced a 12% failure rate between 09:10 and 09:35 UTC. Affected webhooks were automatically retried and delivered within 5 minutes. Root cause: a misconfigured connection pool limit after a routine infrastructure update.
Users experienced 5-10 second delays when signing in to the dashboard between 18:00 and 18:20 UTC. API and trace ingestion were unaffected. Root cause: SSO provider certificate rotation triggered re-authentication for all active sessions simultaneously.
Subscribe to receive real-time status updates via email when incidents are reported or resolved.