Global Topology
Nodes, probe links, and live ingress overlaid in one map.
Event Stream
Recent incidents, status changes, and anomalies.
Active Incidents
Open, acknowledged, and suppressed issues that still need lifecycle tracking.
Control Plane
Server health, build metadata, database check, and required runtime assets.
Statistical Radar
24h robust outlier and baseline-shift ranking across the fleet.
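One common way to build a robust outlier ranking is a modified z-score based on the median and the median absolute deviation (MAD). The sketch below assumes one scalar metric per node over the 24h window. The function names and sample values are hypothetical, and the dashboard's actual scoring method is not stated here.

```python
from statistics import median


def robust_z_scores(values):
    """Return modified z-scores using the median and MAD instead of mean and stdev."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # All values (or at least half of them) sit on the median: no usable spread.
        return [0.0 for _ in values]
    # 0.6745 rescales MAD so scores are comparable to ordinary z-scores
    # when the data are roughly normal.
    return [0.6745 * (v - med) / mad for v in values]


def rank_outliers(node_values):
    """Rank nodes by absolute robust z-score, most anomalous first."""
    names = list(node_values)
    scores = robust_z_scores([node_values[name] for name in names])
    return sorted(zip(names, scores), key=lambda pair: abs(pair[1]), reverse=True)


# Hypothetical fleet: node name -> 24h mean CPU percentage.
fleet = {"node-a": 10.0, "node-b": 11.0, "node-c": 12.0, "node-d": 50.0}
ranking = rank_outliers(fleet)
```

Because the median and MAD are barely moved by a single extreme node, the outlier does not mask itself the way it can with a mean and standard deviation.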
Experiment View
Ground-truth evaluation of fault injection, based on labelled experiments.
Detector Benchmark
Five detectors replayed against the same labelled experiments with 95% bootstrap CIs.
| Detector | Detect % | Mean delay (s) | 95% CI | FP / node-hour | Events |
|---|---|---|---|---|---|
View head-to-head figure
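A 95% bootstrap CI for a detector's mean delay can be approximated with a percentile bootstrap: resample the observed delays with replacement, recompute the mean each time, and read off the 2.5th and 97.5th percentiles. This is a minimal sketch under that assumption. The function name, resample count, and delay values are hypothetical, and the benchmark may use a different bootstrap variant.

```python
import random
from statistics import mean


def bootstrap_ci(samples, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the mean of `samples`."""
    rng = random.Random(seed)  # fixed seed so the interval is reproducible
    n = len(samples)
    # Each resample draws n values with replacement and records their mean.
    means = sorted(mean(rng.choices(samples, k=n)) for _ in range(n_resamples))
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Hypothetical detection delays in seconds for one detector.
delays = [4.0, 5.5, 6.0, 7.2, 5.1, 6.4]
ci_low, ci_high = bootstrap_ci(delays)
```

Replaying every detector against the same labelled experiments means the intervals are computed over the same events, which makes them directly comparable.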
Scalability Benchmark
Single-server capacity from scripts/loadtest-local.sh. Static snapshot.
| Agents | Req/s | p50 (ms) | p95 (ms) | p99 (ms) | Success |
|---|---|---|---|---|---|
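The p50, p95, and p99 columns are latency percentiles. As an illustration, the sketch below uses the nearest-rank definition on a list of request latencies. The function name and data are hypothetical, and scripts/loadtest-local.sh may compute percentiles differently (for example with interpolation).

```python
import math


def percentile(latencies_ms, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of samples at or below it."""
    if not latencies_ms:
        raise ValueError("latencies_ms must not be empty")
    ordered = sorted(latencies_ms)
    # Multiply before dividing to keep the rank computation exact for integer pct.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical sample: 100 requests with latencies 1..100 ms.
latencies = list(range(1, 101))
p50, p95, p99 = (percentile(latencies, p) for p in (50, 95, 99))
```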
Operational Reliability
24h availability proxy, telemetry quality, and incident pressure.
Node Matrix
Sortable health, resource pressure, and score summary.
| Node | Status | CPU | Memory | Down | Links | Score |
|---|---|---|---|---|---|---|
Link Diagnostics
Probe latency and packet loss ranked by weakest path.
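A weakest-path ranking could be sketched as a sort on packet loss first and latency second. The ordering rule, the `LinkProbe` type, and the sample links below are assumptions made for illustration. The panel's actual ranking key is not specified.

```python
from dataclasses import dataclass


@dataclass
class LinkProbe:
    source: str
    target: str
    latency_ms: float
    loss_pct: float


def rank_weakest(links):
    """Order links worst first: highest packet loss, then highest latency."""
    return sorted(links, key=lambda link: (link.loss_pct, link.latency_ms), reverse=True)


# Hypothetical probe results.
links = [
    LinkProbe("node-a", "node-b", latency_ms=12.0, loss_pct=0.0),
    LinkProbe("node-a", "node-c", latency_ms=48.0, loss_pct=2.5),
    LinkProbe("node-b", "node-c", latency_ms=95.0, loss_pct=0.0),
]
weakest_first = rank_weakest(links)
```

Putting loss ahead of latency treats a lossy link as worse than a slow but reliable one. Swapping the tuple order would give the opposite priority.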
Ingress Hotspots
Top sampled remote sources across the last 24 hours.