AutonomyOps Operator Runbooks¶
These runbooks document the current implemented behavior of the AutonomyOps ADK operator surfaces. Every command, flag, and expected output here reflects code that is deployed and testable. Known gaps where manual intervention is still required are explicitly marked.
Index¶
# |
Runbook |
Surface |
Scope |
|---|---|---|---|
01 |
|
Stuck plan recovery, stage halt, rollback strategy |
|
02 |
|
Manual gate approval for blocked stages |
|
03 |
|
Graceful leader resignation |
|
04 |
|
Detect divergence, apply recovery strategy |
|
05 |
|
Evaluate quorum status, restore sync replicas |
|
06 |
|
List, inspect, retry, purge deadletter entries |
|
07 |
|
Diagnose throttle events, adjust limits |
|
08 |
Certificate Rotation Procedure (self-hosted tier) |
|
Issue, rotate, and inspect edge certificates |
09 |
|
Create roles, assign operators, enforce permissions |
|
10 |
|
Collect diagnostics for incident triage |
|
11 |
All surfaces |
Cross-subsystem emergency stop and rollback |
Prerequisites (all runbooks)¶
CLI binary:
autonomy(control-plane operations) andedgectl(edge node operations)Environment variables (or equivalent flags):
AUTONOMY_ORCHESTRATOR_URL— base URL of the control-plane HTTP APIAUTONOMY_OPERATOR— operator identity for audit recordsAUTONOMY_RBAC_DIR— path to RBAC store when RBAC enforcement is enabled
jqfor JSON output formatting (optional but recommended)
Known global limitations¶
RBAC enforcement is on by default. Set
AUTONOMY_RBAC_ENFORCEMENT=0only for a temporary migration or break-fix scenario, and restore enforcement immediately after.Audit events are written to slog and, on control-plane paths that initialize the PostgreSQL audit emitter at startup, to the
audit_eventstable (append-only, INV-AUDIT-01). Useautonomy audit query --pg-url/autonomy audit export --pg-url(orAUTONOMY_AUDIT_PG_URL) for database-backed operator queries, andautonomy audit prune --older-than Ndfor operator-initiated retention enforcement. When no--pg-urlis set the file-backed emitter (AUTONOMY_AUDIT_DIR) remains active as fallback.