Operations
Monitoring
Prometheus metrics, health endpoints, structured logs.
Endpoints
- /health: process is alive
- /ready: node is synced and voting
- /metrics: Prometheus scrape
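As a rough illustration of how the two health endpoints can be combined, the sketch below probes /health and /ready and maps the results to a coarse node state. It assumes each endpoint answers HTTP 200 on success and something else otherwise, which this page does not confirm. The function names, the state labels, and the base URL in the usage note are hypothetical.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=2.0):
    """Return the HTTP status code for url, or None if no response arrived."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (e.g. 503).
        return exc.code
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return None


def classify(health_status, ready_status):
    """Map the two probe results to a coarse node state.

    Assumption (not stated in the docs): 200 means the check passed.
    """
    if health_status != 200:
        return "down"
    if ready_status != 200:
        return "alive-not-ready"
    return "ready"


def probe(base_url):
    """Probe both endpoints under base_url and classify the node."""
    return classify(
        fetch_status(base_url + "/health"),
        fetch_status(base_url + "/ready"),
    )
```

A call such as probe("http://localhost:9100") would return one of the three labels; the host and port here are placeholders, not the node's documented defaults. Keeping classify separate from the network call makes the decision logic easy to test without a running node.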
Key metrics
- cipher_proof_verify_seconds
- cipher_votes_total{result="ok|fail"}
- cipher_peer_count
- cipher_coordinator_role (0 = passive, 1 = primary)
Suggested alerts
- Missed votes > 5 in 10 minutes
- Peer count < 4 for > 5 minutes
- Coordinator role flapping > 2× / hour
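The three thresholds above can be sketched as plain checks over sampled time series. This is an illustration of the logic only, not the project's alerting rules. It assumes each series is a time-ordered list of (timestamp_seconds, value) pairs, and that missed votes are tracked by the cipher_votes_total{result="fail"} counter, which this page does not state. All function and alert names are hypothetical.

```python
def in_window(samples, now, window):
    """Samples whose timestamp falls within [now - window, now]."""
    return [(t, v) for t, v in samples if now - window <= t <= now]


def counter_increase(samples, now, window):
    """Increase of a counter across the window (counter resets are ignored)."""
    pts = in_window(samples, now, window)
    if len(pts) < 2:
        return 0.0
    return pts[-1][1] - pts[0][1]


def below_duration(samples, now, threshold):
    """Seconds the series has stayed continuously below threshold up to now."""
    start = None
    for t, v in reversed(samples):
        if t > now:
            continue
        if v >= threshold:
            break
        start = t
    return 0.0 if start is None else now - start


def count_changes(samples, now, window):
    """Number of value changes between consecutive samples in the window."""
    pts = in_window(samples, now, window)
    return sum(1 for (_, a), (_, b) in zip(pts, pts[1:]) if a != b)


def evaluate_alerts(failed_votes, peer_count, coordinator_role, now):
    """Return the names of the suggested alerts that would fire at `now`."""
    firing = []
    # Missed votes > 5 in 10 minutes (assumes failed votes == missed votes).
    if counter_increase(failed_votes, now, 600) > 5:
        firing.append("missed_votes")
    # Peer count < 4 for > 5 minutes.
    if below_duration(peer_count, now, 4) > 300:
        firing.append("low_peer_count")
    # Coordinator role flapping > 2 times per hour.
    if count_changes(coordinator_role, now, 3600) > 2:
        firing.append("coordinator_flapping")
    return firing
```

In a Prometheus deployment these conditions would more commonly be written as alerting rules evaluated by the server; the Python form is used here only to make the window and threshold arithmetic explicit.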