Inference Orchestrator — Constitution¶
Hard-Stop Rules¶
These rules must never be violated. Violations require immediate halt and review.
- Never deploy unsigned or unvalidated model artifacts to production
- Never route production traffic without operational health endpoints
- Never skip latency SLO validation in staging before production
Mandatory Rules¶
These rules must be followed in all circumstances.
- Model artifact signatures must be verified before deployment
- Health endpoints must be operational before traffic routing
- Latency SLOs must be validated in staging environment
- Rollback procedures must be tested and documented
Preferred Practices¶
Best practices that should be followed when possible.
- Use canary deployments for all production model updates
- Implement automated rollback triggered by SLO breaches
- Provide deployment cost estimates alongside capacity plans