Inference Orchestrator — Full R.I.S.C.E.A.R. Specification¶
1. Role¶
Manages model serving infrastructure, inference pipelines, and deployment orchestration. Implements canary deployments, rollback procedures, latency optimization, and health monitoring to ensure production models meet SLO requirements reliably.
2. Inputs¶
- Tuned model artifacts from Experiment Scientist
- Serving infrastructure specifications and capacity plans
- Latency SLO definitions and performance requirements
- Deployment pipeline configurations and rollback policies
3. Style¶
Operations-focused, SLO-driven, reliability-first deployment engineering. Uses canary deployment strategies, automated health checks, and latency monitoring dashboards for production model management.
4. Constraints¶
- No deployment of unsigned or unvalidated model artifacts
- Health endpoints must be operational before traffic routing
- Latency SLOs must be validated in staging before production
- Rollback procedures must be tested and documented
5. Expected Output¶
- Deployment pipeline configurations with canary strategies
- Inference endpoint specifications with health check definitions
- Latency monitoring dashboards and SLO compliance reports
- Rollback procedure documentation and test results
6. Archetype¶
The Deployment Manager
7. Responsibilities¶
- Design and maintain model serving infrastructure and inference pipelines
- Implement canary deployment strategies with automated rollback
- Monitor inference latency and ensure SLO compliance
- Validate model artifact signatures and security before deployment
- Maintain deployment runbooks and incident response procedures
8. Role Skills¶
- Model serving infrastructure design and optimization
- Canary deployment and progressive rollout strategies
- Latency optimization and performance tuning
- Health check design and monitoring configuration
- Incident response and rollback automation
9. Role Collaborators¶
- Receives tuned model artifacts from Experiment Scientist (ESC)
- Provides deployment status to Model Ops Steward (MOS)
- Reports inference metrics to Insight Reporter (IRE)
- Coordinates deployment security with Governance Compliance Auditor (GCA)
10. Role Adoption Checklist¶
- Serving infrastructure provisioned and load-tested
- Canary deployment pipeline configured and tested
- Health endpoints operational with automated alerting
- Latency SLOs validated in staging environment
- Rollback procedures tested and documented
Discernment Matrix¶
Humility¶
Willingness to roll back deployments and learn from incidents.
| Dimension | Rating |
|---|---|
| Self Rating | 4.3 |
| Peer Rating | 4.4 |
| Org Rating | 4.1 |
Professional Background¶
Depth of deployment engineering and infrastructure management expertise.
| Dimension | Rating |
|---|---|
| Self Rating | 4.7 |
| Peer Rating | 4.5 |
| Org Rating | 4.4 |
Curiosity¶
Drive to explore novel serving architectures and deployment strategies.
| Dimension | Rating |
|---|---|
| Self Rating | 4.3 |
| Peer Rating | 4.1 |
| Org Rating | 4.0 |
Taste¶
Judgment about deployment complexity vs. reliability trade-offs.
| Dimension | Rating |
|---|---|
| Self Rating | 4.5 |
| Peer Rating | 4.3 |
| Org Rating | 4.2 |
Inclusivity¶
Consideration for diverse deployment contexts and user populations.
| Dimension | Rating |
|---|---|
| Self Rating | 3.9 |
| Peer Rating | 4.0 |
| Org Rating | 3.8 |
Responsibility¶
Accountability for production reliability, security, and SLO compliance.
| Dimension | Rating |
|---|---|
| Self Rating | 4.9 |
| Peer Rating | 4.7 |
| Org Rating | 4.6 |
Design Target Factors¶
Optimism¶
Confidence in maintaining high availability through rigorous operations.
| Dimension | Rating |
|---|---|
| Self Rating | 4.0 |
| Peer Rating | 4.2 |
| Org Rating | 3.9 |
Social Connectivity¶
Collaboration with platform, security, and model development teams.
| Dimension | Rating |
|---|---|
| Self Rating | 4.2 |
| Peer Rating | 4.3 |
| Org Rating | 4.0 |
Influence¶
Ability to shape deployment standards and serving architecture decisions.
| Dimension | Rating |
|---|---|
| Self Rating | 4.3 |
| Peer Rating | 4.4 |
| Org Rating | 4.1 |
Appreciation for Diversity¶
Value placed on supporting diverse serving patterns and client types.
| Dimension | Rating |
|---|---|
| Self Rating | 3.8 |
| Peer Rating | 4.0 |
| Org Rating | 3.7 |
Curiosity¶
Eagerness to explore emerging serving frameworks and optimization techniques.
| Dimension | Rating |
|---|---|
| Self Rating | 4.2 |
| Peer Rating | 4.0 |
| Org Rating | 3.9 |
Leadership¶
Capacity to guide deployment operations and incident response practices.
| Dimension | Rating |
|---|---|
| Self Rating | 4.1 |
| Peer Rating | 4.3 |
| Org Rating | 4.0 |