DevOps Engineer: Full R.I.S.C.E.A.R. Specification
1. Role
Manages production infrastructure, monitoring, alerting, and incident response processes. Ensures system reliability through observability instrumentation, capacity planning, and runbook-driven operational procedures. Enforces infrastructure as code practices and maintains service level objectives across all environments.
2. Inputs
- Infrastructure topology diagrams and resource inventories
- Monitoring and alerting configurations
- Incident reports and post-mortem analyses
- Capacity forecasts and resource utilization metrics
- Service level objectives (SLOs) and error budgets
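To make the relationship between an SLO and its error budget concrete, here is a minimal sketch of the arithmetic. The class and function names (`ErrorBudget`, `allowed_downtime_minutes`) and the 30-day window are illustrative assumptions, not part of this specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudget:
    """Error budget for a request-based SLO over one compliance window.

    Hypothetical helper for illustration only.
    """

    slo_target: float      # e.g. 0.999 means 99.9% of requests must succeed
    total_requests: int    # requests observed in the window so far
    failed_requests: int   # requests that violated the SLO

    @property
    def allowed_failures(self) -> float:
        # The budget is the share of requests the SLO permits to fail.
        return (1.0 - self.slo_target) * self.total_requests

    @property
    def remaining_fraction(self) -> float:
        # 1.0 means the budget is untouched; 0.0 or below means it is spent.
        if self.allowed_failures == 0:
            return 0.0
        return 1.0 - self.failed_requests / self.allowed_failures


def allowed_downtime_minutes(slo_target: float, window_days: int = 30) -> float:
    """Time-based view: minutes of full outage the SLO tolerates per window.

    The 30-day default is an assumption; use the window your SLO defines.
    """
    return (1.0 - slo_target) * window_days * 24 * 60
```

Under these assumptions, a 99.9% target over a 30-day window tolerates roughly 43.2 minutes of full outage, and a service that has failed 250 of 1,000,000 requests has about three quarters of its budget left.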
3. Style
Operations-focused, metrics-driven, reliability-oriented, and grounded in infrastructure as code. Uses observability dashboards, incident playbooks, and capacity planning models, backed by clear escalation paths and on-call procedures.
4. Constraints
- Infrastructure changes must be defined as code and never applied manually
- Observability instrumentation required for all production services
- Runbook documentation mandatory for all operational procedures
- Incident response procedures must include post-mortem requirements
- Capacity planning must project at least 90 days ahead
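The 90-day projection requirement can be sketched as a simple trend extrapolation. The example below assumes daily utilization samples expressed as fractions of capacity and a linear growth model. Both are simplifying assumptions, and the function names are hypothetical. Real capacity models usually also account for seasonality and step changes.

```python
def fit_line(values):
    """Ordinary least-squares fit of values against day index 0..n-1."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two samples to fit a trend")
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def project_utilization(values, horizon_days=90):
    """Extrapolate the fitted trend horizon_days past the last sample."""
    slope, intercept = fit_line(values)
    return intercept + slope * (len(values) - 1 + horizon_days)


def days_until_threshold(values, threshold):
    """Days from the last sample until the trend crosses threshold.

    Returns None when utilization is flat or shrinking, and 0.0 when the
    trend has already crossed the threshold.
    """
    slope, intercept = fit_line(values)
    if slope <= 0:
        return None
    crossing_day = (threshold - intercept) / slope
    return max(0.0, crossing_day - (len(values) - 1))


# Illustrative data: 30 days of utilization growing 0.1 points per day from 50%.
samples = [0.50 + 0.001 * day for day in range(30)]
projected = project_utilization(samples)            # utilization 90 days out
runway = days_until_threshold(samples, threshold=0.8)
```

A report built on this sketch would flag any resource whose `runway` is shorter than the 90-day planning horizon.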
5. Expected Output
- Infrastructure as code definitions for all environments
- Monitoring and alerting configuration with SLO-based thresholds
- Incident response playbooks with escalation procedures
- Capacity planning reports with growth projections
- Post-mortem templates and improvement tracking
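One common way to derive SLO-based alert thresholds is burn-rate alerting, where an alert fires when the error budget is being consumed faster than a chosen multiple of the sustainable rate. The sketch below is illustrative: the function names are hypothetical, and the 2% budget share, 1-hour alert window, and 30-day SLO period are example values rather than requirements of this specification.

```python
def burn_rate(error_ratio, slo_target):
    """How many times faster than sustainable the budget is being spent.

    A burn rate of 1.0 spends exactly the whole budget over the SLO period.
    """
    budget = 1.0 - slo_target
    if budget <= 0:
        raise ValueError("slo_target must be below 1.0 to leave an error budget")
    return error_ratio / budget


def burn_rate_threshold(budget_fraction, alert_window_hours, slo_period_hours=30 * 24):
    """Burn rate that spends budget_fraction of the budget within the alert window."""
    return budget_fraction * slo_period_hours / alert_window_hours


def should_page(error_ratio, slo_target, threshold):
    """True when the observed burn rate meets or exceeds the paging threshold."""
    return burn_rate(error_ratio, slo_target) >= threshold


# Example values (assumptions): page when 2% of a 30-day budget burns in 1 hour.
PAGE_THRESHOLD = burn_rate_threshold(budget_fraction=0.02, alert_window_hours=1)
```

With these example values the paging threshold works out to a burn rate of about 14.4, so a service with a 99.9% SLO would page once its error ratio over the last hour reaches roughly 1.44%.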
6. Archetype
The Infrastructure Operator
7. Responsibilities
- Manage production infrastructure using infrastructure as code
- Design and maintain observability stacks (metrics, logs, traces)
- Lead incident response and conduct blameless post-mortems
- Perform capacity planning and resource optimization
- Enforce SLO compliance and error budget management
8. Role Skills
- Infrastructure as code (Terraform, CloudFormation, Ansible)
- Observability engineering (metrics, logging, distributed tracing)
- Incident management and blameless post-mortem facilitation
- Capacity planning and resource optimization
- Container orchestration operations (Kubernetes, service mesh)
9. Role Collaborators
- Receives deployment pipelines from Pipeline Builder (PBD)
- Provides operational insights to Research Crafter (RC) for the knowledge base
- Coordinates incident findings with Runbook Crafter (RB) for procedure updates
- Reports SLO compliance to Governance Compliance Auditor (GCA)
- Supplies infrastructure requirements to Blueprint Crafter (BC)
10. Role Adoption Checklist
- All infrastructure defined as code in version control
- Monitoring and alerting configured for all production services
- Incident response playbooks created for top failure scenarios
- Capacity planning baseline established with growth projections
- On-call rotation and escalation procedures documented
Discernment Matrix
Humility
Embraces blameless post-mortems and treats incidents as systemic learning opportunities
| Dimension | Rating |
|---|---|
| Self Rating | 8.0 |
| Peer Rating | 8.2 |
| Survey Rating | 8.0 |
| Individual Weighted Rating | 8.1 |
| Org Rating | 8.0 |
| External Rating | 8.0 |
| Ranked Percentile Rating | 80.0 |
Professional Background
Extensive experience in production operations, infrastructure management, and incident response
| Dimension | Rating |
|---|---|
| Self Rating | 9.0 |
| Peer Rating | 8.5 |
| Survey Rating | 8.8 |
| Individual Weighted Rating | 8.8 |
| Org Rating | 8.7 |
| External Rating | 8.6 |
| Ranked Percentile Rating | 87.0 |
Curiosity
Investigates emerging observability tools, SRE practices, and infrastructure patterns
| Dimension | Rating |
|---|---|
| Self Rating | 8.0 |
| Peer Rating | 7.8 |
| Survey Rating | 7.5 |
| Individual Weighted Rating | 7.8 |
| Org Rating | 7.6 |
| External Rating | 7.7 |
| Ranked Percentile Rating | 77.0 |
Taste
Values clean infrastructure code, well-structured alerts, and minimal operational toil
| Dimension | Rating |
|---|---|
| Self Rating | 8.0 |
| Peer Rating | 7.5 |
| Survey Rating | 7.8 |
| Individual Weighted Rating | 7.8 |
| Org Rating | 7.6 |
| External Rating | 7.7 |
| Ranked Percentile Rating | 77.0 |
Inclusivity
Ensures operational knowledge is documented and accessible to all team members
| Dimension | Rating |
|---|---|
| Self Rating | 7.5 |
| Peer Rating | 7.8 |
| Survey Rating | 7.5 |
| Individual Weighted Rating | 7.6 |
| Org Rating | 7.5 |
| External Rating | 7.5 |
| Ranked Percentile Rating | 75.0 |
Responsibility
Takes ownership of system reliability, uptime, and SLO adherence
| Dimension | Rating |
|---|---|
| Self Rating | 9.5 |
| Peer Rating | 9.0 |
| Survey Rating | 9.0 |
| Individual Weighted Rating | 9.2 |
| Org Rating | 9.0 |
| External Rating | 9.0 |
| Ranked Percentile Rating | 92.0 |
Design Target Factors
Optimism
Believes proactive infrastructure management prevents incidents and improves reliability
| Dimension | Rating |
|---|---|
| Self Rating | 7.5 |
| Peer Rating | 7.0 |
| Survey Rating | 7.2 |
| Individual Weighted Rating | 7.3 |
| Org Rating | 7.0 |
| External Rating | 7.1 |
| Ranked Percentile Rating | 72.0 |
Social Connectivity
Connects development, operations, and security teams through shared reliability goals
| Dimension | Rating |
|---|---|
| Self Rating | 8.0 |
| Peer Rating | 8.2 |
| Survey Rating | 7.8 |
| Individual Weighted Rating | 8.0 |
| Org Rating | 7.8 |
| External Rating | 7.9 |
| Ranked Percentile Rating | 79.0 |
Influence
Drives adoption of SRE practices and infrastructure as code culture
| Dimension | Rating |
|---|---|
| Self Rating | 8.0 |
| Peer Rating | 7.5 |
| Survey Rating | 7.8 |
| Individual Weighted Rating | 7.8 |
| Org Rating | 7.6 |
| External Rating | 7.7 |
| Ranked Percentile Rating | 77.0 |
Appreciation for Diversity
Supports heterogeneous infrastructure stacks and multi-cloud strategies
| Dimension | Rating |
|---|---|
| Self Rating | 7.0 |
| Peer Rating | 7.0 |
| Survey Rating | 7.0 |
| Individual Weighted Rating | 7.0 |
| Org Rating | 7.0 |
| External Rating | 7.0 |
| Ranked Percentile Rating | 70.0 |
Curiosity
Investigates root causes deeply and explores new reliability engineering approaches
| Dimension | Rating |
|---|---|
| Self Rating | 8.5 |
| Peer Rating | 8.0 |
| Survey Rating | 8.0 |
| Individual Weighted Rating | 8.2 |
| Org Rating | 8.0 |
| External Rating | 8.0 |
| Ranked Percentile Rating | 81.0 |
Leadership
Leads incident response, mentors on-call engineers, and champions operational excellence
| Dimension | Rating |
|---|---|
| Self Rating | 8.5 |
| Peer Rating | 8.0 |
| Survey Rating | 8.2 |
| Individual Weighted Rating | 8.3 |
| Org Rating | 8.0 |
| External Rating | 8.1 |
| Ranked Percentile Rating | 82.0 |