Edge Inference Engineer — Scaffold Workflow

Description: Generate a new artifact from scratch

When to Use

Use the scaffold workflow when you need to generate a new artifact from scratch rather than modify an existing one.

Input Requirements

  • Source models from Local Model Curator (LMC) registry
  • Target hardware specifications (CPU arch, GPU/NPU capabilities, memory limits)
  • Latency, throughput, and power budget requirements
  • ONNX Runtime, TFLite, and Core ML configuration profiles
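The inputs above can be captured in a single structured spec. This is a minimal sketch; the field names, defaults, and the `lmc://` reference format are illustrative assumptions, not a defined schema.

```python
from dataclasses import dataclass, field

@dataclass
class TargetSpec:
    # Hypothetical input spec for the scaffold workflow; field names
    # and defaults are assumptions for illustration.
    model_id: str                       # reference into the LMC registry
    cpu_arch: str                       # e.g. "armv8.2-a"
    accelerators: list = field(default_factory=list)  # GPU/NPU capabilities
    memory_limit_mb: int = 512          # device memory budget
    latency_budget_ms: float = 50.0     # per-inference latency budget
    throughput_rps: float = 20.0        # sustained requests per second
    power_budget_w: float = 5.0         # power envelope
    runtime_profile: str = "onnxruntime"  # or "tflite", "coreml"

spec = TargetSpec(model_id="lmc://mobilenet-v3-small",
                  cpu_arch="armv8.2-a",
                  accelerators=["mali-g78"])
```

Keeping every budget in one object makes it easy to pass the same constraints to both the optimization step and the quality-gate checks.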

Process

  1. Initialize — Set up the scaffold context for Edge Inference Engineer
  2. Execute — Perform the scaffold operation following Edge Inference Engineer's style
  3. Validate — Check output against quality gates
  4. Handoff — Deliver results to downstream personas
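The four steps above can be sketched as a small driver. The function and argument names are placeholders, assuming each persona supplies its own `optimize`, `validate`, and `handoff` implementations.

```python
def run_scaffold(spec, optimize, validate, handoff):
    """Minimal sketch of the Initialize/Execute/Validate/Handoff flow.

    The three callables are placeholders for persona-specific logic;
    this is an illustrative structure, not the tool's actual API.
    """
    context = {"spec": spec, "log": []}      # 1. Initialize the scaffold context
    artifact = optimize(context)             # 2. Execute the scaffold operation
    report = validate(artifact, context)     # 3. Validate against quality gates
    if not report["passed"]:
        raise RuntimeError(f"quality gates failed: {report['failures']}")
    return handoff(artifact, report)         # 4. Handoff to downstream personas
```

Failing fast on gate violations keeps unvalidated artifacts from reaching downstream personas.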

Output

  • Optimized model artifacts (quantized, pruned, distilled) for target runtimes
  • Optimization reports with latency-accuracy-memory trade-off analysis
  • Hardware profiling results showing resource utilization per device
  • Deployment packages with runtime configuration and model serving specs
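A trade-off report from the list above might look like the following. The keys and the numbers are purely illustrative, not real benchmark results.

```python
# Hypothetical shape of one optimization-report entry; all values are
# made-up examples to show the before/after trade-off structure.
report = {
    "model": "mobilenet-v3-small",
    "technique": "int8 dynamic quantization",
    "before": {"latency_ms": 92.4, "top1_acc": 0.752, "size_mb": 9.8},
    "after":  {"latency_ms": 41.7, "top1_acc": 0.744, "size_mb": 2.6},
}
# Derive the deltas so the latency-accuracy-memory trade-off is explicit.
report["deltas"] = {
    k: round(report["after"][k] - report["before"][k], 3)
    for k in report["before"]
}
```

Recording before/after pairs per technique also satisfies the documentation gate below: every optimization decision arrives with its benchmarks attached.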

Quality Gates

  • Optimized models must meet defined latency budgets on target hardware
  • Accuracy degradation from optimization must stay within defined thresholds
  • Memory footprint must fit within device resource constraints
  • All optimization decisions must be documented with before/after benchmarks
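The first three gates can be checked mechanically. This is a sketch; the metric keys and threshold names are assumptions, and real thresholds would come from the target-hardware spec.

```python
def check_gates(metrics, budgets):
    """Evaluate the latency, accuracy, and memory quality gates.

    `metrics` and `budgets` are hypothetical dicts; key names are
    illustrative assumptions, not a defined interface.
    """
    failures = []
    if metrics["latency_ms"] > budgets["latency_ms"]:
        failures.append("latency budget exceeded on target hardware")
    if metrics["baseline_acc"] - metrics["optimized_acc"] > budgets["max_acc_drop"]:
        failures.append("accuracy degradation over threshold")
    if metrics["memory_mb"] > budgets["memory_mb"]:
        failures.append("memory footprint over device limit")
    return {"passed": not failures, "failures": failures}
```

Returning the full failure list, rather than a bare boolean, gives the optimization report something concrete to document for each rejected candidate.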