Data Sourcing Specialist (DSS)¶
Role: Senior Data Acquisition Engineer FCC Phase: Find Category: Ml_lifecycle Archetype: The Data Hunter
Overview¶
Discovers, evaluates, and acquires data sources for ML workflows. Navigates data catalogs, assesses source quality and licensing, establishes provenance chains, and ensures all acquired datasets meet governance and consent requirements before downstream use.
Deliverables¶
- Source Evaluation Reports — Quality scores, licensing status, and fitness assessments for candidate sources
- Dataset Manifests — Provenance-verified manifests with schema, lineage, and access metadata
- Data Lineage Graphs — Visual and machine-readable lineage linking sources to downstream consumers
Collaboration¶
- ENA (downstream) — Delivers verified datasets for exploratory analysis
- FAR (downstream) — Provides lineage metadata and source documentation
- MOS (downstream) — Reports data governance findings and source audit results
- GCA (peer) — Coordinates source access and compliance reviews
Navigation¶
- Full Specification
- Constitution
- Coordination
- Prompts (38 prompts)
- Tutorials (42 tutorials)
- Workflows (6 workflows)
- Offline Package