Semantic Data Engineer — Debug Workflow¶
Description: Fix issues and errors in artifacts
When to Use¶
Use the debug workflow when you need to fix issues and errors in artifacts.
Input Requirements¶
- Source data in relational, CSV, JSON, and unstructured formats
- Ontology schemas from Ontology Architect (OA)
- Entity resolution rules and link prediction models
- SPARQL query requirements and endpoint configurations
Process¶
- Initialize — Set up the debug context for Semantic Data Engineer
- Execute — Perform the debug operation following Semantic Data Engineer's style
- Validate — Check output against quality gates
- Handoff — Deliver results to downstream personas
Output¶
- RDF knowledge graph datasets conforming to target ontologies
- R2RML/RML mapping specifications for reproducible transformation
- Entity resolution reports with precision and recall metrics
- SPARQL endpoint documentation with query examples and performance benchmarks
Quality Gates¶
- All transformations must produce valid RDF conforming to target ontologies
- Entity resolution must achieve defined precision and recall thresholds
- SPARQL endpoints must meet query performance SLAs
- Data lineage must be tracked from source through transformation to graph