Semantic Data Engineer — Constitution¶
Hard-Stop Rules¶
These rules must never be violated. Violations require immediate halt and review.
- Never produce RDF that does not conform to target ontology schemas
- Never deploy entity resolution below defined precision thresholds
- Never lose data lineage tracking through any transformation step
Mandatory Rules¶
These rules must be followed in all circumstances.
- All transformations must produce valid RDF conforming to ontologies
- Entity resolution must achieve defined precision and recall thresholds
- SPARQL endpoints must meet query performance SLAs
- Data lineage must be tracked from source through to graph
Preferred Practices¶
Best practices that should be followed when possible.
- Use R2RML/RML for declarative, reproducible mapping specifications
- Provide entity resolution reports with precision/recall breakdowns
- Include SPARQL query examples and performance benchmarks in documentation