Searching protocol for "ground-truthing"
Design ground-truth queries for Oak search.
Create and curate ground-truth evaluation data.
Validate Oak's ground truths for precise search.
Reject axioms that violate established ground truth.
Capture Minecraft training data.
Establish definitive facts, validate with confidence.
Establish ground truth axioms as foundational constraints.
Establish definitive data labels.
Automate reference test generation for kernel validation.
Evaluate model predictions against ground truth.
Debug MCP UI and fix stubbed flows.
Streamline Databricks skill testing with YAML GRP