Searching protocol for "failure-categories"
Find and prioritize AI failures.
Diagnose Read the Docs (RTD) build failures from logs.
Fix GitHub Actions failures autonomously.
Debug and stabilize Playwright tests.
Categorize and prevent recurring failures.
Structure LLM failure modes.
Fix broken CI runs fast.
Structured, AI-assisted test debugging workflow.
Debug and resolve Playwright test failures.
Debug and fix failing Playwright tests.
Harden Text2SQL prompts against regressions.
Fix CI failures automatically.