Searching protocol for "macro-f1"
Quantifies model performance with robust metrics.
Optimizes prompts for text classification.
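
For reference, the queried metric, macro-F1, is the unweighted mean of the per-class F1 scores, so rare classes count as much as frequent ones. The sketch below is a minimal illustration under stated assumptions, not the implementation behind either result above: the function name `macro_f1` is hypothetical, labels are assumed to be hashable and sortable values, and the class set is assumed to be the union of true and predicted labels.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (hypothetical helper, illustration only)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    # Assumption: score every class seen in either the truth or the predictions.
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)

    return sum(scores) / len(scores) if scores else 0.0
```

Calling `macro_f1(["a", "a", "b", "c"], ["a", "b", "b", "c"])` averages the three per-class scores (2/3, 2/3, and 1) with equal weight, regardless of how many examples each class has.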