Searching protocol for "expert-selection"
Structured expert debates that converge.
Enterprise RL for large MoE models.