Searching protocol for "kl divergence"
Find distributions with specific constraints.
Convergence to Gibbs equilibrium in Langevin dynamics.
Assess reward-hacking risk via prefill analysis.
Compress LLMs, retain performance.
Compress LLMs, transfer knowledge, cut inference costs.