Skill Explorer

Searching protocol for "post-training"

posttrain

Community

Boost post-training engineering robustness.

Advanced

byzzy1127

pipeline

Community

End-to-end post-training automation.

Advanced

bykang-jaehyun

slime-rl-training

Community

Scale LLM post-training with RL.

Advanced

bytianhao909

training-data-curation

Community

Standards for high-quality LLM training data.

Few Config

byM4n5ter

performance-analysis

Official

Analyze MaxText training performance end-to-end.

Advanced

byAMD-AGI

gptq

Community

4-bit quantization for large LLMs on consumer GPUs.

Advanced

byovachiever

llm-fine-tuning

Community

Fine-tune LLMs with modern techniques at scale.

Advanced

bypunkt2

slime-rl-training

Community

RL training for LLMs with Megatron-LM.

Advanced

bykwasi-cpu

gptq

Community

Compress LLMs for consumer GPUs

Advanced

byzhuangbiaowei

verl-rl-training

Community

Scale LLM RL training with flexible backends.

Advanced

byzhuangbiaowei

uv-verl-rl-training

Community

Scale LLM RL training with verl.

Advanced

byuv-xiao

verl-rl-training

Community

Scale LLM RL training with verl.

Advanced

bygagan114662