Skill Explorer

Searching protocol for "reward-functions"

add-reward

Official

Add custom reward functions for AReaL quickly.

Few Config

by inclusionAI

grpo-rl-training

Community

Master GRPO/RL fine-tuning with TRL.

Advanced

by choice5346

grpo-rl-training

Official

Fine-tune LLMs with custom rewards.

Advanced

by Orchestra-Research

grpo-rl-training

Community

Fine-tune LLMs with custom rewards.

Advanced

by ihatesea69

grpo-rl-training

Community

Master GRPO/RL fine-tuning with TRL.

Advanced

by gagan114662

grpo-rl-training

Community

Fine-tune LLMs with custom rewards for complex tasks.

Advanced

by zhuangbiaowei

grpo-finetuning

Official

GRPO fine-tuning for vision-language models.

Advanced

by aws-solutions-library-samples

grpo-rl-training

Community

Fine-tune models with custom rewards.

Advanced

by Aum08Desai

grpo-rl-training

Community

Fine-tune models with GRPO/RL.

Advanced

by kwasi-cpu

grpo-rl-training

Community

Fine-tune LLMs with custom rewards.

Advanced

by GarrettRoi

rnow-rewards

Official

Define RL rewards for ReinforceNow training.

Advanced

by ReinforceNow

grpo-rl-training

Community

Fine-tune models with custom rewards.

Advanced

by hochoa13