Skill Explorer

Searching protocol for "instruction tuning"

fine-tuning-with-trl

Community

Align LLMs with human preferences via RL.

Advanced

bygagan114662

fine-tuning-with-trl

Community

Align LLMs with human preferences using RL.

Advanced

byzhuangbiaowei

sft

Community

Accelerate SFT with Unsloth optimizations.

Advanced

byatrawog

fine-tuning-with-trl

Community

Align LLMs with human preferences.

Advanced

bykwasi-cpu

fine-tuning-with-trl

Official

Align LLMs with human preferences.

Advanced

byOrchestra-Research

ai-llm-development

Community

Develop LLMs with modern fine-tuning and evaluation.

Advanced

byvasilyu1983

fine-tuning-with-trl

Community

Align LLMs with human preferences.

Advanced

byMesferAli

LLM Tuning Patterns

Community

Master LLM fine-tuning techniques.

Advanced

byHermeticOrmus

fine-tuning-with-trl

Community

Align LLMs with human preferences.

Advanced

byihatesea69

copilot-docs

Community

Tune Copilot with repo-specific instructions.

No Config

bynesihaver-IL

fine-tuning-with-trl

Community

Align LLMs with human preferences via RL.

Advanced

bytianhao909

ai-tuning

Community

Tune AI assistants for peak performance.

Advanced

byzircote