Name: unsloth-training
Author: ScientiaCapital

System Documentation

What problem does it solve?

Fine-tune large language models with Unsloth, using either GRPO reinforcement learning or supervised fine-tuning (SFT). The skill aims to cut memory use and training time while supporting FP8 training, vision fine-tuning, and mobile deployment.

Core Features & Use Cases

GRPO RL training with reward design and LoRA adapters
SFT training with packing and long-context options
FP8 training for significant VRAM savings
Vision fine-tuning for VLM tasks
Docker-based training and reproducible environments
Mobile deployment support through QAT/ExecuTorch export
GGUF export options for various serving backends
End-to-end pipeline from data prep to export and deployment
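
To make the "reward design" item concrete, here is a minimal sketch of two reward functions of the kind a GRPO trainer might combine. It assumes a TRL-style convention in which each reward function receives a list of completion strings (plus extra dataset columns as keyword arguments) and returns one float per completion. The answer format (`#### <number>`), the `answer` column name, and the reward magnitudes are all illustrative assumptions, not something this skill specifies.

```python
import re

# Assumed (hypothetical) answer format: the completion ends with "#### <number>".
ANSWER_RE = re.compile(r"####\s*(-?\d+(?:\.\d+)?)\s*$")


def format_reward(completions, **kwargs):
    """Give a small reward to completions that end with '#### <number>'."""
    return [0.5 if ANSWER_RE.search(c.strip()) else 0.0 for c in completions]


def correctness_reward(completions, answer, **kwargs):
    """Give a larger reward when the extracted number matches the reference.

    `answer` is assumed to be a list of reference answers, one per completion.
    """
    rewards = []
    for completion, reference in zip(completions, answer):
        match = ANSWER_RE.search(completion.strip())
        is_correct = match is not None and float(match.group(1)) == float(reference)
        rewards.append(2.0 if is_correct else 0.0)
    return rewards


if __name__ == "__main__":
    samples = ["2 + 2 is four.\n#### 4", "I think it is 5"]
    print(format_reward(samples))
    print(correctness_reward(samples, answer=["4", "4"]))
```

Splitting format and correctness into separate functions keeps each signal easy to inspect and reweight; a partially correct completion (right format, wrong number) still receives some reward, which tends to give the policy a smoother signal early in training.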

Quick Start

Run the GRPO training script with a small dataset to start RL fine-tuning.
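
The training script and its expected input schema are not shown here, so the following is only a sketch of what "a small dataset" could look like: a handful of arithmetic prompts with reference answers, serialized as JSON Lines. The field names `prompt` and `answer`, and the helper names `make_toy_dataset` and `to_jsonl`, are hypothetical.

```python
import json
import random


def make_toy_dataset(n=8, seed=0):
    """Build n toy addition problems with string reference answers."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    rows = []
    for _ in range(n):
        a, b = rng.randint(1, 20), rng.randint(1, 20)
        rows.append(
            {
                "prompt": f"What is {a} + {b}? End with '#### <number>'.",
                "answer": str(a + b),
            }
        )
    return rows


def to_jsonl(rows):
    """Serialize rows as JSON Lines text, one JSON object per line."""
    return "\n".join(json.dumps(row) for row in rows) + "\n"


if __name__ == "__main__":
    print(to_jsonl(make_toy_dataset(n=3)), end="")
```

A dataset this small is only useful for checking that the pipeline runs end to end; it says nothing about final model quality.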

Please help me install this Skill:
Name: unsloth-training
Download link: https://github.com/ScientiaCapital/skills/archive/main.zip#unsloth-training
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
