offline-rl
Community
Master offline RL with data-driven conservatism.
Author: tachyon-beep
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This skill enables practitioners to perform offline reinforcement learning from fixed datasets, with no environment interaction. It addresses the core failure modes of the offline setting, distribution shift and extrapolation error, through conservative value estimation and robust off-policy evaluation strategies.
Core Features & Use Cases
- Conservative offline RL methods: CQL, IQL, and BCQ to mitigate overestimation when data is limited or distribution shifts occur.
- Offline training & evaluation: Train purely from logged data and evaluate using offline techniques to estimate policy performance without live rollout.
- Real-world scenarios: Robotics from logged demonstrations, medical treatment policy learning from historical data, and recommendations improved from historical user interactions.
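To make the conservatism idea concrete, the sketch below shows the penalty term at the heart of CQL: push Q-values down on all actions (via a soft maximum) while pushing them up on actions that actually appear in the logged data, so unseen actions are not overestimated. This is a minimal NumPy illustration of the regularizer only, not the skill's implementation or the full CQL algorithm.

```python
import numpy as np

def cql_penalty(q_values, dataset_actions):
    """Conservative Q-Learning regularizer (illustrative sketch only).

    q_values: (batch, num_actions) array of Q estimates.
    dataset_actions: (batch,) indices of actions taken in the logged data.
    """
    # log-sum-exp over actions is a smooth stand-in for max_a Q(s, a)
    soft_max = np.log(np.exp(q_values).sum(axis=1))
    # Q-values of the actions the behavior policy actually chose
    data_q = q_values[np.arange(len(q_values)), dataset_actions]
    # Positive when out-of-dataset actions look better than logged ones
    return float((soft_max - data_q).mean())

# Action 1 in the first row is never in the dataset yet has a high Q,
# so the penalty is large and training would push that estimate down.
q = np.array([[1.0, 5.0],
              [2.0, 2.0]])
penalty = cql_penalty(q, np.array([0, 1]))
```

Added to a standard temporal-difference loss with a tuning weight, this term is what makes the learned values conservative with respect to the dataset.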
Quick Start
Prepare a fixed dataset, choose a method (CQL, IQL, or BCQ), train the model, and perform offline evaluation using RIS or model-based estimates.
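The offline evaluation step can be illustrated with ordinary importance sampling, the simplest off-policy estimator: reweight each logged episode's return by the ratio of evaluation-policy to behavior-policy action probabilities. This is a hedged sketch of the general technique, assuming you can query both policies' action probabilities; it is not the RIS estimator mentioned above.

```python
import numpy as np

def is_estimate(trajectories, pi_e, pi_b):
    """Ordinary importance-sampling value estimate (illustrative sketch).

    trajectories: list of episodes, each a list of (state, action, reward)
                  tuples collected under the behavior policy pi_b.
    pi_e(a, s), pi_b(a, s): action probabilities under the evaluation
                  and behavior policies (hypothetical callables).
    """
    estimates = []
    for episode in trajectories:
        weight, ret = 1.0, 0.0
        for state, action, reward in episode:
            # Cumulative likelihood ratio corrects for the policy mismatch
            weight *= pi_e(action, state) / pi_b(action, state)
            ret += reward
        estimates.append(weight * ret)
    return float(np.mean(estimates))

# Sanity check: when pi_e == pi_b the estimate is just the mean return.
logged = [[(0, 0, 1.0)], [(0, 1, 0.0)]]
uniform = lambda a, s: 0.5
value = is_estimate(logged, uniform, uniform)
```

Plain importance sampling has high variance on long horizons, which is why refinements like RIS or model-based estimates are preferred in practice.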
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: offline-rl Download link: https://github.com/tachyon-beep/hamlet/archive/main.zip#offline-rl Please download this .zip file, extract it, and install it in the .claude/skills/ directory.