reward-shaping-engineering

Community

Master reward design with safe shaping.

Authortachyon-beep
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a structured framework for designing reward functions in reinforcement learning that preserve the optimal policy while accelerating training and reducing reward hacking.

Core Features & Use Cases

  • Potential-based shaping that preserves policy
  • Anti-hacking penalties and robust validation workflows
  • Guidance on sparse vs dense rewards, normalization, and clipping
  • Inverse RL considerations and distribution-shift testing

Quick Start

Implement a potential-based shaping function and combine it with the main task reward, then run validation tests across environment variants to ensure policy preservation and improved learning speed.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: reward-shaping-engineering
Download link: https://github.com/tachyon-beep/hamlet/archive/main.zip#reward-shaping-engineering

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.