reward-shaping-engineering
CommunityMaster reward design with safe shaping.
Education & Research#validation#reinforcement-learning#reward-design#potential-based-shaping#credit-assignment#reward-hacking
Authortachyon-beep
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill provides a structured framework for designing reward functions in reinforcement learning that preserve the optimal policy while accelerating training and reducing reward hacking.
Core Features & Use Cases
- Potential-based shaping that preserves policy
- Anti-hacking penalties and robust validation workflows
- Guidance on sparse vs dense rewards, normalization, and clipping
- Inverse RL considerations and distribution-shift testing
Quick Start
Implement a potential-based shaping function and combine it with the main task reward, then run validation tests across environment variants to ensure policy preservation and improved learning speed.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: reward-shaping-engineering Download link: https://github.com/tachyon-beep/hamlet/archive/main.zip#reward-shaping-engineering Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.