book-sft-pipeline

Community

Train LLMs to write in any author's style.

Content & Communication #fine-tuning #LLM training #LoRA #author voice #style transfer #dataset generation

Author: CxxxxDxxxF

Version: 1.0.0

Installs: 0

System Documentation

What problem does it solve?

This skill automates converting books into datasets for training language models to mimic a specific author's style. It addresses the difficulty of building high-quality style-transfer training data from long-form text.

Core Features & Use Cases

Intelligent Segmentation: Splits books into semantically coherent chunks (150-400 words) at natural boundaries.
Diverse Instruction Generation: Creates varied prompts to prevent model memorization and encourage style learning.
LoRA Training: Facilitates fine-tuning base models using LoRA on platforms like Tinker for efficient style adaptation.
Use Case: You want to train a model to write poetry in the style of Emily Dickinson. This skill will process her collected works, generate training examples, and guide the fine-tuning process.
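The segmentation and instruction-generation features above can be illustrated with a minimal sketch. It is not the skill's actual implementation: the function names (`segment`, `build_examples`), the prompt templates, and the paragraph-packing heuristic are all assumptions made for illustration. Only the 150-400 word target comes from the feature list, and the sketch treats blank-line paragraph breaks as the "natural boundaries".

```python
import random
import re

# Hypothetical prompt templates; the skill's real templates are not shown here.
TEMPLATES = [
    "Write a passage in the style of {author}.",
    "Continue a scene as {author} would write it.",
    "Compose about {n} words of prose in {author}'s voice.",
]


def segment(text, min_words=150, max_words=400):
    """Greedily pack paragraphs into chunks of roughly min_words..max_words.

    Simplification: a single paragraph longer than max_words is kept whole,
    and a short final chunk is merged into the previous one, so the bounds
    are targets rather than guarantees.
    """
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    chunks, current, count = [], [], 0
    for para in paragraphs:
        n = len(para.split())
        # Close the current chunk if adding this paragraph would overshoot.
        if current and count + n > max_words:
            chunks.append("\n\n".join(current))
            current, count = [], 0
        current.append(para)
        count += n
    if current:
        tail = "\n\n".join(current)
        if chunks and count < min_words:
            # Merge a too-short tail into the previous chunk.
            chunks[-1] = chunks[-1] + "\n\n" + tail
        else:
            chunks.append(tail)
    return chunks


def build_examples(chunks, author, seed=0):
    """Pair each chunk with a randomly chosen instruction template."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    examples = []
    for chunk in chunks:
        template = rng.choice(TEMPLATES)
        # Templates without {n} simply ignore the extra keyword argument.
        prompt = template.format(author=author, n=len(chunk.split()))
        examples.append({"instruction": prompt, "output": chunk})
    return examples
```

Varying the template per chunk is one simple way to keep the model from associating a single fixed prompt with the text, which is the memorization concern the feature list mentions. Each resulting dictionary could be written as one line of a JSONL file for a fine-tuning job, though the exact format the skill emits is not stated in this listing.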

Quick Start

Use the book-sft-pipeline skill to fine-tune a model on the provided ePub file to replicate the author's style.

Dependency Matrix

Required Modules

None required

Components

scripts, references, assets

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: book-sft-pipeline
Download link: https://github.com/CxxxxDxxxF/project-blackout/archive/main.zip#book-sft-pipeline

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
