extract-kernel-definitions

Official

Automate GPU kernel schema extraction.

Authorflashinfer-ai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the extraction of kernel schemas and definitions from SGLang model implementations, enabling consistent, deduplicated kernel definitions for FlashInfer trace.

Core Features & Use Cases

  • Kernel extraction: Detect and extract kernel definitions (MLA, MoE, GQA, RMSNorm, GEMM) from SGLang models.
  • Deduplication: Prevent duplicate definitions by consolidating shared kernels and tagging model usage.
  • Definition generation: Produce JSON-based kernel definitions suitable for flashinfer_trace and downstream testing.

Quick Start

Run /extract-kernel-definitions --model-name <MODEL_NAME> to scan the model and generate new definitions under flashinfer_trace/definitions, then review and deduplicate if needed.

Dependency Matrix

Required Modules

SGLangFlashInfer

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: extract-kernel-definitions
Download link: https://github.com/flashinfer-ai/flashinfer-bench/archive/main.zip#extract-kernel-definitions

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.