lds-optimization

Community

Boost GPU kernel LDS performance.

Author: fsx950223
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill optimizes Local Data Share (LDS) access patterns in GPU kernels, diagnosing and resolving performance bottlenecks like bank conflicts and latency stalls.
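To make "bank conflict" concrete, here is a minimal sketch of how the conflict degree of one LDS access could be estimated from the addresses each lane touches. It is an illustration, not the Skill's own analysis code: the bank count (32), the bank width (4 bytes), and the helper name `conflict_degree` are assumptions chosen for the example, and real hardware may issue a wavefront in several passes.

```python
from collections import defaultdict

NUM_BANKS = 32   # assumption: 32 LDS banks
BANK_WIDTH = 4   # assumption: each bank serves one 4-byte word per cycle


def conflict_degree(byte_addresses):
    """Return the largest number of distinct words that land in one bank.

    1 means conflict-free; N means the access is serialized N ways in this
    simplified model. Lanes reading the same word count once (broadcast).
    """
    words_per_bank = defaultdict(set)
    for addr in byte_addresses:
        word = addr // BANK_WIDTH
        words_per_bank[word % NUM_BANKS].add(word)
    return max(len(words) for words in words_per_bank.values())


# 32 lanes reading consecutive 4-byte floats: one word per bank.
row_access = [lane * 4 for lane in range(32)]

# 32 lanes reading one column of a 32x32 float tile: the stride is
# 32 words, so every lane maps to the same bank.
col_access = [lane * 32 * 4 for lane in range(32)]

print(conflict_degree(row_access))
print(conflict_degree(col_access))
```

In this model the strided column access is the pattern worth fixing, since every lane competes for a single bank.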

Core Features & Use Cases

  • Diagnose LDS Bottlenecks: Analyzes trace data to identify issues with LDS access, bank conflicts, and write-read latency.
  • Apply Optimizations: Implements solutions like swizzle layouts, padding, and increasing write-read distance to improve performance.
  • Use Case: When a GPU kernel is slowed by LDS (shared memory) access, this Skill identifies whether the cause is bank conflicts or write-read latency and applies the matching fix.
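The padding and swizzle layouts named above can be compared with a small host-side model. This is a sketch under stated assumptions rather than the Skill's implementation: it assumes 32 banks of one word each and a 32x32 tile of 4-byte elements, and the function names (`plain`, `padded`, `swizzled`, `max_conflict`) are hypothetical.

```python
NUM_BANKS = 32   # assumption: 32 LDS banks, one 4-byte word wide
TILE = 32        # assumption: 32x32 tile of 4-byte elements


def plain(row, col):
    """Row-major word index with no adjustment."""
    return row * TILE + col


def padded(row, col):
    """Row-major with one padding word per row, shifting each row by one bank."""
    return row * (TILE + 1) + col


def swizzled(row, col):
    """XOR swizzle: permute columns within each row, using no extra memory."""
    return row * TILE + (col ^ row)


def max_conflict(word_indices):
    """Largest number of distinct words that map to a single bank."""
    counts = {}
    for word in set(word_indices):
        bank = word % NUM_BANKS
        counts[bank] = counts.get(bank, 0) + 1
    return max(counts.values())


def column_read(layout, col):
    """Word indices touched when 32 lanes each read one row of a column."""
    return [layout(row, col) for row in range(TILE)]


for name, layout in [("plain", plain), ("padded", padded), ("swizzled", swizzled)]:
    print(name, max_conflict(column_read(layout, 0)))
```

Padding costs one extra word per row of LDS, while the XOR swizzle keeps the footprint unchanged but requires the same index transform on both the write and the read side.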

Quick Start

Use the lds-optimization skill to analyze and fix LDS bottlenecks in the provided kernel trace data.

Dependency Matrix

Required Modules

None required

Components

references, scripts

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: lds-optimization
Download link: https://github.com/fsx950223/claude-stuff/archive/main.zip#lds-optimization

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.