Skill Explorer

Searching protocol for "RMSNorm"

diffusion-kernel

Official

Optimize diffusion model inference speed.

Advanced

bymoirai-internal

diffusion-kernel

Official

Optimize diffusion model inference speed.

Advanced

bysgl-project

normalization-techniques

Community

Stabilize deep networks, accelerate training.

Few Config

bytachyon-beep

training-mlps

Community

Train modular MLP backbones with Flax NNX.

Few Config

byyonesuke

cuda-kernels

Community

Speed up CUDA kernels for Diffusers.

Advanced

byrdromer2

extract-kernel-definitions

Official

Automate GPU kernel schema extraction.

Advanced

byflashinfer-ai

h100-diffusers-kernels

Community

Boost CUDA kernels for Diffusers on H100

Advanced

byburtenshaw

cuda-kernels

Official

Optimize NVIDIA GPU kernels for AI models.

Advanced

byhuggingface

add-reference-tests

Official

Automate reference test generation for kernel validation.

Advanced

byflashinfer-ai

diffusion-kernel

Community

Optimize diffusion model GPU kernels.

Advanced

byguqiong96

diffusion-kernel

Community

Optimize diffusion model kernels

Advanced

byrayleizhu

model-architect

Community

Design transformer architectures.

Advanced

byRachasumanth