Searching protocol for "ncu"
Optimize diffusion model GPU kernels.
Find and fix GPU kernel performance bottlenecks.
Master CUDA kernel development and profiling.
Optimize diffusion model inference speed.
Visualize and manage disk space usage.
Surface TUI skills to speed terminal UIs.