Enterprise RL for large MoE models.
Lean, fast model quantization for inference.
10-100x faster LLM inference on NVIDIA GPUs.
Accelerate LLM inference on NVIDIA GPUs.