AWQ is a 4-bit weight quantization method that compresses LLMs with minimal accuracy loss, reducing VRAM usage for faster, cheaper inference (for example when serving with vLLM-Omni).
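A minimal quantization sketch using the AutoAWQ library is shown below; the model id, output path, and quantization settings are illustrative assumptions, not taken from the text above.

```python
# Sketch: quantize a causal LM to 4-bit AWQ with AutoAWQ (pip install autoawq).
# The model path, output directory, and settings below are example values.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "facebook/opt-125m"      # hypothetical source checkpoint
quant_path = "opt-125m-awq"           # hypothetical output directory
quant_config = {
    "zero_point": True,   # asymmetric quantization with zero points
    "q_group_size": 128,  # group size for per-group scales
    "w_bit": 4,           # 4-bit weights
    "version": "GEMM",    # AWQ kernel variant
}

# Load the fp16 model and its tokenizer.
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

# Run activation-aware calibration and quantize the weights to 4 bits.
model.quantize(tokenizer, quant_config=quant_config)

# Save the compressed checkpoint for later inference.
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```

The saved checkpoint can then be loaded for inference with a fraction of the original VRAM footprint.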
Deploy LLMs with TGI for high-throughput serving.
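A hedged usage sketch for querying a running TGI server from Python follows; the endpoint URL and prompt are assumptions, and a TGI container must already be serving a model at that address.

```python
# Sketch: send a generation request to a local TGI endpoint.
# Assumes a TGI server is listening on http://localhost:8080.
from huggingface_hub import InferenceClient

client = InferenceClient("http://localhost:8080")

# TGI batches concurrent requests server-side, which is where the
# high-throughput serving comes from; this client just issues one request.
output = client.text_generation(
    "Explain 4-bit AWQ quantization in one sentence.",
    max_new_tokens=64,
)
print(output)
```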