Searching protocol for "gptq"
Compress large LLMs to 4-bit for efficient deployment on consumer GPUs.
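A minimal sketch of what this looks like with the Hugging Face `transformers` GPTQ integration; the model id and output directory are placeholders (not part of the original entry), and the `optimum`/`auto-gptq` backends are assumed installed:

```python
# Sketch: 4-bit GPTQ quantization via the transformers integration.
# Assumes `optimum` and `auto-gptq` are installed; model id is a placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "facebook/opt-125m"  # placeholder; swap in the target LLM
tokenizer = AutoTokenizer.from_pretrained(model_id)

# GPTQ calibrates per-layer quantization on a small dataset.
gptq_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)

# Loading with a GPTQConfig quantizes the weights to 4-bit on the fly.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    quantization_config=gptq_config,
)

# The quantized model saves and reloads like any other checkpoint.
model.save_pretrained("opt-125m-gptq-4bit")
tokenizer.save_pretrained("opt-125m-gptq-4bit")
```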
Reduce VRAM usage and speed up inference with vLLM-Omni.
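Assuming vLLM-Omni exposes the same `LLM` entry point as upstream vLLM, loading a GPTQ checkpoint might look like the sketch below; the checkpoint name is a placeholder:

```python
# Sketch assuming a vLLM-compatible API; checkpoint name is a placeholder.
from vllm import LLM, SamplingParams

llm = LLM(
    model="TheBloke/Llama-2-7B-Chat-GPTQ",  # any GPTQ-quantized repo
    quantization="gptq",                    # select the GPTQ kernels
)

params = SamplingParams(temperature=0.7, max_tokens=64)
outputs = llm.generate(["What is GPTQ quantization?"], params)
print(outputs[0].outputs[0].text)
```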
Deploy GPTQ-quantized LLMs with TGI (Text Generation Inference).
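A sketch of querying a running TGI endpoint from Python via `huggingface_hub.InferenceClient`; the endpoint URL and model id are assumptions, and the server launch command is shown only as a comment:

```python
# Sketch: query a TGI server from Python. Assumes the server was started
# separately, e.g. with the official container:
#   docker run --gpus all -p 8080:80 ghcr.io/huggingface/text-generation-inference \
#       --model-id TheBloke/Llama-2-7B-Chat-GPTQ --quantize gptq
from huggingface_hub import InferenceClient

client = InferenceClient("http://localhost:8080")  # assumed local endpoint
reply = client.text_generation(
    "Explain 4-bit quantization in one sentence.",
    max_new_tokens=64,
)
print(reply)
```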