Searching the protocol for "quantization"
Optimize PyTorch models with INT8 quantization.
Compress LLMs without calibration data.
Reduce VRAM usage and speed up vLLM-Omni.
Compress LLMs for faster inference.
8-bit/4-bit quantization for memory-efficient LLMs.
Quantize LLMs without calibration data.
Quantize LLMs fast, no calibration needed.
Compress LLMs with HQQ.
Efficient model inference on any hardware.
Compress LLMs with HQQ: Fast, no calibration.
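Several of these entries describe 8-bit/4-bit weight quantization for memory-efficient LLM inference. As a rough illustration only, here is a minimal sketch of that kind of workflow using the Hugging Face transformers BitsAndBytesConfig API; it is not the API of any specific tool listed above, and the model ID is a placeholder assumption.

```python
# Minimal sketch: load an LLM with 4-bit weight quantization via bitsandbytes.
# Assumes `transformers`, `accelerate`, and `bitsandbytes` are installed and a GPU is available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize linear-layer weights to 4 bits
    bnb_4bit_quant_type="nf4",              # NormalFloat4 quantization data type
    bnb_4bit_compute_dtype=torch.float16,   # dequantize to fp16 for matmuls
)

model_id = "facebook/opt-1.3b"              # placeholder model ID, not from the listing above
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("Quantization reduces memory by", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```

Loading weights in 4-bit roughly quarters VRAM use relative to fp16, at some cost in accuracy; the calibration-free tools above (e.g. HQQ) aim at the same memory savings without a calibration dataset.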