Searching for "bitsandbytes"
Shrink LLMs, boost performance.
Shrink LLMs for less VRAM
Shrink LLMs, boost GPU efficiency.
Fit larger models, faster inference.
Lean, fast model quantization for inference.
8-bit/4-bit quantization for memory-efficient LLMs (see the loading sketch after this list).
Deploy LLMs with TGI (see the client sketch after this list).
Efficiently fine-tune LLMs with PEFT (see the LoRA sketch after this list).
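
The quantization result above points at bitsandbytes' core feature: loading model weights in 8-bit or 4-bit precision to cut VRAM use. A minimal sketch of the common path through the transformers integration follows; the model name and the NF4/bfloat16 settings are illustrative choices, not something the snippets above prescribe.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 4-bit NF4 quantization; bfloat16 compute keeps matmuls fast.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "facebook/opt-1.3b"  # illustrative model choice
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # place layers on available devices automatically
)
```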
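The TGI result refers to Text Generation Inference, Hugging Face's serving stack, which can load quantized weights for deployment. A minimal sketch of querying an already-running TGI server from Python; the local URL, port, and prompt are assumptions for illustration.

```python
from huggingface_hub import InferenceClient

# Assumes a TGI server is already running and reachable at this URL
# (localhost:8080 is an illustrative assumption).
client = InferenceClient("http://localhost:8080")
print(client.text_generation(
    "Explain 4-bit quantization in one sentence.",
    max_new_tokens=64,
))
```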
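The PEFT result refers to parameter-efficient fine-tuning; combined with 4-bit bitsandbytes loading, this is the usual QLoRA-style recipe. A minimal sketch of attaching LoRA adapters with peft; the rank, alpha, and target module names are illustrative assumptions for an OPT-style model.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")  # illustrative

# LoRA adapters on the attention projections; only these small
# matrices are trained, while the base weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # reports the small trainable fraction
```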