Searching protocol for "inference speed"
Accelerate LLM inference speed.
Accelerate LLM fine-tuning & inference.
Shrink LLMs, boost inference speed.
Accelerate transformer training & inference.
Integrate Groq API, achieve ultra-fast AI inference.