Searching protocols for "llm-inference"
Optimize LLM inference batching.
Accelerate LLM inference speed.
Slash LLM inference costs.
High-throughput, cost-aware LLM inference.
Run LLMs locally on Windows with Ollama.
CPU-first LLM inference on non-NVIDIA hardware.
Scale LLM inference on Kubernetes.