Searching protocols for "serving-models"
Deploy TensorFlow models to production.
Run and fine-tune LLMs on Apple Silicon with MLX.
Serverless Python compute with automatic scaling and GPUs.
Track ML experiments and manage models.
Deploy LLMs with GPU inference servers.
Orchestrate ML workflows from data to deployment.
Accelerate LLM inference and serving.
High-throughput LLM serving with vLLM (see the sketch after this list).
Accelerate LLM inference on NVIDIA GPUs
Master ML lifecycle management.
Local LLM inference and management.
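To make the vLLM entry concrete, here is a minimal offline batch-inference sketch using vLLM's LLM and SamplingParams API. The model id "facebook/opt-125m" is only an illustrative choice, not something named in these results; any Hugging Face model id works.

# Minimal vLLM offline inference sketch (assumption: vLLM is installed
# and "facebook/opt-125m" stands in for whatever model you serve).
from vllm import LLM, SamplingParams

prompts = ["The capital of France is"]
sampling_params = SamplingParams(temperature=0.8, max_tokens=32)

llm = LLM(model="facebook/opt-125m")  # loads weights; downloads on first run
outputs = llm.generate(prompts, sampling_params)

for out in outputs:
    # Each RequestOutput holds one or more completions; print the first.
    print(out.outputs[0].text)

For production serving rather than offline batching, vLLM also ships an OpenAI-compatible HTTP server (vllm serve <model>), which is the usual path for the high-throughput deployment these results describe.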