Searching protocols for "batch inference"
Bridge prompts to batch YAML for inference.
Optimize LLM inference batching.
Fast, memory-efficient LLM inference with vLLM.
High-throughput batch inference, made easy.
Boost ML inference speed and efficiency.
Run cloud workloads on Hugging Face.
10-100x faster LLM inference on NVIDIA GPUs.
Master ML deployment strategies.
Apply inferred schema to incomplete notes safely.
Optimize LLM inference for speed and cost efficiency.
Accelerate LLM inference on NVIDIA GPUs.
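
Of the results above, vLLM is the one whose batch-inference API is most widely documented. As a point of reference, a minimal offline batch-inference sketch with vLLM's Python API might look like the following; the model name, prompts, and sampling settings are illustrative placeholders, not drawn from any result above.

```python
# Minimal offline batch-inference sketch using vLLM's public Python API.
from vllm import LLM, SamplingParams

# A batch of prompts to run in one call; contents are placeholders.
prompts = [
    "Summarize the benefits of batch inference.",
    "Explain continuous batching in one sentence.",
    "List two ways to reduce LLM serving cost.",
]

# Near-greedy sampling; tune temperature/top_p for your workload.
sampling_params = SamplingParams(temperature=0.2, top_p=0.95, max_tokens=64)

# vLLM batches and schedules these prompts internally.
llm = LLM(model="facebook/opt-125m")  # placeholder model name

outputs = llm.generate(prompts, sampling_params)
for out in outputs:
    print(f"Prompt: {out.prompt!r}")
    print(f"Completion: {out.outputs[0].text.strip()}\n")
```

Passing all prompts in a single `generate` call, rather than looping one prompt at a time, is what lets the engine keep the GPU saturated, which is the throughput gain the taglines above are advertising.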