Search results for "efficient inference"
Fast, memory-efficient LLM inference with vLLM.
CPU-first LLM inference on non-NVIDIA hardware.
Master complex models with Bayesian workflow.
Geospatial Active Inference Framework
RNN+Transformer hybrid AI.
RNN+Transformer for efficient LLM inference.
Efficient RNN+Transformer AI models.
Slash LLM inference costs.
Efficient model inference on any hardware.
Optimize LLMs for efficient inference.
O(n) sequence models for efficient AI.
Optimize LLM inference for speed and cost efficiency.
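Several entries above (RNN+Transformer hybrids, "O(n) sequence models") refer to linear-time recurrent formulations of sequence mixing. A minimal sketch of the idea, not tied to any specific library: each token is folded into a fixed-size state, so processing a length-n sequence costs O(n), versus O(n^2) for full pairwise attention. The function name and decay parameter here are illustrative assumptions.

```python
def linear_recurrent_mix(tokens, decay=0.9):
    """Per-position outputs from a decayed running sum.

    The recurrent state has fixed size regardless of sequence length,
    so the whole pass is O(n) -- the property the taglines advertise.
    """
    state = 0.0   # fixed-size recurrent state (decayed sum of history)
    norm = 0.0    # running normalizer, so each output is a weighted average
    outputs = []
    for x in tokens:
        state = decay * state + x     # fold the new token into the state
        norm = decay * norm + 1.0
        outputs.append(state / norm)  # decayed average over the prefix
    return outputs

out = linear_recurrent_mix([1.0, 2.0, 3.0])
```

Full attention would compare every pair of positions; here each step touches only the constant-size state, which is what makes such models attractive for long-context, low-cost inference.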