Search results for "tensor-parallelism"
Optimize PyTorch distributed linear layers.
Scale LLM pretraining with 4D parallelism.
Accelerate LLM inference on NVIDIA GPUs.
Deploy LLMs with Hugging Face TGI.
High-throughput LLM serving with vLLM.
Run distributed LLMs on Apple Silicon with ease.