Optimize PyTorch distributed linear layers.
Megatron-Core: 3D parallelism for huge LLMs.
Accelerate LLM inference on NVIDIA GPUs.
10-100x faster LLM inference on NVIDIA GPUs.
Scale LLM pretraining with 4D parallelism.
Train LLMs with advanced parallelism.
Megatron-LM skills for agents
Scale LLM training with advanced parallelism.
Run on-device ML in React Native apps.
Scale distributed inference across GPUs.
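The tensor parallelism these tools target can be illustrated with a minimal single-process sketch, assuming the Megatron-style column-parallel scheme for a linear layer: the weight matrix is split column-wise across ranks, each rank computes a partial output, and the shards are gathered back together. The shapes, rank count, and NumPy stand-in here are illustrative only; a real implementation distributes the shards across GPUs.

```python
import numpy as np

# Hypothetical sizes for illustration.
rng = np.random.default_rng(0)
d_in, d_out, world_size = 8, 12, 4

x = rng.standard_normal((2, d_in))       # activations, replicated on every rank
W = rng.standard_normal((d_in, d_out))   # full weight of the linear layer

# Each "rank" owns one column shard of W.
shards = np.split(W, world_size, axis=1)

# Local matmuls; in a real run these execute concurrently on different GPUs.
partials = [x @ W_i for W_i in shards]

# Concatenating the partials (an all-gather in a distributed run)
# reproduces the unsharded result exactly.
y_parallel = np.concatenate(partials, axis=1)
y_full = x @ W
assert np.allclose(y_parallel, y_full)
```

Because each rank holds only `d_out / world_size` output columns, both the weight memory and the local matmul cost shrink linearly with the number of ranks, at the price of one collective to reassemble the output.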