Enterprise LLM Gateway Implementation Skill
Category: Community / Software Engineering
Provider-agnostic LLM gateway with routing, caching, and guardrails.
Tags: #routing #observability #enterprise #guardrails #multi-provider #gateway #semantic-cache
Author: ngoquytuan
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill enables building a production-grade multi-provider LLM gateway with a unified OpenAI-compatible API.
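As a sketch of what "OpenAI-compatible" means in practice, the helper below wraps arbitrary provider output in the standard chat-completion response envelope. The function name is illustrative and not taken from this Skill's code; only the envelope keys follow the OpenAI response schema:

```python
import time
import uuid

def to_openai_chat_response(model: str, content: str) -> dict:
    """Wrap a provider's raw completion text in the OpenAI
    chat-completion response envelope, so every backend looks
    identical to gateway clients."""
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": model,
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": content},
                "finish_reason": "stop",
            }
        ],
    }
```

Because every provider's output is normalized into this one shape, clients can switch models by changing only the `model` field of their request.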
Core Features & Use Cases
- Intelligent routing across providers (OpenAI, Claude, Gemini, local models) with fallback, exponential backoff retry, and load balancing.
- Semantic caching plus simple exact-match caching to cut redundant provider calls, reducing cost and latency.
- Guardrails including PII detection, content safety checks, and a plugin-based architecture for extensibility.
- Cost tracking and per-tenant budget enforcement for real-time cost visibility.
- Observability with structured logging, Prometheus metrics, and Grafana-ready dashboards for monitoring.
- Use Case: Enterprise chatbots requiring high reliability, compliance, and cost discipline at scale.
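The routing behaviour listed above (priority order, retries with exponential backoff, fallback to the next provider) can be sketched in a few lines. The provider callables and parameter names are hypothetical stand-ins for the Skill's real provider adapters:

```python
import random
import time

def call_with_fallback(providers, prompt, max_retries=3, base_delay=0.5):
    """Try each provider in priority order; retry transient failures
    with exponential backoff plus jitter before falling through to
    the next provider in the list."""
    last_error = None
    for provider in providers:
        for attempt in range(max_retries):
            try:
                return provider(prompt)
            except Exception as exc:  # in practice, catch provider-specific transient errors
                last_error = exc
                time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
        # This provider exhausted its retries; fall back to the next one.
    raise RuntimeError("all providers failed") from last_error
```

A real gateway would also weight providers for load balancing and track per-provider health, but the retry-then-fallback skeleton is the core of the reliability story.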
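The two-tier cache described above (exact match first, then semantic similarity) can be illustrated with a minimal in-memory version. The `SemanticCache` class is illustrative only, assuming any `embed` callable that returns a fixed-length numeric vector; production deployments would use a real embedding model and a vector store:

```python
import hashlib
import math

class SemanticCache:
    """Exact-match lookup by prompt hash first; on a miss, fall back to
    nearest-neighbour search over stored embeddings with a cosine
    similarity threshold."""

    def __init__(self, embed, threshold=0.9):
        self.embed = embed
        self.threshold = threshold
        self.exact = {}     # sha256(prompt) -> response
        self.entries = []   # (embedding, response) pairs

    @staticmethod
    def _cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(y * y for y in b))
        return dot / (na * nb) if na and nb else 0.0

    def get(self, prompt):
        key = hashlib.sha256(prompt.encode()).hexdigest()
        if key in self.exact:
            return self.exact[key]
        vec = self.embed(prompt)
        best = max(self.entries, key=lambda e: self._cosine(vec, e[0]), default=None)
        if best is not None and self._cosine(vec, best[0]) >= self.threshold:
            return best[1]
        return None

    def put(self, prompt, response):
        key = hashlib.sha256(prompt.encode()).hexdigest()
        self.exact[key] = response
        self.entries.append((self.embed(prompt), response))
```

The exact-match tier answers repeated identical prompts at hash-table cost, while the semantic tier catches paraphrases at the price of one embedding call per lookup.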
Quick Start
Review the implementation guidance in SKILL.md, install dependencies from requirements.txt, then run the FastAPI gateway locally with docker-compose as described in the deployment guide.
Dependency Matrix
Required Modules: None required
Components: Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: Enterprise LLM Gateway Implementation Skill
Download link: https://github.com/ngoquytuan/thietKeHeThongchatbot/archive/main.zip#enterprise-llm-gateway-implementation-skill
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.