LLMux is a fully managed cloud API. Use our SDK and we’ll pick the best provider in real time, enforce budgets, surface usage, and give you SLAs — no servers required.
Cloud-hosted · SDKs for JS/TS, Python, Go · Works with OpenAI & Anthropic
import { LLMux } from "@llmux/sdk";
const llm = new LLMux({ apiKey: "sk-…" });
const out = await llm.chat([{ role: "user", content: "Hello" }]);
Everything you need from one cloud API—no infra, no queues, no cronjobs.
Live scoring by latency, failure ratio & cost; budget caps; fairness routing.
Plan-based quotas, per-key rate limits, and simple dashboards.
Key scoping, short-lived logs, region pinning, and BYOK.
OpenAI & Anthropic out of the box; optional connectors for local endpoints.
Per-provider cost/latency metrics and request logs (PII-safe).
Premium plans include uptime SLAs and support.
Measure p50/p95 latency, failure ratio & estimated cost per 1K tokens for each provider.
Enforce per-request and per-plan budgets before routing.
Round-robin across healthy, in-budget providers with transparent fallback.
Start free. Pay as you grow.
* Provider usage is pass-through to your account when BYOK is enabled.
// JavaScript/TypeScript
import { LLMux } from "@llmux/sdk";
const llm = new LLMux({ apiKey: process.env.LLMUX_KEY });
const out = await llm.chat([{ role: "user", content: "Hello" }]);
# Python
from llmux import LLMux
llm = LLMux(api_key=os.getenv("LLMUX_KEY"))
out = llm.chat([{"role":"user","content":"Hello"}])
POST https://api.llmux.app/v1/chat – chat routed to best providerGET https://api.llmux.app/v1/usage – plan & usage summaryGET https://api.llmux.app/v1/providers – health & pricing snapshotGET https://api.llmux.app/v1/keys – list & scope API keysNo. LLMux is fully managed. Install the SDK and call our cloud API.
We don’t retain prompts or outputs by default. Region pinning and short-lived logs are available on paid plans.
Yes. You may route through your own provider accounts while still using LLMux policies and metrics.
Planned for key management and usage summaries. Most users only need the SDK.