Drop-in OpenAI-compatible gateway to eleven frontier models. Same code. Same SDKs. Half the cost — and none of the hidden prompt tax other routers quietly add to your invoice.
Eleven frontier models. One endpoint. Half the bill.
One base_url swap. Keep your SDK, keep your code. cURL-friendly. OpenAI-compatible. Zero new abstractions.
Real-time spend, per-key caps, per-seat ceilings, single invoice. See exactly where the line item drops the moment you switch.
→ dashboard.slashed.proOpus is in. Codex Max is in. Codex 5.5 is in. Same response shape. Half the per-token rate, every call. See the rate card →
A unified gateway across Anthropic, Google, and OpenAI — not the headline, the plumbing. One OpenAI-compatible endpoint. One canonical ID per model. Half the rack price.
If your code talks to OpenAI, it already talks to SLASHED. Change the URL. Change the key prefix. Keep everything else.
# Paying full rack rate from openai import OpenAI client = OpenAI( base_url="https://api.openai.com/v1", api_key="sk-...", ) client.chat.completions.create( model="gpt-5.4", messages=[...] )
# Half the bill, same call from openai import OpenAI client = OpenAI( base_url="https://api.slashed.pro/v1", api_key="sl-...", ) client.chat.completions.create( model="gpt-5.4", messages=[...] )
Full reference at api.slashed.pro — endpoints, error codes, rate limits, every supported model ID.
Same OpenAI-compatible spec. We just didn't break it. Six contract tenets we hold ourselves to — verifiable against the public OpenAPI spec and the live endpoint.
content field. Auxiliary fields never become undocumented substitutes for the customer-visible answer. Boring is a feature.502 Bad Gateway. Your retry loop hammers them. You can't tell their bug from yours.400 client errors. Upstream 5xx codes are reserved for actual upstream failures. Your retry, alerting, and incident logic just works.Half the bill. All the receipts. None of the excuses.
qua- key →
— See your 30-day savings in 60 seconds. No card. No phone call.
Works with Claude Code, Codex, Cursor, Factory Droid, OpenCode, raw OpenAI SDKs. Zero vendor abstractions to learn.
TLS 1.3 in transit. Prompts and completions live only as long as the request. No retention, no training, no leaks.
Live token counter, per-key caps, per-seat ceilings, daily digests. The dashboard mirrors the API, line for line.
Email hits engineers who read traces and ship fixes — no ticket maze, no tier-1 hand-off. Incidents get owners, timelines, postmortems. Not vibes. Not vanished threads.
Connect any sl-, qua-, sk-, or or- key. The dashboard imports your last 30 days of usage, projects what you would have paid direct, and shows the delta in real time. CFO-shareable, board-shareable, audit-trail-clean.
— Begin · free 1M tokens to migrate · no card
Get your sl- key →