Should provider keys live in clients or in a gateway?

In a gateway. Putting them in clients spreads the secret across machines you do not fully control and makes rotation painful. Privian holds the provider key (yours, BYOK, or pooled) and exposes a gateway API key to your services.

How does Privian map a model to a provider?

Model IDs are namespaced like openai/gpt-5.5 or anthropic/claude-sonnet-4.5. The gateway resolves the provider from the model and uses the credential bound to that provider. Unknown models are rejected before any upstream call.

Can I fail over between providers?

The current beta does not auto-failover across providers on a single request. You can change the model in your application call. Cross-provider failover is on the roadmap; see Updates.

How to route prompts securely

The patterns that actually work

Secure routing is not one big decision. It is a small set of consistent choices that compound:

One front door — every prompt goes through the same gateway
One credential per provider, held server-side
One naming scheme for models, validated at the edge
One audit trail, with metadata only

One front door

The single biggest reduction in operational risk is making "where do prompts go" have exactly one answer. With Privian that answer is POST /v1/gateway on https://api.privian.io.

Provider credentials

Two patterns work. Both keep the credential off client machines:

Pooled credentials. The gateway holds its own provider keys and you authenticate only with the gateway API key.
BYOK (bring your own key). You upload your provider key once; the gateway encrypts it at rest with AES-GCM and uses it for your requests only. See BYOK.

Model identification

Model IDs are namespaced by provider:

openai/gpt-5.5
openai/gpt-5.5-mini
anthropic/claude-sonnet-4.5
google/gemini-3.5-flash

The gateway resolves the provider from the prefix, applies the right credential, and rejects unknown models before any upstream call. This stops a caller from probing arbitrary providers by crafting model names.

Audit without archive

Every routing decision should be visible after the fact — which model, which provider, how long it took, how many entities were masked. None of that requires storing the prompt body. See Zero retention.

A worked example

curl https://api.privian.io/v1/gateway \
  -H "Authorization: Bearer $PRIVIAN_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "anthropic/claude-sonnet-4.5",
    "prompt": "Draft a reply to jane@example.com",
    "stream": false
  }'

Same gateway, same shape of request, different model — the routing happens inside the gateway. Your application code does not change.

How Privian fits

Privian provides the single front door, the credential management (pooled or BYOK), the model registry, and the audit trail. See LLM Gateway for the product detail.

Written under our editorial principles: implementation-grounded, honest about limitations, educational first.

Frequently asked questions

Should provider keys live in clients or in a gateway?: In a gateway. Putting them in clients spreads the secret across machines you do not fully control and makes rotation painful. Privian holds the provider key (yours, BYOK, or pooled) and exposes a gateway API key to your services.
How does Privian map a model to a provider?: Model IDs are namespaced like openai/gpt-5.5 or anthropic/claude-sonnet-4.5. The gateway resolves the provider from the model and uses the credential bound to that provider. Unknown models are rejected before any upstream call.
Can I fail over between providers?: The current beta does not auto-failover across providers on a single request. You can change the model in your application call. Cross-provider failover is on the roadmap; see Updates.