AI Gateway

Change Your Base URL. Keep Your Code.

Palveron Gateway is OpenAI-compatible. Point your existing client at the Gateway URL and every prompt, response, and tool call is governed — without touching application code.

OpenAI-compatible. One URL swap, full governance.

bash
curl https://gateway.palveron.com/v1/chat/completions \ -H "Authorization: Bearer $PALVERON_API_KEY" \ -H "Content-Type: application/json" \ -d '{"model":"gpt-4o","messages":[{"role":"user","content":"Hello"}]}'
Full documentation on docs.palveron.com

The Challenge

AI Without a Gateway Is AI Without Control

Teams call AI providers directly from dozens of services. No central policy, no audit trail, no kill switch.

Scattered Access

Every microservice, every developer, every tool calls AI providers independently. No single place to enforce rules.

No Audit Trail

When regulators ask what your AI did last Tuesday, you need an answer. Direct API calls leave no governance trace.

Latency Concerns

Adding governance shouldn't slow your application. Traditional proxy solutions add 50–200ms. That's unacceptable.

Three Integration Paths

Pick the Path That Fits Your Stack

Every path lands traffic in the same governance engine. Start with the proxy in five minutes, graduate to the SDK when you need richer metadata, or use Browser Guard for the endpoint.

Recommended

Gateway Proxy

Five-minute onboarding

OpenAI-compatible drop-in. Change your base URL, keep your existing code. No SDK, no framework changes — works with any LLM client that speaks OpenAI.

  • Zero code changes
  • Works with any OpenAI-compatible client
  • Streaming and tool-calls supported
See Gateway capabilities

SDK Integration

Rich metadata, type safety

Eight published packages across TypeScript, Python, Go, and Java. Attach agent identity, session context, and policy hints natively in your application code.

  • Type-safe clients
  • Built-in retry and error handling
  • Trace metadata enrichment
Read SDK reference

Browser Guard

Endpoint governance

For employees using ChatGPT, Claude, Copilot, Gemini and four more in the browser. Palveron AI Browser Guard enforces the same policies on the device — no proxy, no SDK, no infrastructure change.

  • Seven supported browsers
  • MDM-deployable (Intune · SCCM · Jamf · Workspace)
  • Same policies, same trace store
Explore Browser Guard

OpenAI-compatible curl example

Swap api.openai.com → gateway.palveron.com. Everything else stays the same.

bash
curl https://gateway.palveron.com/v1/chat/completions \ -H "Authorization: Bearer $PALVERON_API_KEY" \ -H "Content-Type: application/json" \ -d '{"model":"gpt-4o","messages":[{"role":"user","content":"Hello"}]}'

TypeScript · @palveron/sdk-typescript

npm install @palveron/sdk-typescript

typescript
import OpenAI from "openai"; const client = new OpenAI({ baseURL: "https://gateway.palveron.com/v1", apiKey: process.env.PALVERON_API_KEY, }); const completion = await client.chat.completions.create({ model: "gpt-4o", messages: [{ role: "user", content: "Hello" }], });

Request Lifecycle at a Glance

Every request enters through one of four paths, passes through the governance pipeline, and exits with a complete trace.

ProxySDKGuardMCPGOVERNANCE GATEWAYAuthenticateEvaluate PoliciesCreate TracePASSED89%MODIFIED8%BLOCKED3%AI PROVIDERSOpenAI · AnthropicGoogle · AzureAvg Latency<2msDual AuthAPI-Key + JWTMulti-TenantOrg → ProjectFail ModeOpen / Closed

Key Capabilities

Enterprise-Grade AI Gateway

OpenAI-Compatible Proxy

Drop-in for /v1/chat/completions, /v1/embeddings, and tool-calls. Streaming supported. Forwards to OpenAI, Anthropic, Google, Azure, or any compatible upstream.

Dual Authentication

API-Key plus JWT verification on every request. Organization and project isolation enforced at the network level.

Rust-Powered Performance

The gateway core is written in Rust with Axum. Sub-2ms governance overhead. No JVM cold starts, no garbage collection pauses.

Complete Trace

Every request creates a governance trace: prompt, response, policy evaluations, actions taken. Searchable, filterable, auditable.

Configurable Fail Mode

Choose fail-open (pass through if the gateway is unreachable) or fail-closed (block traffic). Per-project override, hot-reloadable.

Self-Hosted or Cloud

Docker Compose for quick start, Helm Charts for Kubernetes. Or use the managed cloud, your data stays in EU (Hetzner, Germany).

Getting Started

Live in Under 5 Minutes

1

Point traffic to the gateway

Replace your AI provider URL with the Gateway endpoint — the OpenAI-compatible proxy needs no other changes. Or use one of the four SDKs for richer trace metadata. Both work side-by-side.

2

Define policies

Create guardrails per agent, department, or role. Start with built-in templates for PII detection, content safety, and cost limits.

3

Monitor everything

Every AI interaction is traced, evaluated, and visible in real-time. Optionally anchor critical decisions to the blockchain.

Integrations

Works with Your Stack

AI Providers
OpenAI · Anthropic · Google · Azure · Mistral · any OpenAI-compatible upstream
SDKs
TypeScript · Python · Go · Java — published on npm, PyPI, pkg.go.dev, Maven Central
Frameworks
LangChain · CrewAI · OpenAI Agents · Microsoft AGT · OpenClaw / MCP
Deployment
Docker Compose · Helm Charts · Managed Cloud (EU, Hetzner)

Start Governing AI Traffic Today

Community tier is free forever. No credit card required. Upgrade when your team needs more.

Free Community tier
Sub-2ms latency
Self-hosted available