The Challenge
AI Without a Gateway Is AI Without Control
Teams call AI providers directly from dozens of services. No central policy, no audit trail, no kill switch.
Scattered Access
Every microservice, every developer, every tool calls AI providers independently. No single place to enforce rules.
No Audit Trail
When regulators ask what your AI did last Tuesday, you need an answer. Direct API calls leave no governance trace.
Latency Concerns
Adding governance shouldn't slow your application. Traditional proxy solutions add 50–200ms. That's unacceptable.
Three Integration Paths
Pick the Path That Fits Your Stack
Every path lands traffic in the same governance engine. Start with the proxy in five minutes, graduate to the SDK when you need richer metadata, or use Browser Guard to cover browser-based AI tools on employee devices.
Gateway Proxy
OpenAI-compatible drop-in. Change your base URL, keep your existing code. No SDK, no framework changes — works with any LLM client that speaks OpenAI.
- No code changes beyond the base URL
- Works with any OpenAI-compatible client
- Streaming and tool calls supported
SDK Integration
Eight published packages across TypeScript, Python, Go, and Java. Attach agent identity, session context, and policy hints natively in your application code.
- Type-safe clients
- Built-in retry and error handling
- Trace metadata enrichment
Browser Guard
For employees using ChatGPT, Claude, Copilot, Gemini and four more in the browser. Palveron AI Browser Guard enforces the same policies on the device — no proxy, no SDK, no infrastructure change.
- Seven supported browsers
- MDM-deployable (Intune · SCCM · Jamf · Workspace)
- Same policies, same trace store
OpenAI-compatible curl example
Swap api.openai.com → gateway.palveron.com. Everything else stays the same.
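As a rough sketch of that swap in client code: the two hosts and the /v1/chat/completions path come from this page, while the helper name buildChatRequest, the Bearer authorization header, the placeholder key, and the model name are illustrative assumptions rather than documented Palveron behavior.

```typescript
// Hosts taken from the text above; everything else is illustrative.
const OPENAI_BASE_URL = "https://api.openai.com";
const GATEWAY_BASE_URL = "https://gateway.palveron.com";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: builds an OpenAI-style chat completion request.
// Only the base URL differs between the direct call and the gateway call.
function buildChatRequest(
  baseUrl: string,
  apiKey: string,
  messages: ChatMessage[],
): PreparedRequest {
  return {
    url: `${baseUrl}/v1/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        // Assumption: the key travels as a Bearer token, as OpenAI clients send it.
        Authorization: `Bearer ${apiKey}`,
      },
      // Assumption: the model name is a placeholder.
      body: JSON.stringify({ model: "gpt-4o-mini", messages }),
    },
  };
}

const messages: ChatMessage[] = [{ role: "user", content: "Hello" }];
const direct = buildChatRequest(OPENAI_BASE_URL, "YOUR_API_KEY", messages);
const viaGateway = buildChatRequest(GATEWAY_BASE_URL, "YOUR_API_KEY", messages);

// To send the request: await fetch(viaGateway.url, viaGateway.init);
```

The two prepared requests are identical apart from the host, which is the point of a drop-in proxy: headers, body, and path stay untouched.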
TypeScript · @palveron/sdk-typescript
npm install @palveron/sdk-typescript
Request Lifecycle at a Glance
Every request enters through one of the integration paths, passes through the governance pipeline, and exits with a complete trace.
Key Capabilities
Enterprise-Grade AI Gateway
OpenAI-Compatible Proxy
Drop-in replacement for /v1/chat/completions and /v1/embeddings, including tool calls. Streaming supported. Forwards to OpenAI, Anthropic, Google, Azure, or any compatible upstream.
Dual Authentication
API key plus JWT verification on every request. Organization and project isolation enforced at the network level.
Rust-Powered Performance
The gateway core is written in Rust with Axum. Sub-2ms governance overhead. No JVM cold starts, no garbage collection pauses.
Complete Trace
Every request creates a governance trace: prompt, response, policy evaluations, actions taken. Searchable, filterable, auditable.
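To make "searchable, filterable" concrete, here is a minimal sketch of what a trace record and a filter over it could look like. The four content fields mirror the list above (prompt, response, policy evaluations, actions taken); every type name, field name, and the findTraces helper are hypothetical and do not describe the actual trace store schema or API.

```typescript
// Hypothetical shapes; names are assumptions, not the product's schema.
type PolicyOutcome = "allow" | "flag" | "block";

interface PolicyEvaluation {
  policy: string;
  outcome: PolicyOutcome;
}

interface GovernanceTrace {
  id: string;
  timestamp: string; // ISO 8601, UTC
  prompt: string;
  response: string | null; // null when the request was blocked
  policyEvaluations: PolicyEvaluation[];
  actions: string[];
}

interface TraceFilter {
  policy?: string;
  outcome?: PolicyOutcome;
  from?: string; // inclusive lower bound, ISO 8601
  to?: string; // exclusive upper bound, ISO 8601
}

// Returns the traces that have at least one evaluation matching the
// policy/outcome criteria and whose timestamp falls inside [from, to).
function findTraces(traces: GovernanceTrace[], filter: TraceFilter): GovernanceTrace[] {
  return traces.filter((trace) => {
    const time = Date.parse(trace.timestamp);
    if (filter.from !== undefined && time < Date.parse(filter.from)) return false;
    if (filter.to !== undefined && time >= Date.parse(filter.to)) return false;
    return trace.policyEvaluations.some(
      (evaluation) =>
        (filter.policy === undefined || evaluation.policy === filter.policy) &&
        (filter.outcome === undefined || evaluation.outcome === filter.outcome),
    );
  });
}

const sampleTraces: GovernanceTrace[] = [
  {
    id: "t1",
    timestamp: "2025-01-07T09:00:00Z",
    prompt: "Summarize this meeting",
    response: "Summary text",
    policyEvaluations: [{ policy: "pii-detection", outcome: "allow" }],
    actions: [],
  },
  {
    id: "t2",
    timestamp: "2025-01-07T10:30:00Z",
    prompt: "Customer record with card number",
    response: null,
    policyEvaluations: [{ policy: "pii-detection", outcome: "block" }],
    actions: ["blocked"],
  },
];

const blocked = findTraces(sampleTraces, { policy: "pii-detection", outcome: "block" });
```

A query like this is the kind of answer an auditor asks for: which requests a given policy stopped, and when.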
Configurable Fail Mode
Choose fail-open (pass through if the gateway is unreachable) or fail-closed (block traffic). Per-project override, hot-reloadable.
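The fail-mode decision with a per-project override can be sketched as follows. This is an illustration of the behavior described above, not the gateway's implementation: the config shape, the function names, and the project identifiers are all assumptions.

```typescript
type FailMode = "fail-open" | "fail-closed";
type Route = "gateway" | "direct" | "blocked";

// Hypothetical config shape: one default plus optional per-project overrides.
interface FailModeConfig {
  defaultMode: FailMode;
  projectOverrides: Partial<Record<string, FailMode>>;
}

// A project override wins over the default when one is set.
function effectiveFailMode(config: FailModeConfig, projectId: string): FailMode {
  return config.projectOverrides[projectId] ?? config.defaultMode;
}

// Decides where a request goes given the mode and gateway reachability.
function routeRequest(mode: FailMode, gatewayReachable: boolean): Route {
  if (gatewayReachable) return "gateway";
  // Gateway unreachable: fail-open passes traffic through, fail-closed blocks it.
  return mode === "fail-open" ? "direct" : "blocked";
}

const failModeConfig: FailModeConfig = {
  defaultMode: "fail-open",
  projectOverrides: { payments: "fail-closed" },
};

const paymentsRoute = routeRequest(effectiveFailMode(failModeConfig, "payments"), false);
const marketingRoute = routeRequest(effectiveFailMode(failModeConfig, "marketing"), false);
```

In this sketch a sensitive project such as the hypothetical "payments" stops sending traffic during an outage, while other projects keep working without governance until the gateway is reachable again.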
Self-Hosted or Cloud
Docker Compose for a quick start, Helm charts for Kubernetes. Or use the managed cloud, where your data stays in the EU (Hetzner, Germany).
Getting Started
Live in Under 5 Minutes
Point traffic to the gateway
Replace your AI provider URL with the Gateway endpoint; the OpenAI-compatible proxy needs no other changes. Or use one of the four language SDKs for richer trace metadata. Both work side by side.
Define policies
Create guardrails per agent, department, or role. Start with built-in templates for PII detection, content safety, and cost limits.
Monitor everything
Every AI interaction is traced, evaluated, and visible in real time. Optionally anchor critical decisions to the blockchain.
Integrations