Policy Enforcement
Budget gates, rate limits, and model access controls enforced at the proxy layer in real time. Every policy violation logged to an immutable audit trail.
Policy Enforcement
Budget gates, rate limits, and model access controls enforced at the proxy layer in real time. Every policy violation logged to an immutable audit trail.
Cost Governance
Real-time cost calculation per request, per model, per tenant. Set per-user budgets, create time-bound grants, and get threshold alerts before spend is exceeded.
Full Observability
Every LLM request captured with OpenTelemetry — latency, TTFB, token counts, cache hits, and cost — all in a unified trace tree with W3C Trace Context propagation.
Drop-In Integration
Works with Google ADK, LangChain, Claude Code, Cursor, Zed, OpenCode — anything that talks to an LLM endpoint. Point base_url at Candela and you’re done.
Candela is not a roadmap — these enforcement controls are live in production:
Budget Enforcement
Per-user daily budgets with pre-flight gates. Requests that exceed limits get a 402 before they ever reach the LLM provider.
Rate Limiting
Per-user request throttling prevents runaway automation from draining budgets. Minute-window counters with configurable limits.
Immutable Audit Trail
Every request, every admin action, every policy decision — logged to BigQuery, DuckDB, or SQLite. Tamper-proof by design.
Tenant Isolation
Full multi-tenant attribution via X-Candela-Tenant-Id headers and W3C Baggage. Per-tenant cost tracking, leaderboards, and scoped views.
eBPF Enforcement
Kernel-level traffic interception via iptables redirect, Tetragon process enforcement, and Hubble observability. Zero SDK changes — every LLM request flows through the proxy, guaranteed.