Proxy Routes

All Candela proxy components (candela-local, candela-server, candela-sidecar) expose the same route structure.

Provider Routes

Route	Provider	Format	Auth
`/proxy/google/*`	Google Gemini	Native Gemini API	ADC
`/proxy/openai/*`	OpenAI	OpenAI Chat Completions	API key
`/proxy/anthropic/*`	Anthropic (via Vertex AI)	Messages API	ADC

OpenAI-Compatible Endpoint

candela-local exposes an LM Studio-compatible endpoint on :1234:

Route	Description
`GET /v1/models`	List all available models (local + cloud merged)
`POST /v1/chat/completions`	Chat completions (auto-routes to correct backend)

Health Check

GET /healthz

Returns 200 OK with {"status": "ok"}.

Management API (candela-local only)

Route	Description
`GET /_local/`	Management UI
`GET /_local/api/traces`	Recent traces (default: 50, max: 200)
`GET /_local/api/health`	Runtime health status
`POST /_local/api/pull`	Pull a model from Ollama
`POST /_local/api/start`	Start the runtime backend
`POST /_local/api/stop`	Stop the runtime backend

Example: Query Traces

curl http://localhost:8181/_local/api/traces?limit=20

{
  "spans": [
    {
      "span_id": "abc123...",
      "model": "llama3.2:3b",
      "provider": "local",
      "input_tokens": 150,
      "output_tokens": 42,
      "total_tokens": 192,
      "cost_usd": 0.0,
      "duration_ms": 1230,
      "status": "ok",
      "timestamp": "2026-04-19T22:15:00Z"
    }
  ],
  "count": 1
}

Headers

Request Headers

Header	Description
`Authorization`	`Bearer <token>` — Required for candela-server
`traceparent`	W3C Trace Context — enables unified trace trees
`tracestate`	W3C Trace State — forwarded to upstream

Response Headers

Header	Description
`X-Candela-Span-Id`	Span ID for this proxy request
`X-Candela-Trace-Id`	Trace ID (matches caller’s if `traceparent` was provided)