Skip to content

IDE & Developer Tool Integration

Candela acts as a transparent LLM proxy — any tool that lets you configure a custom base_url or API endpoint can route through Candela for full observability.

Most AI-powered IDEs and tools talk to LLMs via the OpenAI-compatible API (/v1/chat/completions, /v1/models). candela exposes this on port :1234 — just point your tool there.

IDE / Tool candela Provider
┌──────────┐ `:1234` ┌──────────────────────┐ ADC/key ┌──────────┐
│ Cline │ ──────────▶ │ /v1/chat/completions │ ──────────▶ │ Gemini │
│ Zed │ │ /v1/models │ │ OpenAI │
│ IntelliJ │ │ │ │ Ollama │
│ OpenCode │ │ trace + cost │ │ Claude │
└──────────┘ └──────────────────────┘ └──────────┘

What you get automatically:

  • ⏱️ Latency and TTFB per request
  • 📊 Token counts (input / output / total)
  • 💰 Cost calculation per request
  • 🔗 Trace correlation (if the tool supports traceparent)
  • 📋 Full request/response content capture

For any tool that supports a custom OpenAI-compatible endpoint:

SettingValue
Base URLhttp://localhost:1234/v1
API Keycandela (or any string — candela handles real auth)
ModelAny model from GET /v1/models
Terminal window
# Verify it works
curl http://localhost:1234/v1/models | jq '.data[].id'

This returns all available models — local (Ollama) and cloud (Gemini, Claude) merged.

After configuring your IDE, check the management UI:

  1. Open http://localhost:8181/_local/
  2. Navigate to the Traces card
  3. Send a prompt from your IDE
  4. You should see the trace appear with model, tokens, cost, and duration