Skip to content

Cline

Cline is an autonomous AI coding agent that runs inside VS Code. It supports custom OpenAI-compatible endpoints.

  1. Open Cline settings in VS Code (Cmd+Shift+P → “Cline: Open Settings”)
  2. Set API Provider to OpenAI Compatible
  3. Configure:
SettingValue
Base URLhttp://localhost:1234/v1
API Keycandela
Model IDgemini-3.5-pro (or any model from /v1/models)

Add to your VS Code settings.json:

{
"cline.apiProvider": "openai-compatible",
"cline.openaiBaseUrl": "http://localhost:1234/v1",
"cline.openaiApiKey": "candela",
"cline.openaiModelId": "gemini-3.5-pro"
}

Switch to a local model for free, private AI coding:

{
"cline.openaiModelId": "llama3.2:3b"
}

Candela still traces every request — you get full token counts and latency metrics even for local models (cost will show $0.00).

Continue (another popular VS Code AI extension) works the same way:

{
"models": [
{
"title": "Candela — Gemini Pro",
"provider": "openai",
"apiBase": "http://localhost:1234/v1",
"apiKey": "candela",
"model": "gemini-3.5-pro"
},
{
"title": "Candela — Local Llama",
"provider": "openai",
"apiBase": "http://localhost:1234/v1",
"apiKey": "candela",
"model": "llama3.2:3b"
}
]
}

For deeper integration, install the official candela-cline plugin package. This enables Cline to run custom tools inside its workspace to query session spend, check daily budgets, and ensure proxy health.

  • Session Cost Summaries: Formats a detailed summary of token usage, requests, spend, and cache savings for the active coding session.
  • Live Budget Check: Inspects daily budget limits, current consumption percentage, remaining allowance, and active grant waterfall breakdown (including expiries).
  • Automatic Health Check: Ensures that the local candela daemon is active and responsive before running heavy automated tasks.

Install from npm:

Terminal window
npm install --save-dev candela-cline

You can register these custom tools inside your workspace or configure Cline’s MCP tools. If you are developing extensions or wrappers around Cline, import the methods directly:

import {
getSessionSummary,
getBudgetStatus,
checkCandelaHealth
} from "candela-cline";
// Print summary of the last 2 hours of work
console.log(await getSessionSummary("http://localhost:8181", 2));
// Print budget status progress bar and active grants
console.log(await getBudgetStatus("http://localhost:8181"));
📊 Session Cost Summary (Past 1h)
---------------------------------
Requests: 12 calls
Tokens: 45.2K (28.4K in / 16.8K out)
Cache Savings: 32.4K tokens ($0.12 saved)
Total Spend: $0.18
💰 Daily Budget Status
Daily: [████████████░░░░░░░░] 60% $3.00 / $5.00
🎁 Grant: $10.00 / $10.00 — hackathon sprint (expires May 25)
Total available: $12.00