Copilot Adapter Kit

Any model. Every provider. One picker.
Plugin-based provider mesh for GitHub Copilot Chat.

Copilot Adapter Kit brings any model into GitHub Copilot Chat — OpenAI, Anthropic (native Messages API), Ollama, LM Studio, vLLM, Groq, Fireworks, Together AI, DeepSeek, or your own self-hosted endpoint. All at once, side-by-side in the model picker.

Agent mode. Tool calling. Streaming. Vision fallback. Thinking blocks. Rate-limit retry. Error mapping. Request diagnostics. All built in.

Getting Started

Quickstart

// settings.json
{
  "copilot-adapter-kit.providers": {
    "openai": { "baseUrl": "https://api.openai.com/v1" }
  }
}

Cmd+Shift+P → Copilot Adapter Kit: Set API Key → pick openai → paste your key
Click CAK in the status bar → Add Model → enter model id, name, family
Cmd+Shift+I → Copilot Chat → pick your model from the dropdown
Chat.

Open the Settings Panel

Click CAK in the status bar, or run Copilot Adapter Kit: Open Panel from the command palette.

The panel has tabs:

Providers — add/edit/remove API endpoints
Models — add/edit/remove models visible in the picker
Keys — manage API keys per provider
Configuration — log level, vision fallback, prompts, tool stabilization
Request Dumps — full payload logs for debugging

Consumer Guide

Providers

Add one or more providers. Each provider needs a family name and a base URL.

In the panel: Providers tab → choose family → enter base URL → Add.

Each provider gets its own API key. Set keys in the Keys tab or via Copilot Adapter Kit: Set API Key.

Provider Families

Family	Default Base URL
`openai`	`https://api.openai.com/v1`
`deepseek`	`https://api.deepseek.com/v1`
`groq`	`https://api.groq.com/openai/v1`
`fireworks`	`https://api.fireworks.ai/inference/v1`
`together`	`https://api.together.xyz/v1`
`openrouter`	`https://openrouter.ai/api/v1`
`mistral`	`https://api.mistral.ai/v1`
`xai`	`https://api.x.ai/v1`
`ollama`	`http://localhost:11434/v1`
`lmstudio`	`http://localhost:1234/v1`
`vllm`	`http://localhost:8000/v1`
`custom`	(you define it)

Model Aliases

If your provider uses different model names than what you want in the picker:

{
  "copilot-adapter-kit.providers": {
    "ollama": {
      "baseUrl": "http://localhost:11434/v1",
      "modelAlias": {
        "llama3-8b": "llama3.1:8b-instruct-q8_0"
      }
    }
  }
}

The picker shows llama3-8b but the API receives llama3.1:8b-instruct-q8_0.

Models

All models are user-defined. Add them in the Models tab or via copilot-adapter-kit.models in settings.

Field	Required	Description
`id`	✅	Model ID sent to the API (e.g. `gpt-5.2`, `deepseek-chat`)
`family`	✅	Must match a provider family name
`name`	—	Display name in the picker. Defaults to `id`.
`maxIn`	—	Max input tokens. Default `128000`.
`maxOut`	—	Max output tokens. Default `16384`.
`image`	—	Model supports vision. Default `true`.
`thinking`	—	Model supports reasoning tokens. Default `false`.
`toolCalling`	—	Max parallel tool calls. Default `128`.
`apiPath`	—	Per-model API path override (e.g. `/responses`).
`visionFallback`	—	Per-model vision fallback. Format: `family:modelId`.
`pricing`	—	Cost display. Use structured format for best results.

Pricing Display

Pricing appears in two places — the model picker subtitle and the model details panel (click the model in the picker).

Structured format (recommended) — parsed into separate Input / Output / Cache rows in the details panel:

in $0.14 / out $0.28 / cache $0.0028
$0.14/$0.28 (cache $0.0028)
$0.14 → $0.28 | cache $0.0028

Simple format — shown as a single row:

$0.14/$0.28

In the picker subtitle it always shows as a compact badge: 💰 $0.14 → $0.28 | cache $0.0028/1M

Vision Fallback

When a model doesn't support images, CAK can route images through a vision-capable model and send the text description to your primary model.

How it works

CAK reports to VS Code that the model can handle images (so attachments aren't blocked)
When you attach an image, the bridge detects it
The image is sent to a vision fallback model for description
The text description replaces the image before reaching your primary model

Configuration

Global fallback model — set in the Configuration tab:

Vision Fallback dropdown: pick any Copilot or CAK model
Always preprocess images through fallback checkbox: forces fallback for all models when on

Per-model fallback — set in the Models tab when adding/editing a model:

Vision Fallback dropdown: overrides the global setting for that specific model

Trigger conditions

Fallback always	Model `image` flag	Behavior
OFF (default)	`true`	Images sent directly to model
OFF (default)	`false`	Fallback runs automatically
ON	any	Fallback always runs

Thinking Models & Reasoning

Models with thinking: true support reasoning tokens. VS Code shows a glow animation for thinking blocks.

CAK persists the chain-of-thought across conversation turns. When a thinking model responds, its reasoning is captured and re-injected as reasoning_content in the next API call. This preserves reasoning context.

Configure reasoningEffort (None / High / Max) per-model in VS Code's model configuration dropdown.

Prompts

Customize the system prompt and user message template in the Configuration tab:

System Prompt — injected as the first message. Supports placeholders: {model}, {date}, {tools}, {cakVersion}.
User Prompt Template — wraps user messages. Use {userMessage} as the placeholder.

Logging & Diagnostics

Level	What you get
`quiet`	Nothing in output channel (default)
`meta`	Request fingerprints, message diffs, cache trace
`dump`	Meta + full request payloads written to disk

Access logs via Copilot Adapter Kit: Show Logs. View dumps via Copilot Adapter Kit: Open Dumps Folder.

Tool Stabilization

When you see "tool list is unstable" warnings, enable stabilizeTools in the Configuration tab. This pre-activates VS Code tools to lock the tools array across conversation turns, improving cache prefix stability.

Commands

All commands available via Cmd+Shift+P under Copilot Adapter Kit:.

Command	Description
Open Panel	Full settings UI — providers, models, keys, config
Set API Key	Store a provider API key in OS keychain
Clear API Key	Remove a provider's API key
Add Provider	Step-by-step wizard
Remove Provider	Cascade deletes provider + models + key
Add Model	Step-by-step form with dropdowns
Remove Model	Pick a model to remove
Open Settings	Jump to raw JSON settings
Show Logs	Open the output channel
Open Dumps Folder	Reveal request dumps in Finder

Architecture

Copilot Chat (VS Code)
       │
       ▼
  CopilotBridge
  Model → family → provider config + key → engine
  Vision fallback, prompt templates, tool stabilization
       │
       ▼
  Pipeline (Interceptor Chain)
  RateLimitGuard → ErrorWarden → DiagTracer
       │
       ▼
  ProviderDiscovery (Engine Registry)
  OpenAIEngine (OpenAI-compat) | AnthropicEngine (native Messages API)
       │
       ▼
  Provider API

Design Patterns

Pattern	Where
SPI	`Engine` interface — every backend implements it
IoC	`Context` — single bootstrapper wires all services
AOP	`Pipeline` — interceptor chain wraps every engine call
Factory	`ProviderDiscovery` — register engines, lookup at runtime
Strategy	Per-family `ProviderConfig` with optional model aliases

Developer Guide

Prerequisites

nvm use 22           # Node ≥22
npm install           # Install dependencies
npm run watch         # Compile + watch

Project Structure

src/
├── entry.ts                          # VS Code activate/deactivate
├── kernel/
│   ├── context.ts                    # IoC container — boots everything
│   ├── vault.ts                      # OS keychain per-family API keys
│   ├── tuning.ts                     # Typed settings accessors (providers, models, visionFallback, prompts)
│   └── families.ts                   # 13 provider family presets with default URLs
├── conduit/
│   ├── copilot-bridge.ts             # LanguageModelChatProvider — main request handler
│   ├── model-catalog.ts              # Model metadata registry + VS Code capability reporting
│   └── replay.ts                     # Chain-of-thought stash/replay for thinking models
├── mesh/
│   ├── contract.ts                   # Engine SPI, Payload, Envelope, StreamEvents, ToolDef
│   ├── discovery.ts                  # Provider registry (AnthropicEngine + OpenAIEngine × families)
│   ├── pipeline.ts                   # AOP interceptor chain (RateLimitGuard → ErrorWarden → DiagTracer)
│   └── engines/
│       ├── anthropic/
│       │   ├── anthropic-engine.ts   # Native Anthropic Messages API — SSE streaming
│       │   └── anthropic-wire-format.ts  # Envelope[] → Anthropic request (system, tool_use, tool_result)
│       └── openai/
│           ├── openai-engine.ts      # OpenAI Chat Completions — SSE streaming (used by 12 families)
│           └── openai-wire-format.ts # VS Code messages → OpenAI JSON (forgeEnvelopes, forgeTools)
├── crosscut/
│   ├── rate-limit-guard.ts           # 429 auto-retry ×3 with thinking block progress
│   ├── error-warden.ts               # HTTP + network error → friendly chat messages
│   ├── diag-tracer.ts                # Request logging, fingerprinting, JSON dumps, vision audit
│   ├── insight-engine.ts             # Request fingerprint hashing & diff detection
│   └── tool-stabilizer.ts            # Tool pre-activation to lock tools array across turns
├── panel/
│   └── SettingsPanel.ts              # Webview panel — builds state + handles config save messages
└── tooling/
    └── token-math.ts                 # Approximate token counting
media/
└── settings-panel.html               # Full settings UI — Providers, Models, Keys, Config, Dumps, Danger Zone

Key Concepts

CopilotBridge (conduit/copilot-bridge.ts) — the single LanguageModelChatProvider VS Code sees. Flow:

Resolves selected model → metadata (ModelMeta) + family
Loads provider config (base URL) and API key from the vault
Applies custom system prompt and user message template if configured
Detects image parts → runs vision fallback if configured (global or per-model)
Builds the abstract Payload (Envelope[] + ToolDef[])
Runs it through the interceptor pipeline to the engine
Streams tokens, thinking blocks, and tool calls back to VS Code
Stashes chain-of-thought for thinking models on completion

ModelCatalog (conduit/model-catalog.ts) — loads user-defined models from copilot-adapter-kit.models. Reports capabilities to VS Code:

imageInput: true when: model supports images natively, OR per-model visionFallback is set, OR global visionFallbackModel is set
imageInput: false only when no fallback is configured → VS Code shows native warning

Settings Panel (panel/SettingsPanel.ts + media/settings-panel.html) — full webview UI with tabs:

Providers — add/edit/remove provider endpoints with family presets (auto-fills default URLs)
Models — add/edit/remove models; fields for id, name, family, context window, API path, vision, thinking, tools, pricing, per-model vision fallback
Keys — set/clear API keys per provider, stored in OS keychain
Configuration — log level, vision fallback model (grouped dropdown: Copilot + CAK models), always-preprocess toggle, system prompt, user prompt template, tool stabilization
Request Dumps — list of saved JSON payloads from dump log level
Danger Zone — reset all settings, hide built-in models

Vision Fallback Flow:

User attaches image
        │
        ▼
VS Code checks imageInput capability (set by ModelCatalog)
  ─ true → image reaches bridge
  ─ false → VS Code blocks image at UI level
        │
        ▼
Bridge: _hasImageParts() detects image parts
        │
        ▼
shouldFallback = hasImages && hasFallbackModel && (visionFallbackAlways || !modelSupportsImages)
        │
        ▼
_applyVisionFallback() routes each image:
  ─ copilot:* → _describeViaCopilot() (native vscode.lm.selectChatModels)
  ─ family:* → _describeViaEngine() (routes through CAK engine, non-streamed)
        │
        ▼
Image replaced with text description in message payload
        │
        ▼
Primary model receives text-only messages

Thinking Stash/Replay:

When a thinking model responds, chain-of-thought is captured and packed as a binary x-cak/chain message part (magic header + compressed payload). On the next turn, openai-wire-format.ts detects the stash via unpackStash() and injects reasoning_content into the API payload. Preserves reasoning context across conversation turns.

Adding a New Engine

Two approaches depending on the provider's API:

OpenAI-compatible (Groq, Fireworks, Together, DeepSeek, Ollama, LM Studio, etc.) — no code needed. Just add a family in families.ts and it auto-routes through OpenAIEngine.

Non-OpenAI API (example: native Anthropic) — implement the Engine SPI:

Create wire format in src/mesh/engines/{family}/{family}-wire-format.ts:

export function toMyApiRequest(payload: Payload): MyApiRequest {
  // Convert abstract Envelope[] + ToolDef[] → provider-specific JSON
}

Create engine in src/mesh/engines/{family}/{family}-engine.ts:

import { Engine, Payload, StreamEvents } from '../../contract';

export class MyEngine implements Engine {
  readonly family = 'myfamily';
  configure(endpoint: string, key: string): void { ... }
  async stream(req: Payload, sink: StreamEvents, signal?: AbortSignal): Promise<void> {
    // Call toMyApiRequest(req), fetch(), stream SSE, emit sink events
  }
}

Register in src/mesh/discovery.ts:

this.register(new MyEngine());

Add family preset in src/kernel/families.ts (optional).

Adding a New Interceptor

import type { Interceptor } from '../mesh/pipeline';

export class MyInterceptor implements Interceptor {
  async intercept(payload, engine, sink, signal, next) {
    await next();  // Call the chain
  }
}

Key Design Decisions

Two-engine architecture. AnthropicEngine for native Anthropic Messages API; OpenAIEngine for all 12 OpenAI-compatible families. Both implement the same Engine SPI.
Abstract wire format. Bridge produces abstract Envelope[] + ToolDef[]; each engine's wire format translates to provider-specific JSON.
Per-family API keys in OS keychain. No shared keys, no fallback.
Compile-time engine registration. Engines registered in ProviderDiscovery, not from config. Prevents arbitrary code execution.
Inline error reporting. Errors rendered as LanguageModelTextPart in chat, not thrown as exceptions.
Thinking blocks use LanguageModelThinkingPart (proposed API) with ID cak-thinking.
Fully awaited async interceptor chains. Critical for 429 retry correctness.

Settings Reference

All settings under copilot-adapter-kit.*.

`providers`

{
  "copilot-adapter-kit.providers": {
    "openai": {
      "baseUrl": "https://api.openai.com/v1",
      "name": "My OpenAI",
      "defaultApiPath": "/chat/completions",
      "modelApiPaths": { "codex-5.3": "/responses" },
      "modelAlias": { "gpt-4o": "gpt-4o-2024-08-06" },
      "visionFallback": "openai:gpt-5.2"
    }
  }
}

`models`

Array of model definitions. See Models section for schema.

`visionFallbackModel`

Global vision fallback model. Format: family:modelId or just modelId. Empty = disabled.

`visionFallbackAlways`

Default: false. When true, always preprocess images through fallback regardless of model's image flag.

`maxTokens`

Max output tokens. 0 = no limit (provider default applies).

`logLevel`

quiet (default), meta, or dump.

`stabilizeTools`

Default: false. Pre-activates tools to stabilize the tools array.

`systemPrompt`

Custom system prompt template. Placeholders: {model}, {date}, {tools}, {cakVersion}.

`userPromptTemplate`

User message wrapper. Use {userMessage} placeholder.

`hiddenCustomModels`

Managed by the Panel UI when you hide models. No JSON editing needed.

Troubleshooting

"Model does not support images" warning in chat

VS Code shows this when imageInput: false. CAK reports imageInput: true if a vision fallback is configured. If you still see this:

Set a vision fallback model (global or per-model)
Reload the extension host after changes

Image sends but model returns errors

Check that your vision fallback model actually supports images. If it's also text-only, CAK shows [image — vision fallback failed] in the response. Use a known vision-capable model as fallback (e.g. copilot:gpt-5.2 or openai:gpt-5.2).

Model not showing in picker

Model's family must match a key in providers
API key must be set for that family
Reload window after adding

"No API key configured" warning

Run Copilot Adapter Kit: Set API Key and select the provider.

"No baseUrl configured" error

Add the provider to copilot-adapter-kit.providers with a valid baseUrl.

429 Rate Limit

CAK auto-retries 3 times with exponential backoff. If it persists, reduce request frequency or upgrade your provider tier.

Connection refused (Ollama/LM Studio)

Verify the local server is running:

curl http://localhost:11434/v1/models   # Ollama
curl http://localhost:1234/v1/models    # LM Studio

Copilot Adapter Kit

salilvnair

Copilot Adapter Kit

Getting Started

Quickstart

Open the Settings Panel

Consumer Guide

Providers

Provider Families

Model Aliases

Models

Pricing Display

Vision Fallback

How it works

Configuration

Trigger conditions

Thinking Models & Reasoning

Prompts

Logging & Diagnostics

Tool Stabilization

Commands

Architecture

Design Patterns

Developer Guide

Prerequisites

Project Structure

Key Concepts

Adding a New Engine

Adding a New Interceptor

Key Design Decisions

Settings Reference

providers

models

visionFallbackModel

visionFallbackAlways

maxTokens

logLevel

stabilizeTools

systemPrompt

userPromptTemplate

hiddenCustomModels

Troubleshooting

"Model does not support images" warning in chat

Image sends but model returns errors

Model not showing in picker

"No API key configured" warning

"No baseUrl configured" error

429 Rate Limit

Connection refused (Ollama/LM Studio)

License

`providers`

`models`

`visionFallbackModel`

`visionFallbackAlways`

`maxTokens`

`logLevel`

`stabilizeTools`

`systemPrompt`

`userPromptTemplate`

`hiddenCustomModels`