Chutes AI Provider for GitHub Copilot Chat

Leverage Chutes.ai open-source models — including DeepSeek, Qwen, GLM and Kimi — directly within VS Code's GitHub Copilot Chat. The full Chutes catalogue is discovered automatically, with streaming, tool calling (agent mode) and vision. No GitHub Copilot subscription required.

⚡ Quick Start
- Install the extension from the VS Code Marketplace.
- Open VS Code's Chat view.
- Open the model picker and select Manage Models….
- Choose Chutes AI as the provider.
- Paste your Chutes API key (starts with
cpk_, get one at chutes.ai).
- Select the models you want to use. 🎉
You can also set the key anytime via Chutes AI: Manage API Key in the Command Palette.
✨ Features
- Automatic model discovery — the full Chutes catalogue is fetched from the API; nothing to maintain by hand.
- Native chat integration — models appear in Ask, Edit and Agent modes; tool-capable models light up agent mode.
- Vision — models that accept image input can read images attached to a chat.
- Streaming — responses stream token by token and honour cancellation.
- Secure key storage — your API key lives in VS Code SecretStorage (OS keychain), never in settings.
- Configurable filtering — narrow the picker to just the models you care about.
- Usage & spend in chat — ask
@chutes /usage in the chat panel to see your Chutes spend and quotas.

Requirements
- VS Code 1.104.0 or newer (the language model provider API). VS Code 1.125+ also lets you discover this extension from the Language Models editor via Install Model Providers.
- A Chutes API key (starts with
cpk_). Create one at chutes.ai.
Settings
| Setting |
Default |
Description |
chutes.endpoint |
https://llm.chutes.ai/v1 |
OpenAI-compatible API base URL. Change only for self-hosted or proxy endpoints. |
chutes.modelFilter |
(empty) |
Restrict which models appear. Comma-separated terms matched against the model id as a case-insensitive substring or regex (e.g. deepseek, qwen or Qwen3.*TEE). Empty shows all chat models. |
chutes.requestTimeoutMs |
15000 |
Timeout (ms) for fetching the model list. Does not limit streaming responses. |
Commands
| Command |
Description |
Chutes AI: Manage API Key |
Set, update or clear your API key. |
Chutes AI: Refresh Models |
Re-fetch the model list (e.g. after Chutes adds models). |
Usage & spend in chat
Type @chutes in the chat input to check your Chutes account without leaving the editor:
@chutes /usage — spend for the current billing windows (monthly cap and 4-hour window) plus your daily request quota.
@chutes /quota — per-model quotas.
It uses the same API key you configured for the provider. Note: VS Code does not let third-party providers display live spend inside Copilot's own usage UI, so this surfaces it as an on-demand chat reply.
Privacy
Your prompts and attachments are sent to the Chutes API to generate responses — that is the purpose of the extension. The API key is stored in VS Code SecretStorage and is never written to settings or logs. The extension collects no telemetry, analytics, or personal data.
🛠️ Development
git clone https://github.com/TheStreamCode/chutes-model-provider-vscode
cd chutes-model-provider-vscode
npm install
npm run compile
Press F5 to launch an Extension Development Host. Run npm test for the unit tests.
📚 Resources
Trademarks
Chutes, the Chutes logo, and related names and marks are trademarks of their respective owner and are used with permission. This extension's own source code is licensed under the MIT License.
Support & License