Skip to content
| Marketplace
Sign in
Visual Studio Code>AI>OmniCopilotNew to Visual Studio Code? Get it now.
OmniCopilot

OmniCopilot

浅忆QanYi

|
57 installs
| (0) | Free
Use models from multiple LLM platforms (DeepSeek, Kimi, Qwen, MiniMax, GLM, Volcengine Plan, Custom) in GitHub Copilot Chat as language model providers.
Installation
Launch VS Code Quick Open (Ctrl+P), paste the following command, and press enter.
Copied to clipboard
More Info

OmniCopilot

中文版

A VS Code extension that lets you use models from multiple LLM platforms in GitHub Copilot Chat as language model providers.

Supported Providers

Provider Vendor ID Models
DeepSeek deepseek deepseek-v4-flash, deepseek-v4-pro
Bigmodel Plan (GLM) zhipu GLM-5.1, GLM-5-Turbo, GLM-4.7, GLM-4.5-Air
Moonshot (Kimi) moonshot kimi-for-coding
Qwen qwen qwen3.6-plus, qwen3-max, qwen3.5-flash, qwen3-coder-plus
MiniMax minimax MiniMax-M2.7, MiniMax-M2.7-highspeed, MiniMax-M2.5
Volcengine Plan volcengine doubao-seed-2.0-code, doubao-seed-2.0-pro, doubao-seed-2.0-lite, doubao-seed-code, minimax-m2.5, glm-4.7, deepseek-v3.2, kimi-k2.5
Custom custom-openai Any OpenAI-compatible model

Tested & Working

The following platforms have been tested and confirmed working:

  • DeepSeek Open Platform (platform.deepseek.com)
  • Kimi Code (Kimi coding model)
  • MiniMax Token Plan (China) (platform.minimaxi.com)
  • Qwen / Alibaba DashScope (dashscope.aliyuncs.com)
  • Bigmodel Plan (open.bigmodel.cn Coding API)

TODO

  • [ ] Test Volcengine Plan
  • [ ] Test Qwen Coding Plan
  • [ ] Test Kimi Open Platform
  • [ ] Support SiliconFlow
  • [ ] Support MiniMax International
  • [ ] Support GLM International
  • [ ] Support SiliconFlow International
  • [ ] Verify thinking effort levels (DeepSeek None/High/Max; others None/Low/Medium/High or None/On) actually take effect across providers
  • [ ] To be continued…

Features

  • Multiple Providers: Access models from major LLM platforms plus any OpenAI-compatible endpoint
  • Per-Model Thinking Effort: Hover any thinking-capable model in the Copilot picker to pick the effort level for the next turn — no need to flip a global switch
    • DeepSeek V4 menu: None / High / Max (matches the V4 API's reasoning_effort domain)
    • 4-level menu (None / Low / Medium / High) for Qwen reasoning models
    • 2-level menu (None / On) for models that only expose a thinking on/off knob (GLM, Kimi, MiniMax, Volcengine reasoning models)
  • Thinking UI: Models with reasoning capabilities show collapsible thinking sections via LanguageModelThinkingPart
  • Vision Support: Vision-capable models (kimi-for-coding, qwen3.6-plus) can read images attached in Copilot Chat
  • Tool Calling: Function calling support for compatible models
  • Custom Model IDs: Add custom model IDs to any provider via Settings or command palette

Usage

  1. Install the extension
  2. Open Copilot Chat → Manage Models → Add Model
  3. Select a provider and enter your API key
  4. Start chatting with the selected model

Custom Model IDs

You can add custom model IDs to any provider:

  • Command Palette: Run OmniCopilot: Add Custom Model ID
  • Settings: Edit omniCopilot.<vendor>.customModelIds array in settings.json

Configuration

Thinking effort is now selected per model, per turn via the Copilot model picker's hover menu — there is no global thinking-effort setting.

Setting Description Default
omniCopilot.contextLength Max input context length (4K–1M presets, or custom) default
omniCopilot.customContextLength Custom max input context (used when contextLength is custom) 131072
omniCopilot.enableVision Enable vision for supported models true
omniCopilot.<vendor>.customModelIds Custom model IDs for each vendor []

Development

Prerequisites

  • Node.js (LTS recommended)
  • VS Code 1.108.0+
  • GitHub Copilot extension installed

Setup

git clone https://github.com/LyaQanYi/OmniCopilot.git
cd OmniCopilot
npm install

Build & Run

# Compile TypeScript
npm run compile

# Watch mode (auto-recompile on changes)
npm run watch

To debug the extension, press F5 in VS Code to launch an Extension Development Host with the extension loaded.

Package .vsix

npx @vscode/vsce package --no-dependencies

Project Structure

src/
├── extension.ts   # Extension entry point, activation & commands
├── provider.ts    # Language model provider implementation
├── api.ts         # API call logic (streaming, thinking, vision)
├── models.ts      # Preset model definitions per vendor
└── types.ts       # Shared TypeScript interfaces

Contributing

Contributions are welcome! Here's how you can help:

  1. Fork the repository
  2. Create a feature branch: git checkout -b feat/my-feature
  3. Commit your changes: git commit -m "feat: add my feature"
  4. Push to the branch: git push origin feat/my-feature
  5. Open a Pull Request

Guidelines

  • Follow existing code style (TypeScript strict mode)
  • Test with at least one provider before submitting
  • Keep commit messages clear and descriptive
  • One feature/fix per PR when possible

Ways to Contribute

  • Test a provider — Pick an untested platform from the TODO list, test it, and report results
  • Add a new provider — Add model definitions in models.ts and register in extension.ts
  • Fix bugs — Check Issues for reported problems
  • Improve docs — Help with documentation or translations

Requirements

  • VS Code 1.108.0+
  • GitHub Copilot extension

Changelog

0.3.0 — 2026-05-08

  • Per-model Thinking Effort picker in Copilot model selector — hover a thinking-capable model and choose effort for the next turn, no global setting needed
    • DeepSeek V4 menu: None / High / Max (matches the V4 API's reasoning_effort domain)
    • 4-level menu (None / Low / Medium / High) for Qwen reasoning models
    • 2-level menu (None / On) for GLM, Kimi, MiniMax, and Volcengine reasoning models
  • DeepSeek model list updated from deepseek-chat / deepseek-reasoner to deepseek-v4-flash / deepseek-v4-pro (1M input, 384K output, both reasoning-capable)
  • Removed global omniCopilot.enableThinking and omniCopilot.thinkingEffort settings, the matching status-bar items, and OmniCopilot: Toggle Thinking Mode / Set Thinking Effort commands — picker covers all cases now
  • Vendor-specific reasoning mapping reworked to handle the full None / On / Low / Medium / High / Max space:
    • DeepSeek: reasoning_effort: high|max when enabled, omitted when None
    • Qwen: enable_thinking + thinking_budget (1024 / 4096 / 16384 tokens; max → 16384)
    • Moonshot: explicit thinking: { type: "enabled"|"disabled" }
    • Volcengine: thinking: { type: "enabled" } only when enabled
    • Zhipu / MiniMax: no API knob, picker only controls output stripping

0.2.0 — 2026-04-30

  • Add user-configurable max input context length cap (presets 4K–1M + custom 1K-2M tokens), shown in status bar

0.1.3 — 2026-04-12

  • Add Volcengine Plan provider with 8 models (doubao-seed, minimax-m2.5, glm-4.7, deepseek-v3.2, kimi-k2.5)
  • Gate reasoning_content field by vendor capability — only send it for DeepSeek, Qwen, Moonshot/Kimi, and Zhipu; avoids request rejection on strict backends (Volcengine, MiniMax, custom)
  • Extract shared buildOpenAIMessages helper to deduplicate message-serialization logic between MultiModelChatProvider and CustomOpenAIProvider
  • Add reasoning_content handling in CustomOpenAIProvider

0.1.2 — 2026-04-09

  • Add Bigmodel Plan (Zhipu) provider with GLM-5.1, GLM-5-Turbo, GLM-4.7, GLM-4.5-Air models
  • Enable thinking capability for Zhipu models
  • Normalize Zhipu model IDs

0.1.1 — 2026-04-07

  • Enable thinking capability for MiniMax vendor
  • Enhance thinking support logic in both MultiModelChatProvider and CustomOpenAIProvider
  • Fix: flush remaining thinking buffer and pending tool calls after stream ends
  • Add MIT LICENSE file
  • Fix repository URL in package.json

0.1.0 — 2026-04-06

  • Initial release
  • Multi-model provider architecture with DeepSeek, Moonshot (Kimi), Qwen, MiniMax support
  • Custom OpenAI-compatible provider for any endpoint
  • Thinking mode with <think> tag parsing and collapsible UI (via LanguageModelThinkingPart)
  • Vision support for image-capable models
  • Tool calling / function calling support
  • Custom model ID management via settings and command palette
  • Configurable thinking effort (low / medium / high)

License

MIT

  • Contact us
  • Jobs
  • Privacy
  • Manage cookies
  • Terms of use
  • Trademarks
© 2026 Microsoft