Skip to content
| Marketplace
Sign in
Visual Studio Code>AI>Kimi Language Model Provider for CopilotNew to Visual Studio Code? Get it now.
Kimi Language Model Provider for Copilot

Kimi Language Model Provider for Copilot

Denizhan Dakılır

|
718 installs
| (0) | Free
Registers Kimi (Moonshot AI) models as VS Code language model providers for Copilot Chat
Installation
Launch VS Code Quick Open (Ctrl+P), paste the following command, and press enter.
Copied to clipboard
More Info

Kimi LM Copilot Provider

VS Code language model provider that connects Copilot Chat to Moonshot Kimi. Just enter your API key from kimi.com/code/console and start chatting with the model in VS Code Copilot Chat/Agent.

If you want to compile yourself

  1. Install dependencies: npm install
  2. Compile: npm run compile
  3. Run npx vsce package to create the .vsix file.
  4. Load the extension in VS Code.
  5. Configure apiKey from kimi.com/code/console in the model provider settings.

API

  • Base URL: https://api.kimi.com/coding/v1
  • Chat endpoint: /chat/completions
  • Full endpoint used by the client: https://api.kimi.com/coding/v1/chat/completions

Request Fields

The client sends:

  • model
  • messages
  • stream
  • thinking
  • top_p (optional)
  • max_tokens (optional)
  • tools (optional)
  • stop (optional)
  • prompt_cache_key (optional, from metadata.taskId)

Notes:

  • temperature is not sent by this extension. I am letting API handle it by itself. Chinese providers are better at handling these stuff by themselves, and I don't want to mess with it.
  • safety_identifier is not sent.

Default Headers

Currently, Kimi wouldn't accept ANY clients so we need to send extra headers to larp as an accepted client.

Each request includes:

  • Content-Type: application/json
  • Authorization: Bearer <apiKey>
  • User-Agent: KimiCLI/<version>
  • X-Msh-Platform: kimi_cli
  • X-Msh-Version: <version>
  • X-Msh-Device-Name: <hostname>
  • X-Msh-Device-Id: <generated device id>

Streaming and Tools

  • Streaming responses are consumed as SSE (data: lines).
  • Tool calls are collected from streamed deltas and emitted when the model finishes with tool_calls.
  • Contact us
  • Jobs
  • Privacy
  • Manage cookies
  • Terms of use
  • Trademarks
© 2026 Microsoft