GLM Chat Provider

Provides Z.AI GLM models as a VS Code Language Model Chat Provider for the Coding Plan.

Model	Context (tokens)	Max Output (tokens)	Tool Calling
GLM-5.1	200K	131K	Yes
GLM-5	205K	131K	Yes
GLM-5-Turbo	200K	131K	Yes
GLM-4.7	205K	131K	Yes
GLM-4.7-Flash	200K	131K	Yes
GLM-4.7-FlashX	200K	131K	Yes
GLM-4.6	205K	131K	Yes
GLM-4.5	131K	98K	Yes
GLM-4.5-Flash	131K	98K	Yes
GLM-4.5-Air	131K	98K	Yes

Model	Context (tokens)	Max Output (tokens)	Image Input	Tool Calling
GLM-5V-Turbo	200K	131K	Yes	Yes
GLM-4.6V	128K	33K	Yes	Yes
GLM-4.5V	64K	16K	Yes	Yes

Commands

For models that support it (GLM-4.5 and above), you can control whether the model uses its reasoning/thinking capability.

Run GLM: Set Thinking Effort from the Command Palette to select the thinking mode.

The selected value is persisted in your VS Code settings under glm-chat-provider.defaultThinkingMode.
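
As a rough illustration of how the persisted value could be read back, here is a minimal TypeScript sketch. It assumes the setting is stored as a string under the key named above. The `ConfigLike` interface, `readThinkingMode`, and `fakeConfig` are hypothetical names introduced for this example and are not part of the extension. The set of valid values is not listed here, so the sketch treats the value as an opaque string, and `"placeholder-mode"` is a made-up stand-in rather than a real option.

```typescript
// Minimal shape of a configuration object; mirrors the `get` method of the
// object returned by vscode.workspace.getConfiguration(...).
// (Hypothetical interface for this sketch.)
interface ConfigLike {
  get<T>(key: string): T | undefined;
}

// Returns the persisted thinking mode, or undefined when nothing usable is stored.
// Assumption: the value is a non-empty string; its valid values are not known here.
function readThinkingMode(config: ConfigLike): string | undefined {
  const value = config.get<unknown>("defaultThinkingMode");
  return typeof value === "string" && value.length > 0 ? value : undefined;
}

// In-memory stand-in so the helper can be exercised outside VS Code.
function fakeConfig(values: Record<string, unknown>): ConfigLike {
  return { get: <T>(key: string) => values[key] as T | undefined };
}

// "placeholder-mode" is a made-up value used only for illustration.
const example = fakeConfig({ defaultThinkingMode: "placeholder-mode" });
console.log(readThinkingMode(example));
```

Inside an extension, the same helper could be given `vscode.workspace.getConfiguration("glm-chat-provider")` in place of the fake, since that object exposes a compatible `get` method.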

Open the Command Palette and run GLM: Set API Key to configure your API credentials.
Open VS Code's Language Model Chat UI and select Z.AI GLM as the provider.

MIT (c) Denizhan Dakilir