Cerebras VS Code Extension
Make GitHub Copilot run 10× faster with the world’s fastest inference API. Cerebras Inference powers the world’s top coding models at 2,000 tokens/sec, making code generation instant and enabling super-fast agentic flows. Get your free API key to get started today. Get StartedAPI Key SetupHere's how you can use Cerebras models in VS Code:
Supported ModelsThis extension provides support for Qwen 3 Coder in agent mode, as well as the following models in chat mode:
Advanced TipsHere's how you can accomplish more with Cerebras:
What is Cerebras?Cerebras Systems delivers the world's fastest AI inference for leading open models on top of its revolutionary AI hardware and software. Cerebras consistently delivers chart-topping speeds for leading open models like Qwen 3 480B Coder and OpenAI's GPT OSS 120B, according to independent measurements by Artificial Analysis and OpenRouter. At the heart of Cerebras' technology is the Wafer-Scale Engine (WSE), which is purpose-built for ultra-fast AI training and inference. The Cerebras WSE is the world's fastest processor for AI, delivering unprecedented speed that no number of GPUs can match. Learn more about our novel hardware architecture here. Related |