Opilot — Ollama for GitHub Copilot VS Code Extension

Run Ollama models with full tool and vision support inside GitHub Copilot Chat

Opilot integrates the full Ollama ecosystem — local models, cloud models, and the Ollama model library — directly into VS Code's Copilot Chat interface. Your conversations never leave your machine when using local models, and you can switch between models without leaving the editor.

📖 Docs • 🛒 Marketplace • 🐙 GitHub • 🐛 Issues

🌐 Ollama • 📖 Ollama Repo • 📚 Model Library

✨ Features

🦙 All Ollama Models — Use any model from the Ollama Library, including Cloud models (after ollama login), as first-class Copilot chat models and as the @ollama participant
🛠️ Model Management Sidebar — Pull, run, inspect, stop, and delete models from a dedicated Ollama activity bar panel with live status badges
🎛️ Per-Model Settings Panel — Tune temperature, top-p/top-k, context, max tokens, and thinking budget from an in-editor webview; settings persist per model
📡 Status Bar Heartbeat — Always-visible Ollama server indicator with running model count, connectivity state, and resource tooltip
💬 Chat Participant — Invoke @ollama in Copilot Chat for a dedicated, history-aware conversation with your chosen local model
📝 Modelfile Manager — Create, edit, and build custom Ollama modelfiles with syntax highlighting, hover documentation, and autocomplete
⌨️ Inline Code Completions — Get fill-in-the-middle code suggestions powered by a local Ollama model as you type
🔧 Tool Calling — Full tool/function-calling support for agentic workflows with compatible models (MCP servers, VS Code commands, custom skills)
🖼️ Vision Support — Image input for models with vision capabilities; non-vision models automatically have images stripped to avoid prompt overflow
💭 Thinking Models — Extended reasoning with collapsible "Thinking" and "Response" sections for models that expose chain-of-thought (e.g., DeepSeek-R1, Qwen QwQ, Kimi)
🏠 Local Execution & Privacy — Local models run entirely on your machine; no data is sent to any external service
⚡ Streaming — Real-time token streaming for low-latency responses in both the chat participant and provider paths
🔒 Secure Token Storage — Authentication tokens for remote Ollama instances are stored in VS Code's encrypted secrets API

🔧 Requirements

VS Code 1.111.0 or higher
GitHub Copilot Chat extension installed and active
Ollama installed locally (Download) or a remote Ollama instance you control

🚀 Quick Start

Install Ollama and start it (ollama serve or open the app)
Install Opilot from the VS Code Marketplace (or install the .vsix file)
The Ollama icon appears in the activity bar — click it to open the sidebar
Pull a model from the Library panel (e.g., llama3.2:3b)
Open Copilot Chat, click the model picker, and select your Ollama model — or type @ollama to chat

The extension auto-detects your local Ollama instance at http://localhost:11434. To use cloud models, run ollama login first. To use a remote instance, set opilot.host in VS Code settings (legacy ollama.host is still supported).

⚙️ Configuration

Open VS Code Settings (Ctrl+, / Cmd+,) and search for "Opilot":

opilot.host - Ollama server address (default: http://localhost:11434)
opilot.streamLogs - Stream Ollama server logs to output channel (default: true)
opilot.localModelRefreshInterval - Auto-refresh interval for local and running models, in seconds (default: 30)
opilot.libraryRefreshInterval - Reserved refresh interval for library and cloud model catalogs, in seconds (default: 21600); panels currently refresh on startup and via the manual refresh button
opilot.completionModel - Model used for inline code completions (e.g. qwen2.5-coder:1.5b). Leave empty to disable.
opilot.enableInlineCompletions - Enable or disable inline code completions (default: true)
opilot.modelfilesPath - Folder where modelfiles are stored (default: ~/.ollama/modelfiles)
opilot.diagnostics.logLevel - Verbosity of the Ollama output channel (debug, info, warn, error; default: info)

Legacy ollama.* settings continue to work and are migrated automatically on activation.

To use a remote Ollama instance, update opilot.host to point to your remote server.

💬 Usage

Model Picker

To use an Ollama model in Copilot Chat without the @ollama handle:

Open GitHub Copilot Chat panel in VS Code
Click the model selector dropdown
Choose an Ollama model (local or from library)
Start chatting!

Chat Participant

Type @ollama in any Copilot Chat input to direct the conversation to your local Ollama instance:

@ollama explain the architecture of this TypeScript project

The participant is sticky — once invoked, it stays active for the thread.

Inline Code Completions

Set opilot.completionModel to a locally-installed model to get inline code completions as you type. Smaller, fast models work best:

qwen2.5-coder:1.5b
deepseek-coder:1.3b
starcoder2:3b

Completions use fill-in-the-middle (FIM) when the model supports it, and can be toggled with opilot.enableInlineCompletions.

The Ollama activity bar icon opens a sidebar with four panels:

Local Models

View all locally installed models grouped by family (tree view) or as a flat list
Filter models by name using the filter icon in the panel header
Toggle between grouped tree view and flat list with the layout icon
Open Model Settings from the gear icon in the Local Models toolbar
Inline buttons per model: Start (▶), Stop (⏹), Delete (🗑)
Running models show VRAM usage and how long they've been loaded
Model capability badges: 🧠 thinking, 🛠️ tools, 👁️ vision, 🧩 embedding
Auto-refreshes every 30 seconds (configurable via opilot.localModelRefreshInterval); refresh interval restarts automatically when the setting changes

Model Settings Panel

Open Ollama: Open Model Settings (or click the gear icon in Local Models) to configure per-model generation overrides:

Temperature
Top-P
Top-K
Context window (num_ctx)
Max tokens (num_predict)
Thinking toggle (think)
Thinking budget (think_budget)

Changes apply immediately and are persisted per model in the extension global storage.

Status Bar Heartbeat

Opilot adds a persistent status bar item:

$(loading~spin) Ollama… while checking
$(pulse) Ollama or $(pulse) Ollama (N) when reachable
$(warning) Ollama offline after debounced failures

Click the status bar item (or run Ollama: Check Server Health) for an immediate connectivity check.

Cloud Models

View models pulled from Ollama Cloud (requires ollama login)
Filter, group by family, and collapse all — same controls as Local Models
Inline buttons: Open page (🔗), Run (▶), Stop (⏹), Delete (🗑)
Use the Login (👤) button in the panel header to authenticate

Library

Browse hundreds of pre-configured models from ollama.ai/library
Models grouped by family with collapsible variant children
Filter by name; sort by newest or name
Variants already downloaded locally show a ✓ checkmark
Click Pull (⬇) on any variant to download it with streaming progress

Modelfiles

The Modelfile Manager pane for creating and managing custom Ollama modelfiles. See Modelfile Manager below.

Modelfile Manager

Creating a new Modelfile

Click the + button in the Modelfile Manager pane header. An interactive wizard will guide you through:

Name — enter a name for the modelfile (e.g. pirate-bot)
Base model — pick a model from your locally installed Ollama models
System prompt — describe the AI persona or task

The wizard creates the file, pre-populates it with the chosen settings, and opens it in the editor.

Building a Modelfile

Right-click any .modelfile in the pane and choose Build Model from Modelfile (or use the command palette: Ollama: Build Model from Modelfile). This runs ollama create with the file and streams progress in a VS Code notification.

Syntax support

All .modelfile files receive:

Syntax highlighting — keywords (FROM, PARAMETER, SYSTEM, TEMPLATE, ADAPTER, LICENSE, MESSAGE, REQUIRES), parameter names, numbers, strings, and comments
Hover documentation — hover over any keyword or parameter name to see its description and usage
Autocomplete — suggestions for Modelfile keywords and common parameter names

# Modelfile — pirate-bot
FROM llama3.2:3b

SYSTEM """You are a helpful pirate assistant. Arr!"""

PARAMETER temperature 0.7
PARAMETER num_ctx 4096

See the Ollama Modelfile Docs for the full syntax reference.

🛡️ Privacy & Security

Your models and conversations run completely locally - no data is sent to external services
The extension communicates only with your local Ollama instance (or your specified remote instance)
No telemetry, tracking, or data collection
Authentication tokens (if using a remote instance) are stored securely using VS Code's encrypted secrets API

For more information on Ollama's security and privacy model, see the Ollama GitHub repository.

🛠️ Development

Prerequisites

Node.js 20+
pnpm (version pinned in package.json)
VS Code 1.111.0+

Build

pnpm install
pnpm run compile        # type-check + lint + bundle
pnpm run watch          # parallel watch for type-check and bundle

Testing

pnpm test               # unit tests (Vitest)
pnpm run test:coverage  # unit tests with coverage (target: 85%)
pnpm run test:extension # VS Code integration tests
pnpm run lint           # static analysis (oxlint)

Debugging

Open the project in VS Code and press F5 to launch the Extension Development Host with the extension loaded.

📄 License

MIT License - See LICENSE for details.

Maintained by Daniel Sieradski (@selfagency).

📚 Resources

Opilot

Documentation - Full user and developer docs
VS Code Marketplace - Install from the marketplace
GitHub Repository - Source code and releases
GitHub Issues - Bug reports and feature requests

Ollama

Ollama GitHub - Main Ollama repository
Ollama Model Library - Browse available models
Ollama API Docs - REST API documentation
Ollama Modelfile Docs - Create custom models
VS Code Language Model API - Extension API reference

Opilot

The Self Agency LLC

Opilot — Ollama for GitHub Copilot VS Code Extension

✨ Features

🔧 Requirements

🚀 Quick Start

⚙️ Configuration

💬 Usage

Model Picker

Chat Participant

Inline Code Completions

Sidebar: Model Management

Local Models

Model Settings Panel

Status Bar Heartbeat

Cloud Models

Library

Modelfiles

Modelfile Manager

Creating a new Modelfile

Building a Modelfile

Syntax support

🛡️ Privacy & Security

🛠️ Development

Prerequisites

Build

Testing

Debugging

📄 License

📚 Resources

Opilot

Ollama