# Ollama Copilot
Ollama Copilot integrates local LLMs from Ollama directly into VS Code, providing AI-powered code completion and an interactive chat experience with your own locally running models.

## Changelog

### Version 0.1.5

- 🚀 Improved inline suggestions with expanded context (up to 1000 lines)
- 🔄 Fixed Tab key acceptance for multi-line suggestions
- 🎯 Better code completion accuracy with enhanced context awareness
- 💡 Added support for more Ollama models including Qwen and Mixtral
- 🛠️ Improved error handling and connection stability
- 📝 Enhanced documentation with visual guides

## Features

### AI-Powered Code Completions

Get contextual code suggestions as you type, powered by your local Ollama models:
- Smart context awareness (up to 1000 lines of surrounding code)
- Multi-line code suggestions
- Language-specific completions
- Variable and function name awareness
- Tab completion support

### Interactive Chat Interface

Engage with your code through:
- Dedicated sidebar chat panel
- Real-time streaming responses
- Context-aware code discussions
- File and workspace context integration

### Privacy-Focused

- All processing happens locally through Ollama
- No data sent to external servers
- Complete control over your models and data

### Customizable Configuration

- Choose from any installed Ollama model
- Configure API host settings
- Adjust workspace context settings

## Prerequisites

- Install Ollama on your system
- Pull at least one model (see Recommended Models below)
- Make sure Ollama is running (`ollama serve`)
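
A minimal setup sketch for Linux/macOS (the install script is Ollama's official one; `qwen:14b` is just one of the models recommended below):

```bash
# Install Ollama (on Windows, download the installer from ollama.com instead)
curl -fsSL https://ollama.com/install.sh | sh

# Pull at least one model to power completions and chat
ollama pull qwen:14b

# Start the Ollama server (listens on http://localhost:11434 by default)
ollama serve
```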

## Quick Start

1. Install the extension from the VS Code Marketplace
2. Run Ollama in the background (`ollama serve`)
3. Select a default model when prompted
4. Start coding to see inline suggestions
5. Use the sidebar chat for more complex queries
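
If no suggestions appear, you can confirm the server is reachable with a quick request to Ollama's REST API (default port shown; adjust if you changed `ollama.apiHost`):

```bash
# Lists your installed models as JSON when the server is up
curl http://localhost:11434/api/tags
```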

## Model Selection

Choose your model through the Command Palette:

1. Open the Command Palette (`Ctrl+Shift+P` or `Cmd+Shift+P`)
2. Type "Ollama Copilot: Select Default Model"
3. Pick from your installed models

## Recommended Models

For the best experience, we recommend:

### Code Completion

- `qwen:14b`: Excellent for general code completion
- `codellama:13b`: Strong at understanding context
- `deepseek-coder:6.7b`: Fast and efficient
- `phind-codellama:34b`: Great for complex completions

### Chat Interface

- `mixtral:8x7b`: Strong reasoning and explanation
- `llama2:13b`: Good balance of speed and capability
- `neural-chat:7b`: Fast responses for simple queries

### Installing Models

```bash
# Install recommended models
ollama pull qwen:14b
ollama pull codellama:13b
ollama pull mixtral:8x7b

# List installed models
ollama list
```

## Usage Tips

### Code Completion

- Type normally and wait for suggestions
- Press Tab to accept full suggestions
- Use → (right arrow) to accept word by word
- Clear completion cache if suggestions seem stale

### Chat Interface

- Click the Ollama icon in the sidebar
- Use `@` to reference files
- Select code before asking questions
- Toggle workspace context for broader awareness

## Commands

Access these via the Command Palette (`Ctrl+Shift+P` or `Cmd+Shift+P`):

- `Ollama Copilot: Select Default Model`: Change your model
- `Ollama Copilot: Clear Completion Cache`: Reset suggestions
- `Ollama Copilot: Open Chat Panel`: Open the chat interface
- `Ollama Copilot: Search Available Models`: View installed models

## Configuration

Settings available in VS Code:

- `ollama.defaultModel`: Your preferred model
- `ollama.apiHost`: Ollama API endpoint (default: `http://localhost:11434`)
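
For example, in your VS Code `settings.json` (the model value is illustrative; use any name from `ollama list`):

```json
{
  "ollama.defaultModel": "qwen:14b",
  "ollama.apiHost": "http://localhost:11434"
}
```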

## Troubleshooting

### No Suggestions

- Verify Ollama is running (`ollama serve`)
- Check that a model is selected (Command Palette > Select Default Model)
- Clear the completion cache
- Ensure the cursor is at a valid completion point

### Slow Suggestions

- Try a smaller model (see the smoke test below)
- Clear the completion cache
- Check system resources
- Reduce the context size if needed
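
A quick way to rule out model problems is to exercise the model outside VS Code (the model name here is just an example):

```bash
# Confirm the selected model is actually installed
ollama list

# Run the model directly; slow or empty output here points at the model or hardware, not the extension
ollama run qwen:14b "Write hello world in Python"
```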

### Connection Issues

- Confirm Ollama is running
- Check the `ollama.apiHost` setting
- Verify port 11434 is accessible (see the checks below)
- Restart VS Code if needed
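
Two quick checks (default host shown; substitute your `ollama.apiHost` value if you have changed it):

```bash
# A reachable server answers with its version
curl http://localhost:11434/api/version

# On macOS/Linux, confirm something is listening on the default port
lsof -i :11434
```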

## Contributing

We welcome contributions! Please check our GitHub repository for:

- Bug reports
- Feature requests
- Pull requests
- Documentation improvements

## License

MIT License