
Local LLM Chat for Visual Studio


A powerful Visual Studio extension for chatting with local Large Language Models directly in your IDE—completely private and secure

Features • Installation • Getting Started • Commands • Configuration


(Screenshot: local-llm-chat-visualstudio.png)

Features

🤖 Local AI Chat

  • Privacy-First: All data stays on your machine—no cloud APIs required
  • OpenAI-Compatible: Works with Ollama, LM Studio, OpenAI, and any OpenAI-compatible endpoint
  • Persistent Conversations: Chat history is maintained throughout your Visual Studio session
  • Integrated Tool Window: Chat interface seamlessly integrated into Visual Studio's UI

📁 Solution Integration

  • Read Files: Load any solution file into the conversation context
  • List Directories: Browse your project structure directly from chat
  • Search Files: Find files using wildcard patterns (e.g., *.cs, *.json)
  • Solution Info: Get metadata about your solution (Git status, Node.js projects, etc.)
  • Smart Context: Send active file to the AI with a single command

✨ File Creation & Management

  • AI-Powered File Generation: Let the AI create complete files based on your requirements
  • Special Syntax: The AI responds with ```file path="relative/path.ext" blocks for file suggestions (see the example after this list)
  • Safe Operations: Confirmation prompts before creating/overwriting files
  • Path Validation: Security checks prevent file access outside your solution directory
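
For example, when asked to generate a file, the model is expected to answer with a block like the following (the path and contents shown here are only placeholders):

```file path="Services/ExampleService.cs"
// ...complete file contents suggested by the model...
```

When such a block appears in a response, the extension offers to create the file at that path after you confirm.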

⚡ Chat Commands

Built-in slash commands for quick actions:

  • /read <file-path> - Read a file and add to conversation
  • /list [directory] - List files in a directory
  • /search <pattern> - Search for files with wildcard patterns
  • /workspace - Show solution information
  • /help - Display all available commands

Installation

Requirements

  • Visual Studio 2022 (version 17.0 or higher) - Community, Professional, or Enterprise
  • .NET Framework 4.7.2 or higher
  • Local LLM Server: Ollama, LM Studio, or compatible OpenAI API endpoint

From VSIX File

  1. Download the .vsix file from the releases page
  2. Close Visual Studio if running
  3. Double-click the .vsix file to install, or install from the command line: VSIXInstaller.exe /quiet LocalLLMChatVS.vsix
  4. Restart Visual Studio

From Source

git clone https://github.com/markusbegerow/local-llm-chat-vs.git
cd local-llm-chat-vs

Open LocalLLMChatVS.sln in Visual Studio 2022, then press F5 to build and launch in the experimental instance.

Getting Started

1. Set Up Your Local LLM

Option A: Ollama (Recommended)

# Install Ollama from https://ollama.ai
ollama pull llama3.2
ollama serve

Option B: LM Studio

  1. Download from lmstudio.ai
  2. Load a model
  3. Start the local server (default: http://localhost:1234/v1/chat/completions)

Option C: OpenAI API

  • Use directly with your OpenAI API key
  • Endpoint: https://api.openai.com/v1/chat/completions

2. Configure the Extension

  1. Open Visual Studio 2022
  2. Go to Tools → Options → Local LLM Chat → General
  3. Configure the following settings:
    • API URL: Full endpoint URL (e.g., http://localhost:11434/v1/chat/completions)
    • API Token: Your API token or "ollama" for local Ollama
    • Model Name: The model to use (e.g., llama3.2, gpt-4)

Example Ollama Configuration:

API URL: http://localhost:11434/v1/chat/completions
API Token: ollama
Model Name: llama3.2

Example OpenAI Configuration:

API URL: https://api.openai.com/v1/chat/completions
API Token: sk-your-api-key-here
Model Name: gpt-4

3. Start Chatting

  1. Open a solution in Visual Studio
  2. Go to View → Other Windows → Local LLM Chat
  3. Or use the command: Tools → Local LLM Chat → Open Chat
  4. Start asking questions!

Usage Examples

Analyze Code

/read Controllers/HomeController.cs
Can you explain how this controller works and suggest improvements?

Explore Project Structure

/workspace
/list Controllers
What is the architecture of this project?

Generate Files

Create a C# service class for user authentication with JWT tokens, including proper error handling and XML documentation.

The AI will suggest a file with path and content. Click "Yes" in the confirmation dialog to create it!

Search and Refactor

/search *.cs
Find all C# files, then help me refactor the error handling patterns across the project.

Quick File Analysis

  1. Open any file in the editor
  2. Go to Tools → Local LLM Chat → Send Active File to Chat
  3. Ask questions about the code

Send Specific File

  1. Go to Tools → Local LLM Chat → Send File to Chat
  2. Select the file from the dialog
  3. The file content will be added to the conversation context

Commands

Tools Menu Commands

Access these commands from Tools → Local LLM Chat menu:

| Command | Description | Shortcut |
| --- | --- | --- |
| Open Chat | Open the chat tool window | - |
| Clear Conversation | Reset the chat history | - |
| Send Active File to Chat | Send current file to chat | - |
| Send File to Chat | Browse and send any file to chat | - |

You can also access the chat window from View → Other Windows → Local LLM Chat

In-Chat Slash Commands

| Command | Description | Example |
| --- | --- | --- |
| /read <path> | Read file contents | /read Program.cs |
| /list [dir] | List directory files | /list Controllers |
| /search <pattern> | Find files by wildcard | /search *.json |
| /workspace | Show solution info | /workspace |
| /help | Show all commands | /help |

Note: Press Ctrl+Enter in the chat input to send your message.

Configuration

Access settings through Tools → Options → Local LLM Chat → General

API Configuration

| Setting | Default | Description |
| --- | --- | --- |
| API URL | http://localhost:11434/v1/chat/completions | Full API endpoint URL |
| API Token | ollama | API authentication token (or dummy value for local) |
| Model Name | llama3.2 | Model name to use |

Model Parameters

| Setting | Default | Description |
| --- | --- | --- |
| Temperature | 0.7 | Sampling temperature (0.0 = deterministic, 2.0 = very random) |
| Max Tokens | 2048 | Maximum tokens for model responses |
| System Prompt | (default) | System prompt sent to the LLM to define behavior |

Conversation

| Setting | Default | Description |
| --- | --- | --- |
| Max History Messages | 50 | Maximum messages to keep in conversation history |

Network

| Setting | Default | Description |
| --- | --- | --- |
| Request Timeout (ms) | 120000 | Request timeout in milliseconds (2 minutes) |

Security

| Setting | Default | Description |
| --- | --- | --- |
| Max File Size (bytes) | 1048576 | Maximum file size for LLM-generated files (1 MB) |
| Allow Write Without Prompt | false | NOT recommended - allows file creation without confirmation |

Configuration Examples

Ollama (Local)

API URL: http://localhost:11434/v1/chat/completions
API Token: ollama
Model Name: llama3.2

LM Studio (Local)

API URL: http://localhost:1234/v1/chat/completions
API Token: lmstudio
Model Name: your-model-name

OpenAI (Cloud)

API URL: https://api.openai.com/v1/chat/completions
API Token: sk-your-actual-api-key-here
Model Name: gpt-4
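
All three setups speak the same OpenAI-style chat completions API; only the URL, token, and model name change. As a rough illustration of the request such an endpoint expects, here is a minimal standalone C# sketch using the Ollama values above (illustrative only, not the extension's actual LLMService code):

using System;
using System.Net.Http;
using System.Text;
using System.Threading.Tasks;

class ChatCompletionSketch
{
    static async Task Main()
    {
        // Illustrative values; substitute the API URL, token, and model from your own settings.
        var apiUrl = "http://localhost:11434/v1/chat/completions";
        var apiToken = "ollama";

        using (var client = new HttpClient())
        {
            client.DefaultRequestHeaders.Add("Authorization", "Bearer " + apiToken);

            // Standard OpenAI-style chat completion body: model, messages, temperature, max_tokens.
            var json = "{\"model\":\"llama3.2\"," +
                       "\"messages\":[{\"role\":\"user\",\"content\":\"Hello\"}]," +
                       "\"temperature\":0.7,\"max_tokens\":2048}";

            var response = await client.PostAsync(apiUrl,
                new StringContent(json, Encoding.UTF8, "application/json"));
            Console.WriteLine(await response.Content.ReadAsStringAsync());
        }
    }
}

The Temperature and Max Tokens settings described below correspond to the temperature and max_tokens fields of this request body.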

Security Features

  • ✅ Path Traversal Protection: Prevents access outside the solution directory (see the sketch after this list)
  • ✅ File Size Limits: Configurable maximum file sizes (default: 1MB)
  • ✅ Path Validation: All file paths validated before operations
  • ✅ URL Validation: API endpoints validated for security
  • ✅ Confirmation Prompts: Review before creating/overwriting files
  • ✅ Local-Only Default: No external API calls unless you configure them
  • ✅ Binary/Build Exclusion: Automatically excludes bin/, obj/, node_modules/, .git/
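
The path traversal protection above comes down to resolving every requested path against the solution root and rejecting anything that escapes it. A minimal sketch of that kind of check (illustrative only, not the extension's actual PathValidator):

using System;
using System.IO;

static class PathGuardSketch
{
    // Returns true only if 'relativePath' still points inside 'solutionRoot' after normalization,
    // so input like "..\..\secrets.txt" (a hypothetical example) is rejected.
    public static bool IsInsideSolution(string solutionRoot, string relativePath)
    {
        var root = Path.GetFullPath(solutionRoot)
            .TrimEnd(Path.DirectorySeparatorChar) + Path.DirectorySeparatorChar;

        // Combine and normalize so ".." segments are resolved before the comparison.
        var fullPath = Path.GetFullPath(Path.Combine(root, relativePath));

        return fullPath.StartsWith(root, StringComparison.OrdinalIgnoreCase);
    }
}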


Development

Building from Source

# Clone repository
git clone https://github.com/markusbegerow/local-llm-chat-vs.git
cd local-llm-chat-vs

  1. Open LocalLLMChatVS.sln in Visual Studio 2022
  2. Restore NuGet packages (right-click solution → Restore NuGet Packages)
  3. Build the solution (Ctrl+Shift+B)
  4. Press F5 to launch in experimental instance

Packaging as VSIX

  1. Build the project in Release mode
  2. The VSIX file will be generated in bin\Release\LocalLLMChatVS.vsix
  3. Install it by double-clicking the VSIX file

Project Structure

LocalLLMChatVS/
├── Commands/
│   ├── OpenChatCommand.cs           # Open chat window command
│   ├── ClearConversationCommand.cs  # Clear chat command
│   ├── SendFileToChatCommand.cs     # Send file command
│   └── SendActiveFileToChatCommand.cs # Send active file command
├── Services/
│   ├── LLMService.cs                # LLM API integration
│   └── WorkspaceService.cs          # Solution/file operations
├── ToolWindows/
│   ├── ChatWindow.cs                # Chat tool window
│   └── ChatWindowControl.xaml.cs    # Chat UI control (WPF)
├── Models/
│   └── ChatMessage.cs               # Data models
├── Utilities/
│   ├── PathValidator.cs             # Path security validation
│   └── SecurityValidator.cs         # Security utilities
├── Options/
│   └── GeneralOptions.cs            # Settings page
├── LocalLLMChatPackage.cs           # Main package class
├── source.extension.vsixmanifest    # Extension manifest
└── VSCommandTable.vsct              # Command definitions
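
For orientation, the /search and /list operations handled by WorkspaceService amount to enumerating files under the solution root while skipping the excluded build and VCS folders. A rough sketch of that kind of lookup (illustrative only, not the actual WorkspaceService code):

using System;
using System.Collections.Generic;
using System.IO;
using System.Linq;

static class FileSearchSketch
{
    // Folders skipped during search, matching the exclusions listed under Security Features.
    static readonly string[] ExcludedDirs = { "bin", "obj", "node_modules", ".git" };

    // Finds files under 'solutionRoot' matching a wildcard such as "*.cs".
    public static IEnumerable<string> Search(string solutionRoot, string pattern)
    {
        return Directory.EnumerateFiles(solutionRoot, pattern, SearchOption.AllDirectories)
            .Where(path => !ExcludedDirs.Any(dir =>
                path.Split(Path.DirectorySeparatorChar)
                    .Contains(dir, StringComparer.OrdinalIgnoreCase)));
    }
}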

Troubleshooting

Connection Issues

  • Verify LLM server is running:
    • Ollama: curl http://localhost:11434/v1/models or ollama list
    • LM Studio: Check the server tab shows "Server Running"
  • Check API URL in settings:
    • Go to Tools → Options → Local LLM Chat → General
    • Verify the API URL matches your server
    • Ensure the endpoint includes /v1/chat/completions
  • Test with curl:
    curl http://localhost:11434/v1/models
    

Chat Not Responding

  • Check error messages in the chat window or Visual Studio Output window
  • Increase timeout: Go to Tools → Options → Local LLM Chat → General and increase "Request Timeout"
  • Verify model is loaded in your LLM server
  • Check API token is correct (use "ollama" or any dummy value for local servers)

File Operations Failing

  • Ensure a solution is open in Visual Studio
  • Check file paths are relative to the solution directory (no absolute paths)
  • Verify file size limits in Tools → Options → Local LLM Chat → Security
  • Check permissions on the solution directory

Extension Not Loading

  • Check Visual Studio version: Must be 2022 (17.0+)
  • Verify .NET Framework 4.7.2 or higher is installed
  • Check Extensions → Manage Extensions to see if extension is enabled
  • Try resetting Visual Studio settings: Tools → Import and Export Settings

Recommended Models

Coding Tasks (C# / .NET)

  • Qwen 2.5 Coder (7B/14B): Excellent for C# and .NET development
  • CodeLlama (13B/34B): Specialized for code generation and understanding
  • DeepSeek Coder (6.7B/33B): Strong at complex algorithms and refactoring
  • Llama 3.2 (3B/8B): Good balance of speed and capability

General Tasks

  • Llama 3.2: Best all-around performance for most tasks
  • Mistral 7B: Fast and efficient for quick questions
  • Phi-3: Compact but capable for lighter workloads

Cloud/Enterprise

  • GPT-4 / GPT-4 Turbo: Best quality, requires OpenAI API key
  • GPT-3.5 Turbo: Faster and cheaper alternative

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

Development Guidelines

  • Follow C# coding conventions
  • Test in Visual Studio 2022 experimental instance
  • Update documentation for new features
  • Ensure security validators are used for file operations

License

This project is licensed under the GPL - see the LICENSE.txt file for details.

Acknowledgments

  • Thanks to the Ollama team for making local LLMs accessible
  • LM Studio for providing an excellent local inference platform
  • The Visual Studio extensibility team for comprehensive SDK and documentation
  • OpenAI for the standardized API format

🙋‍♂️ Get Involved

If you encounter any issues or have questions:

  • 🐛 Report bugs
  • 💡 Request features
  • ⭐ Star the repo if you find it useful!

☕ Support the Project

If you like this project, support further development with a repost or coffee:

Buy Me a Coffee

📬 Contact

  • 🧑‍💻 Markus Begerow
  • 💾 GitHub
  • ✉️ Twitter

Privacy Notice: This extension operates entirely locally by default. No data is sent to external servers unless you explicitly configure it to use a remote API endpoint (like OpenAI).
