I am thankful to everyone who contributed to this project and hope it served as a useful tool.
Llama.cpp Provider for GitHub Copilot
This extension integrates Llama.cpp models into GitHub Copilot in VS Code. It lets you connect to a local Llama.cpp server and use its models for chat interactions and code generation.
Features
Integrates a Llama.cpp server into VS Code's language model chat.
Supports streaming responses.
Supports tool calling (function invocation).
Manages multiple models from the Llama.cpp server.
Requirements
VS Code version 1.104.0 or higher.
A running Llama.cpp server with an OpenAI-compatible API.
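One way to confirm that the server meets the second requirement is to request its model list. The sketch below is illustrative only and is not taken from the extension's source: it assumes the server exposes the OpenAI-style `/v1/models` path and answers with a `{ "data": [{ "id": ... }] }` body, and the helper names `modelsUrl` and `parseModelIds` are hypothetical.

```typescript
// Build the model-list URL for a given server base URL.
// Assumption: the server exposes the OpenAI-style "/v1/models" path.
function modelsUrl(baseUrl: string): string {
  // Drop trailing slashes so that "http://host:8080/" and "http://host:8080" behave the same.
  return baseUrl.replace(/\/+$/, "") + "/v1/models";
}

// Extract model ids from a parsed response body.
// Assumption: the body looks like { data: [{ id: "model-name" }, ...] }.
// Entries without a string id are skipped rather than treated as errors.
function parseModelIds(body: unknown): string[] {
  if (typeof body !== "object" || body === null) {
    return [];
  }
  const data = (body as { data?: unknown }).data;
  if (!Array.isArray(data)) {
    return [];
  }
  const ids: string[] = [];
  for (const entry of data) {
    if (typeof entry === "object" && entry !== null) {
      const id = (entry as { id?: unknown }).id;
      if (typeof id === "string") {
        ids.push(id);
      }
    }
  }
  return ids;
}
```

A caller would send a GET request to `modelsUrl(...)` with the address the server is listening on (for example `http://localhost:8080`, which is an assumption about your setup) and pass the decoded JSON body to `parseModelIds`. An empty result suggests the server is not reachable at that address or does not speak the expected format.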
The extension uses a base provider class for OpenAI-compatible chat APIs. The Llama.cpp provider extends this base to connect to a local Llama.cpp server. It handles model fetching, message conversion, and streaming responses. Tool calling is supported through OpenAI-compatible formats.
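The streaming and tool-calling part of this description can be illustrated with a small sketch. It is not the extension's actual code. It assumes the server streams OpenAI-style server-sent events, where each `data:` line carries a JSON chunk with `choices[0].delta`, tool-call arguments arrive as string fragments keyed by `index`, and `data: [DONE]` ends the stream. The names `StreamState`, `newStreamState`, and `applySseLine` are hypothetical.

```typescript
// One tool call reassembled from streamed fragments.
interface ToolCall {
  id: string;
  name: string;
  arguments: string; // JSON text, concatenated from the streamed pieces
}

// Everything accumulated from the stream so far.
interface StreamState {
  text: string;
  toolCalls: ToolCall[];
  done: boolean;
}

function newStreamState(): StreamState {
  return { text: "", toolCalls: [], done: false };
}

// Apply a single server-sent-event line to the state.
// Lines that are not "data:" payloads (blank lines, comments) are ignored.
function applySseLine(state: StreamState, line: string): void {
  const trimmed = line.trim();
  if (!trimmed.startsWith("data:")) {
    return;
  }
  const payload = trimmed.slice("data:".length).trim();
  if (payload === "[DONE]") {
    state.done = true;
    return;
  }
  // Assumed chunk shape: { choices: [{ delta: { content?, tool_calls? } }] }
  const chunk = JSON.parse(payload);
  const delta = chunk.choices?.[0]?.delta;
  if (!delta) {
    return;
  }
  if (typeof delta.content === "string") {
    state.text += delta.content;
  }
  for (const part of delta.tool_calls ?? []) {
    const index: number = typeof part.index === "number" ? part.index : 0;
    let call = state.toolCalls[index];
    if (!call) {
      call = { id: "", name: "", arguments: "" };
      state.toolCalls[index] = call;
    }
    if (part.id) {
      call.id = part.id;
    }
    if (part.function?.name) {
      call.name = part.function.name;
    }
    if (part.function?.arguments) {
      // Argument fragments are appended; the result is only valid JSON once the stream ends.
      call.arguments += part.function.arguments;
    }
  }
}
```

Keeping the accumulator separate from the network code means the same function can be fed from any line reader. The `arguments` string should be parsed with `JSON.parse` only after `done` is set, since an intermediate value is usually an incomplete JSON fragment.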