KISS Framework

When Simplicity Becomes Your Superpower: Meet KISS Multi-Agent Multi-Optimization Framework

"Everything should be made as simple as possible, but not simpler." — Albert Einstein

KISS stands for "Keep it Simple, Stupid" which is a well-known software engineering principle.

Table of Contents

Install and Launch KISS Sorcar
Introduction to KISS Sorcar
The Problem with AI Agent Frameworks Today
Your First Agent in 30 Seconds
Multi-Agent Orchestration
Key Features and Principles
KISSAgent API Reference
GEPA Prompt Optimization
KISSEvolve Algorithm Discovery
Models Supported
Contributing
License
Authors

Introduction to KISS Sorcar

KISS Sorcar (named after the famous Bengali magician P.C. Sorcar) is a general-purpose agent with web browsing and native desktop app execution capabilities*. It runs locally as a VS Code extension. It codes really well and works pretty fast. The agent can run relentlessly for hours to days. It is implemented as a Visual Studio Code extension. It has full browser support, multimodal support, and OpenClaw like features (we will post more information about them on our social media channels). The good part is that KISS Sorcar is completely free and open-source; you bring your API key and give it to KISS Sorcar. Note that Sorcar also means government in Bengali.

git clone https://github.com/ksenxx/kiss_ai.git
cd kiss_ai
./install.sh

https://www.youtube.com/watch?v=xnYxWvRqACE.

Install and Launch KISS Sorcar

To Install KISS Sorcar, open Visual Studio Code, search for "KISS Sorcar" in the extension marketplace, install, and relaunch VS Code. Press ESC if you don't have a specific API key, but you must provide at least one API key.

You can also manually download the extension from src/kiss/agents/vscode/kiss-sorcar.vsix.

If you do not want to use the KISS Sorcar IDE, you can open a terminal and use sorcar as a normal shell command. Some examples are:

sorcar -t "What is 2435*234"

sorcar -n -t --use-chat "What is 2435*234?" # to start in a new chat session in sorcar use -n

sorcar -m "claude-sonnet-4-6" -t "What is 2435*234?" # to use a specific model

echo "Can you find the cheapest non-stop flight from SFO to JFK on June 15 by consulting various websites?" > prompt
sorcar -f prompt # use contents of a file to send task

sorcar -t 'Can you send the message "Hello from Sorcar!" to ksen via the desktop slack app?'

sorcar -t 'Can you write a thorough and precise plan in PLAN.md to simplify the project code?'
sorcar -t 'I see some issues and bugs in PLAN.md.  Can you fix them?'  # lie to the agent to force improve the plan

🤖 Models Supported

Supported Models: The framework includes context length, pricing, and capability flags for:

Generation Models (text generation with function calling support):

OpenAI: gpt-4.1, gpt-4.1-mini, gpt-4.1-nano, gpt-4o, gpt-4o-mini, gpt-4.5-preview, gpt-4-turbo, gpt-4, gpt-5, gpt-5-mini, gpt-5-nano, gpt-5-pro, gpt-5.1, gpt-5.2, gpt-5.2-pro, gpt-5.3-chat-latest, gpt-5.4, gpt-5.4-pro
OpenAI (Codex): gpt-5-codex, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.1-codex-mini, gpt-5.2-codex, gpt-5.3-codex, codex-mini-latest
OpenAI (Reasoning): o1, o1-mini, o1-pro, o3, o3-mini, o3-mini-high, o3-pro, o3-deep-research, o4-mini, o4-mini-high, o4-mini-deep-research
OpenAI (Open Source): openai/gpt-oss-20b, openai/gpt-oss-120b
Anthropic: claude-opus-4-6, claude-opus-4-5, claude-opus-4-1, claude-opus-4, claude-sonnet-4-6, claude-sonnet-4-5, claude-sonnet-4, claude-haiku-4-5
Anthropic (Legacy): claude-3-5-haiku, claude-3-haiku-20240307
Gemini: gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash, gemini-2.0-flash-lite, gemini-1.5-pro (deprecated), gemini-1.5-flash (deprecated)
Gemini (preview, unreliable function calling): gemini-3-pro-preview, gemini-3-flash-preview, gemini-3.1-pro-preview, gemini-3.1-flash-lite-preview, gemini-2.5-flash-lite
Together AI (Llama): Llama-4-Scout/Maverick (with function calling), Llama-3.x series (generation only)
Together AI (Qwen): Qwen2.5-72B/7B-Instruct-Turbo, Qwen2.5-Coder-32B, Qwen2.5-VL-72B, Qwen3-235B series, Qwen3-Coder-480B, Qwen3-Coder-Next, Qwen3-Next-80B, Qwen3-VL-32B/8B, QwQ-32B (with function calling)
Together AI (DeepSeek): DeepSeek-R1, DeepSeek-V3-0324, DeepSeek-V3.1 (with function calling)
Together AI (Kimi/Moonshot): Kimi-K2-Instruct, Kimi-K2-Instruct-0905, Kimi-K2-Thinking, Kimi-K2.5
Together AI (Mistral): Ministral-3-14B, Mistral-7B-v0.2/v0.3, Mistral-Small-24B
Together AI (Z.AI): GLM-5.0, GLM-4.5-Air, GLM-4.7
Together AI (Other): Nemotron-Nano-9B, Arcee (Coder-Large, Maestro-Reasoning, Virtuoso-Large, trinity-mini), DeepCogito (cogito-v2 series), google/gemma-2b/3n, Refuel-LLM-2/2-Small, essentialai/rnj-1, marin-community/marin-8b
OpenRouter: Access to 300+ models from 60+ providers via unified API:
- OpenAI (gpt-3.5-turbo, gpt-4, gpt-4-turbo, gpt-4.1, gpt-4o variants, gpt-5/5.1/5.2/5.3/5.4 and codex variants, o1, o3, o3-pro, o4-mini, codex-mini, gpt-oss, gpt-audio)
- Anthropic (claude-3-haiku, claude-3.5-haiku/sonnet, claude-3.7-sonnet, claude-sonnet-4/4.5, claude-haiku-4.5, claude-opus-4/4.1/4.5/4.6 with 1M context)
- Google (gemini-2.0-flash, gemini-2.5-flash/pro, gemini-3-flash/pro-preview, gemma-2-9b/27b, gemma-3-4b/12b/27b, gemma-3n-e4b)
- Meta Llama (llama-3-8b/70b, llama-3.1-8b/70b/405b, llama-3.2-1b/3b/11b-vision, llama-3.3-70b, llama-4-maverick/scout, llama-guard-2/3/4)
- DeepSeek (deepseek-chat/v3/v3.1/v3.2/v3.2-speciale, deepseek-r1/r1-0528/r1-turbo, deepseek-r1-distill variants, deepseek-coder-v2, deepseek-prover-v2)
- Qwen (qwen-2.5-7b/72b, qwen-turbo/plus/max, qwen3-8b/14b/30b/32b/235b, qwen3-coder/coder-plus/coder-next/coder-flash/coder-30b, qwen3-vl variants, qwq-32b, qwen3-next-80b, qwen3-max/max-thinking)
- Amazon Nova (nova-micro/lite/pro, nova-2-lite, nova-premier)
- Cohere (command-r, command-r-plus, command-a, command-r7b)
- X.AI Grok (grok-3/3-mini/3-beta/3-mini-beta, grok-4/4-fast, grok-4.1-fast, grok-code-fast-1)
- MiniMax (minimax-01, minimax-m1, minimax-m2/m2.1/m2.5/m2-her)
- ByteDance Seed (seed-1.6, seed-1.6-flash, seed-2.0, seed-2.0-thinking)
- MoonshotAI (kimi-k2, kimi-k2-thinking, kimi-k2.5, kimi-dev-72b)
- Mistral (codestral, devstral/devstral-medium/devstral-small, mistral-large/medium/small, mixtral-8x7b/8x22b, ministral-3b/8b/14b, pixtral, voxtral)
- NVIDIA (llama-3.1-nemotron-70b/ultra-253b, llama-3.3-nemotron-super-49b, nemotron-nano-9b-v2/12b-v2-vl, nemotron-3-nano-30b)
- Z.AI/GLM (glm-5, glm-4-32b, glm-4.5/4.5-air/4.5v, glm-4.6/4.6v, glm-4.7/4.7-flash)
- AllenAI (olmo-2/3-7b/32b-instruct/think, olmo-3.1-32b-instruct/think, molmo-2-8b)
- Perplexity (sonar, sonar-pro, sonar-pro-search, sonar-deep-research, sonar-reasoning-pro)
- NousResearch (hermes-2-pro/3/4-llama series, hermes-4-70b/405b, deephermes-3)
- Baidu ERNIE (ernie-4.5 series including VL and thinking variants)
- Aurora (openrouter/aurora-alpha — free cloaked reasoning model)
- And 30+ more providers (ai21, aion-labs, alfredpros, alpindale, anthracite-org, arcee-ai, bytedance, deepcogito, essentialai, ibm-granite, inception, inflection, kwaipilot, liquid, meituan, morph, nex-agi, opengvlab, prime-intellect, relace, sao10k, stepfun-ai, tencent, thedrummer, tngtech, upstage, writer, xiaomi, etc.)

Embedding Models (for RAG and semantic search):

OpenAI: text-embedding-3-small, text-embedding-3-large, text-embedding-ada-002
Google: text-embedding-004, gemini-embedding-001
Together AI: BAAI/bge-large-en-v1.5, BAAI/bge-base-en-v1.5, m2-bert-80M-32k-retrieval, multilingual-e5-large-instruct, gte-modernbert-base

Each model in MODEL_INFO includes capability flags:

is_function_calling_supported: Whether the model reliably supports tool/function calling
is_generation_supported: Whether the model supports text generation
is_embedding_supported: Whether the model is an embedding model

🤗 Contributing

Contributions in the form of issues are welcome! KISS Sorcar should be able to take care of them.

📄 License

Apache-2.0

✍️ Authors

Koushik Sen (ksen@berkeley.edu) | LinkedIn | X @koushik77
Yogya Mehrotra (ymehrotr@ucsc.edu) | LinkedIn
Marius Momeu (marius.momeu@berkeley.edu) | LinkedIn | X @MariusMomeu

KISS Sorcar Buggy

Koushik Sen