Trimli AI — Token Optimizer
No config changes to your AI tools. No prompts modified visibly. Just lower bills. The problem this solvesAI coding tools are expensive at scale. A typical developer sending 100 requests a day to GPT-4o spends $80–150/month on input tokens (agentic workflows send ~20K tokens per request including system prompts, file context, and conversation history) — most of it wasted on repeated context, verbose history, and filler the model doesn't need. Trimli AI sits between your tool and the API. It strips the waste, keeps the signal, and forwards a leaner prompt. The model never knows. Your bill does.
Across 100 requests a day that's $87/month back in your pocket at the conservative estimate. On longer agentic sessions the number is higher. Setup in 60 seconds1. Install the extension. A local proxy starts automatically on 2. Point your AI tool at the proxy:
3. Code normally. The optimizer runs silently on every request. Watch the status bar update in real time: How much will you actually save?Savings depend on how you work. Here's what to expect across common workflows:
The more context your session accumulates, the more the optimizer saves. Short queries get modest savings. Long agentic sessions routinely hit 55–65%. Real savings by modelBased on 100 requests/day at ~20,000 tokens/request (typical for agentic workflows — system prompts, file context, and conversation history):
Pro ($10/mo) pays for itself in under a day on any model. For a 5-person team: $175–$396/month saved. DashboardClick the
Web dashboard: sign in at app.trimliai.com to see full analytics and 30-day charts. Commands
Settings
Tiers
No account required on the free tier. A licence key is created automatically when you install. Upgrade at app.trimliai.com. FAQDoes it store my prompts? No. The proxy optimizes in-flight and immediately discards the messages. Nothing is logged, cached, or sent anywhere except directly to the upstream AI API. Will it change the quality of AI responses? No. Tested across 59 accuracy benchmarks — zero quality degradation detected. Does it work with streaming? Yes. Streaming responses pass through unchanged. Only the input prompt is compressed. Does it work offline? Yes. The proxy runs entirely on your machine. Does it work with Claude Code?
Yes — enable the forward proxy (Command Palette → What if I use multiple AI tools?
Point all of them at LicenseBusiness Source License 1.1 — free to use for individuals and teams. You may not offer a competing token optimization SaaS. Converts to Apache 2.0 on 2030-04-11. |