Skip to content
| Marketplace
Sign in
Visual Studio Code>Machine Learning>iri AI OptimizerNew to Visual Studio Code? Get it now.
iri AI Optimizer

iri AI Optimizer

iri-ai

|
1 install
| (0) | Free
Optimize AI API costs with intelligent caching, compression, and routing
Installation
Launch VS Code Quick Open (Ctrl+P), paste the following command, and press enter.
Copied to clipboard
More Info

iri AI Optimizer - VS Code Extension

Optimize your AI API costs directly in VS Code and Cursor with intelligent caching, compression, and smart routing.

Features

  • Smart Caching: Automatically cache AI responses locally. Same question = instant answer, no API cost
  • Prompt Compression: Reduce token usage while preserving meaning
  • Security Scanning: Detect API keys, passwords, and sensitive data before sending
  • Cost Tracking: See real-time savings in the status bar
  • Dashboard Sync: Usage data syncs to your iri AI dashboard

Installation

Option 1: Download from iri AI Dashboard

  1. Log in to your iri AI dashboard
  2. Go to Settings > Extensions
  3. Click Download VS Code Extension
  4. Open the downloaded .vsix file in VS Code

Option 2: Install from Command Line

# Download the extension
curl -o iri-optimizer.vsix https://your-domain.com/downloads/iri-optimizer-latest.vsix

# Install in VS Code
code --install-extension iri-optimizer.vsix

# Or for Cursor
cursor --install-extension iri-optimizer.vsix

Option 3: Install from VS Code

  1. Open VS Code
  2. Press Ctrl+Shift+P (or Cmd+Shift+P on Mac)
  3. Type "Install from VSIX"
  4. Select the downloaded .vsix file

Getting Started

  1. Login: Press Ctrl+Shift+P and run "iri: Login"
  2. A browser window opens - sign in with your iri AI account
  3. Once authenticated, the extension starts automatically
  4. Look for the savings indicator in the status bar: iri: $0.00 saved

Commands

Command Description
iri: Login Connect to your iri AI account
iri: Logout Disconnect from iri AI
iri: Show Savings View detailed cost savings
iri: Clear Cache Clear the local response cache
iri: Open Settings Configure optimization settings
iri: Toggle Proxy Enable/disable the optimization proxy

Settings

Setting Default Description
iri.enabled true Enable the optimization proxy
iri.proxyPort 9876 Local proxy server port
iri.caching.enabled true Enable response caching
iri.caching.ttlHours 24 Cache duration in hours
iri.compression.enabled true Enable prompt compression
iri.compression.level balanced Compression level: light, balanced, aggressive
iri.routing.enabled false Enable automatic model routing
iri.security.scanSecrets true Scan for API keys before sending
iri.security.scanPii false Scan for PII before sending
iri.sync.enabled true Sync usage to web dashboard

How It Works

  1. The extension runs a local proxy server on port 9876
  2. AI API calls are routed through this proxy
  3. The proxy applies optimizations (caching, compression, etc.)
  4. Savings are tracked and displayed in real-time

Privacy & Security

  • Your code stays local: All processing happens on your machine
  • API keys are encrypted: Stored securely using VS Code's secret storage
  • No source code exposure: Only compiled code is distributed
  • Usage data is optional: Disable sync in settings if preferred

Troubleshooting

Extension not working?

  1. Check if proxy is running: Run "iri: Toggle Proxy" twice
  2. Verify login status: Run "iri: Login"
  3. Check the Output panel: View > Output > Select "iri AI"

Port conflict?

Change the proxy port in settings:

{
  "iri.proxyPort": 9877
}

Support

  • Documentation: https://your-domain.com/docs/extension
  • Issues: https://your-domain.com/support
  • Dashboard: https://your-domain.com/app

iri AI - Save up to 40% on AI API costs

  • Contact us
  • Jobs
  • Privacy
  • Manage cookies
  • Terms of use
  • Trademarks
© 2026 Microsoft