Cursor for Speech - VS Code Extension

🎤 Adds advanced voice-to-text capabilities directly to the Cursor/VS Code editor, with Web Speech API and OpenAI Whisper support. Premium Whisper transcription requires an OpenAI API key.

Created by Vrishn Viswa Sathyamoorthy

📧 Contact: vrishnviswasathyamoorthy@gmail.com | 📱 Phone: +1 (914) 879-5242 | 🐙 GitHub: @KingReaper96420

✨ Features

🎙️ DUAL TRANSCRIPTION MODES: Choose between Web Speech API (free) and OpenAI Whisper (premium accuracy).
🔑 OPENAI WHISPER SUPPORT: Premium transcription with better accuracy, punctuation, and context understanding.
🆓 FREE WEB SPEECH API: Built-in browser speech recognition as a free alternative.
🖱️ SEAMLESS INTEGRATION: Native VS Code extension with status bar integration and webview panel.
⌨️ HOTKEY SUPPORT: Use Ctrl+M (Cmd+M on Mac) to open the voice panel and Ctrl+Shift+V (Cmd+Shift+V on Mac) to toggle recording.
🌍 MULTI-LANGUAGE: Support for 100+ languages with both transcription methods.
⚙️ CONFIGURABLE: Customize language, auto-insertion, transcription method, and other settings.
💰 COST-EFFECTIVE: Whisper API costs ~$0.006 per minute of audio.

🔑 OpenAI API Key Setup (Required for Whisper)

For premium Whisper transcription, you need an OpenAI API key:

Get API Key:
- Visit OpenAI Platform
- Sign up or log in to your account
- Create a new API key
- Copy the key (starts with sk-)
Configure Extension:
- Open VS Code/Cursor settings
- Search for "cursorforspeech"
- Set cursorforspeech.whisperApiKey to your API key
- Enable cursorforspeech.useWhisper for premium transcription
Cost Information:
- Whisper API: ~$0.006 per minute of audio
- Web Speech API: Free (built into browsers)
- You can switch between methods anytime
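
To put the per-minute rate in context, the arithmetic can be sketched in a few lines of shell. The rate is the approximate figure quoted above, not a guaranteed price, and `estimate_whisper_cost` is a hypothetical helper name that is not part of the extension.

```shell
# Approximate Whisper rate in USD per minute, as quoted in this README.
# Assumption: check current OpenAI pricing before relying on it.
RATE_PER_MINUTE=0.006

# Hypothetical helper: print the estimated cost in USD for $1 minutes of audio.
estimate_whisper_cost() {
  awk -v minutes="$1" -v rate="$RATE_PER_MINUTE" \
    'BEGIN { printf "%.3f\n", minutes * rate }'
}

estimate_whisper_cost 10    # ten minutes of dictation
estimate_whisper_cost 600   # ten hours of dictation
```

At this rate, ten minutes of audio comes to about $0.06 and ten hours to about $3.60; the Web Speech API remains free.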

🚀 Installation Guide

Option 1: From VS Code Marketplace (Recommended)

Open VS Code/Cursor
Go to Extensions (Cmd+Shift+X or Ctrl+Shift+X)
Search for "Cursor for Speech"
Click "Install"

Option 2: Install from VSIX

Download the VSIX file from the releases page
Install via VS Code:
- Open Command Palette (Cmd+Shift+P or Ctrl+Shift+P)
- Run: Extensions: Install from VSIX...
- Select the cursorforspeech-3.0.0.vsix file

Option 3: Build from Source

Clone the Repository

```
git clone https://github.com/KingReaper96420/cursorforspeech.git
cd cursorforspeech
```

Install Dependencies
```
npm install
```
Compile and Package
```
npm run compile
npm run package
```

Install the generated VSIX

```
code --install-extension cursorforspeech-3.0.0.vsix
```

🛠️ How to Use

Quick Start

🎤 Click the mic in the editor toolbar (top-right) or the "🎤 Speech" button in the status bar (bottom right)
⏳ The first time, the AI model needs to download (please wait a moment)
🔴 Click the microphone button to start recording
🗣️ Speak clearly
⏹️ Click again to stop recording
⌨️ Your transcribed text will appear in the active editor!

Available Commands

Cursor for Speech: Toggle Voice Recording - Start/stop voice recording
Cursor for Speech: Open Voice Panel - Open the voice control panel
Cursor for Speech: Select Voice Recognition Language - Choose transcription language
Cursor for Speech: Send Selection to Cursor Chat - Copies selection and opens chat to paste

Hotkeys

Cmd+Shift+V (Mac) or Ctrl+Shift+V (Windows/Linux) - Toggle recording
Cmd+M (Mac) or Ctrl+M (Windows/Linux) - Open voice panel

A microphone icon appears in the editor toolbar (top-right) for any text-based editor
Click it to start/stop recording; transcription inserts at the cursor position

Access the full-featured voice panel via:

Editor toolbar mic icon (top-right of any text editor)
Command Palette: Cursor for Speech: Open Voice Panel
Status bar button (right-click for menu)

The panel provides:

Big mic button with recording states
Transcript area with Insert, Copy, Send to Cursor Chat actions
Language selection (60+ supported languages)
Auto-insertion toggle
Recording status and help

⚙️ Configuration

Open VS Code Settings (Cmd+, or Ctrl+,) and search for "Cursor for Speech" to configure:

Language: Set the default transcription language (default: auto-detect)
Max Tokens: Maximum length of transcription output (default: 128)
Auto Insert: Automatically insert transcribed text into the active editor (default: true)

🖥️ Compatibility

✅ VS Code/Cursor: 1.74.0+
✅ WebGPU Required: For optimal performance (may work with fallback on some systems)
✅ Operating Systems:
- macOS (Apple Silicon & Intel)
- Windows 10/11
- Linux (Ubuntu, Fedora, etc.)
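
The 1.74.0 minimum can be checked from a terminal. The sketch below is one way to compare dotted version strings; `version_at_least` is a hypothetical helper, and it assumes a `sort` that supports `-V` (GNU coreutils and recent BSD/macOS versions do).

```shell
# Minimum editor version listed in the compatibility notes above.
MIN_VERSION=1.74.0

# Hypothetical helper: succeed when version $1 is greater than or equal to $2.
# sort -V orders version strings numerically; if the minimum sorts first
# (or both are equal), the candidate version is new enough.
version_at_least() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

version_at_least 1.85.2 "$MIN_VERSION" && echo "1.85.2 is supported"
version_at_least 1.73.1 "$MIN_VERSION" || echo "1.73.1 is too old"
```

With VS Code on the PATH, the installed version is the first line of `code --version`, so `version_at_least "$(code --version | head -n 1)" "$MIN_VERSION"` performs the check against the real install. Cursor may report its own version number rather than the underlying VS Code version, so treat the result there with caution.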

🌍 Supported Languages

Auto-detect and 60+ languages including:

English, Chinese, German, Spanish, French
Japanese, Korean, Russian, Portuguese, Italian
Arabic, Hindi, Dutch, Swedish, Norwegian
And many more...

🔧 Troubleshooting

Common Issues

"WebGPU not supported"

Ensure you're using a modern browser/editor version
Check if hardware acceleration is enabled
Try updating your graphics drivers

"Microphone access denied"

Grant microphone permissions in your system settings
Restart VS Code/Cursor after granting permissions

"Model loading takes too long"

First-time model download can take 1-5 minutes depending on connection
Subsequent uses will be much faster (model is cached)

"No transcription output"

Ensure you're speaking clearly and loudly enough
Check that your microphone is working in other applications
Try adjusting the language setting if auto-detect isn't working
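
When Whisper mode produces no output, it can help to rule out the API key by calling OpenAI's transcription endpoint directly, outside the editor. This is a sketch under assumptions: `check_whisper_key` is a hypothetical helper, the key is read from an `OPENAI_API_KEY` environment variable rather than from the extension settings, and `whisper-1` is assumed as the model name (this README does not say which model the extension requests).

```shell
# Hypothetical helper: send one audio file to the OpenAI transcription
# endpoint and print the JSON response. Returns non-zero if the key or the
# file is missing, or if the request fails (for example, HTTP 401).
check_whisper_key() {
  audio_file="$1"
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
  if [ ! -f "$audio_file" ]; then
    echo "audio file not found: $audio_file" >&2
    return 1
  fi
  # --fail turns HTTP errors into a non-zero exit status;
  # -sS hides the progress meter but still prints error messages.
  curl -sS --fail https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer $OPENAI_API_KEY" \
    -F model=whisper-1 \
    -F "file=@$audio_file"
}
```

After exporting the key in the current shell, run `check_whisper_key sample.wav` with any short recording (`sample.wav` is a placeholder). A JSON response containing a `text` field suggests the key works and the problem lies elsewhere, such as microphone permissions or the language setting.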

Development/Debug Mode

Open the extension development host:

```
code --extensionDevelopmentPath=./path/to/cursorforspeech
```

Open Developer Tools (Cmd+Option+I or F12) to see console logs

🤝 Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📝 License

This project is licensed under the MIT License for Personal Use Only - see the LICENSE file for details.

❤️ Support The Developer

If you find cursorforspeech helpful, consider supporting Vrishn Viswa Sathyamoorthy!

⭐ Star the repository if you find it useful!

🔄 Migration from Yap for Cursor

If you were using the previous "Yap for Cursor" extension:

Uninstall the old setup:
- Remove the custom CSS imports from your VS Code settings
- Disable/uninstall the "Custom CSS and JS Loader" extension
- Run Disable Custom CSS and JS command if you had it enabled
Install this extension following the installation guide above
Enjoy the improved experience with native VS Code integration!
