Kaldi Reader - SCP & ARK Viewer
English | 简体中文 | 日本語
Kaldi Reader is a Visual Studio Code extension for reading Kaldi .scp files and inspecting .ark entries directly in VS Code. It focuses on Kaldi archive entries and keeps ordinary audio path linking in AudioLens.
Install
Install from the Visual Studio Marketplace:
https://marketplace.visualstudio.com/items?itemName=simzhou.kaldi-reader
Or from Open VSX:
https://open-vsx.org/extension/simzhou/kaldi-reader
Highlights
- Detects
*.ark:<offset> references in Kaldi .scp files and turns them into clickable links.
- Validates wav ark entries at the byte offset and opens them with AudioLens.
- Reads Kaldi binary
FloatMatrix(FM) entries as raw matrix text.
- Reads Kaldi binary
Int32Vector entries as raw integer vector text.
- Resolves relative ark paths from the
.scp file directory first, then from the current workspace root.
- Works as a workspace extension for local and Remote SSH workspaces.
- Follows the VS Code display language by default.
Scope
Kaldi Reader handles Kaldi ark offset links:
wav.ark:12345
feats.ark:12345
ali.ark:12345
AudioLens handles ordinary audio paths in text files, such as .wav, .flac, .mp3, .pcm, and .raw. AudioLens can still open .ark files directly, but Kaldi Reader owns text-link detection for *.ark:<offset>.
Relative Ark Paths
Relative ark paths are resolved with two stable bases:
- The directory of the current
.scp file.
- The current workspace root.
Other implicit CWD-based paths are not guessed automatically.
Install From VSIX
Install a local packaged build with:
code --install-extension dist/kaldi-reader-0.1.1.vsix
Development
npm install
npm run build
npm run typecheck
npm run package
Press F5 in VS Code and choose the extension development host. Open a Kaldi .scp file and click an ark entry link.
Author
SimZhou: https://simzhou.com/en/about/
Copyright
Copyright (c) 2026 SimZhou. All rights reserved.