Skip to content

Coldaine/ColdVox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

551 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ColdVox

Internal alpha. This repository is under active cleanup and the Windows path is still being hardened.

ColdVox is a Rust voice pipeline: audio capture -> VAD -> STT -> text injection.

Current Reality

  • The checked-in default config stays deterministic and test-friendly: config/default.toml starts with the mock STT path.
  • The supported Windows live path for this wave is parakeet on NVIDIA/CUDA hardware.
  • The Windows GUI is not the shipped path for this wave. cargo run -p coldvox-gui is only a stub smoke check.
  • CI is not the gate for this wave. Local Windows validation is the gate.

Windows Live Path

Prerequisites:

  • Windows 11
  • NVIDIA GPU with working CUDA support
  • A downloaded Parakeet model directory that is either exposed through PARAKEET_MODEL_PATH or available in the normal shared/local discovery roots

Commands:

just windows-run-preflight
just windows-smoke
just test

To opt into the live runtime during the test gate:

$env:COLDVOX_RUN_WINDOWS_LIVE = '1'
just test

To run the live validation directly:

just windows-live

Validation artifacts are written under logs/windows-validation/<timestamp>-<mode>/.

Developer Commands

cargo check -p coldvox-app
cargo fmt --all -- --check

For the detailed Windows operator path, see docs/windows-live-runbook.md. For agent/developer repo truth, see AGENTS.md and docs/plans/current-status.md.

About

Voice dictation system for Linux desktop. Rust multi-crate workspace with audio capture pipeline, Silero VAD, GPU-accelerated STT (Parakeet/Whisper), and text injection backends (ydotool, AT-SPI, clipboard). Wayland/X11 platform-aware.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors