- Introduced a new DictateConfig field for the vocabulary extension file path.
- Updated transcription functions to apply vocabulary extensions from the specified file.
- Enhanced UI to allow users to browse and set the vocabulary extension file, with clear instructions on the format for replacement rules.
- Transcribe dropped files to a text file via a Save dialog (no paste)
- Default save folder: source folder of audio file
- Chunk long audio (45s) to avoid ORT errors with Parakeet
- Prefer ffmpeg for WAV/MP3 conversion (better compatibility)
- Add transcription-debug.log for debugging
- Support 24/32-bit WAV; fall back to ffmpeg for exotic formats
Made-with: Cursor
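
The chunking change above can be illustrated with a minimal sketch. It assumes the audio is already decoded to mono `f32` samples and reads the "45s" figure as the maximum chunk length; the function name `chunk_samples` and its parameters are hypothetical and not taken from the codebase.

```rust
/// Split mono PCM samples into consecutive chunks of at most `chunk_secs` seconds.
/// The final chunk may be shorter than the others.
/// (Hypothetical helper; name and signature are assumptions.)
fn chunk_samples(samples: &[f32], sample_rate: usize, chunk_secs: usize) -> Vec<&[f32]> {
    // `chunks` panics on a chunk size of zero, so clamp to at least one sample.
    let chunk_len = (sample_rate * chunk_secs).max(1);
    samples.chunks(chunk_len).collect()
}
```

Each chunk would then be transcribed separately and the resulting text joined. Splitting at fixed sample offsets can cut a word in half, so a real implementation might prefer to split on silence or overlap the chunks slightly.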
- Add audio module: WAV/MP3 conversion to 16kHz mono for Parakeet
- Add drop zone for audio files (WAV, MP3) at bottom of settings UI
- Enable drag-and-drop in viewport, process dropped files
- Fix model download: use altunenes/parakeet-rs/tdt with correct filenames
- Move drop zone above status line
- Increase window width by ~1/3 (640px default, 500px min)
Made-with: Cursor
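
As a rough sketch of what converting to 16 kHz mono involves, the two helpers below downmix interleaved samples and resample with linear interpolation. Both function names are hypothetical, and the approach is an assumption: the actual audio module may rely on ffmpeg or a dedicated resampling crate instead.

```rust
/// Average each interleaved frame down to a single channel.
/// (Hypothetical helper; not taken from the codebase.)
fn downmix_to_mono(interleaved: &[f32], channels: usize) -> Vec<f32> {
    if channels <= 1 {
        return interleaved.to_vec();
    }
    interleaved
        .chunks_exact(channels) // any trailing partial frame is ignored
        .map(|frame| frame.iter().sum::<f32>() / channels as f32)
        .collect()
}

/// Resample mono audio with linear interpolation.
/// No anti-aliasing filter is applied, so this is illustrative only.
fn resample_linear(input: &[f32], from_hz: u32, to_hz: u32) -> Vec<f32> {
    // Return the input unchanged when there is nothing to do or a rate is invalid.
    if input.is_empty() || from_hz == 0 || to_hz == 0 || from_hz == to_hz {
        return input.to_vec();
    }
    let out_len = (input.len() as u64 * to_hz as u64 / from_hz as u64) as usize;
    let step = from_hz as f64 / to_hz as f64;
    let last = input.len() - 1;
    (0..out_len)
        .map(|i| {
            // Position of output sample `i` on the input timeline.
            let pos = i as f64 * step;
            let idx = pos.floor() as usize;
            let frac = (pos - idx as f64) as f32;
            let a = input[idx.min(last)];
            let b = input[(idx + 1).min(last)];
            a + (b - a) * frac
        })
        .collect()
}
```

For a 48 kHz stereo file, calling `downmix_to_mono(&samples, 2)` followed by `resample_linear(&mono, 48_000, 16_000)` would yield the 16 kHz mono buffer that Parakeet expects.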
- Check model folder for required files (vocab.txt, encoder, decoder)
- Download from Hugging Face when model missing
- Auto-download on app start if path set but folder empty
- Download button and progress in Settings UI
Made-with: Cursor
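
The model folder check above might look something like the sketch below. The function name `missing_model_files` is hypothetical, and the list of required file names is passed in by the caller because the exact encoder and decoder file names are not stated here.

```rust
use std::path::Path;

/// Return the names from `required` that do not exist as files in `model_dir`.
/// An empty result means the folder looks complete.
/// (Hypothetical helper; name and signature are assumptions.)
fn missing_model_files(model_dir: &Path, required: &[&str]) -> Vec<String> {
    let mut missing = Vec::new();
    for name in required {
        // `is_file` is false for missing paths and for directories.
        if !model_dir.join(name).is_file() {
            missing.push(name.to_string());
        }
    }
    missing
}
```

A non-empty result could then trigger the Hugging Face download, both on app start and from the Download button in the Settings UI.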