- Rename middle-click-reader to read-aloud service
- Change hotkey from Ctrl+middle-click to Alt+R
- Replace edge-tts with Piper TTS for local neural voices
- Update desktop and service files
- Add piper-tts dependency
- Update tests and setup scripts
Dictation Service
A Linux voice dictation service with system tray icon and on-demand text-to-speech.
Features
🎤 Dictation Mode (Alt+D)
- Real-time voice-to-text transcription
- Text automatically typed into focused application
- System tray icon for visual status (no notifications)
- Toggle on/off via Alt+D or tray icon click
- High accuracy using Vosk speech recognition
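The dictation flow above (recognize speech, then type the result into the focused window) can be sketched roughly as follows. This is a minimal illustration, not the service's actual code: it assumes Vosk's JSON result format and that typing goes through the `ydotool type` command listed under Requirements. The helper names `extract_text`, `build_type_command`, `type_text`, and `dictate` are hypothetical.

```python
import json
import subprocess


def extract_text(result_json: str) -> str:
    """Pull the recognized phrase out of a Vosk result JSON string."""
    # Vosk's KaldiRecognizer.Result() returns JSON such as '{"text": "hello world"}'.
    return json.loads(result_json).get("text", "").strip()


def build_type_command(text: str) -> list[str]:
    """Build the ydotool invocation that types text into the focused window."""
    return ["ydotool", "type", text]


def type_text(text: str) -> None:
    """Type text via ydotool; empty phrases are skipped."""
    if text:
        subprocess.run(build_type_command(text), check=True)


def dictate(audio_chunks, recognizer) -> None:
    """Feed 16-bit PCM chunks to a recognizer and type each finished phrase."""
    for chunk in audio_chunks:
        # AcceptWaveform returns True once a complete utterance is available.
        if recognizer.AcceptWaveform(chunk):
            type_text(extract_text(recognizer.Result()))
```

In a real run, `recognizer` would typically be a `vosk.KaldiRecognizer` built from a downloaded model and the microphone sample rate, and `audio_chunks` would come from the microphone stream. Both are left to the caller here so the sketch stays independent of audio hardware.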
🔊 Read-Aloud (Middle-Click)
- Highlight text anywhere
- Middle-click (scroll wheel press) to read it aloud
- High-quality Microsoft Edge Neural TTS voice
- Works in all applications
- On-demand only (no automatic reading)
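The read-aloud steps above (grab the highlighted text, synthesize speech, play it) might be wired together along these lines. This is a hedged sketch rather than the project's implementation: it assumes the highlighted text is read from the X11 primary selection with `xclip`, that speech is produced by the `edge-tts` command-line tool, and that playback uses `mpv`. The voice name `en-US-AriaNeural` and all function names are illustrative assumptions.

```python
import subprocess
import tempfile
from pathlib import Path


def selection_command() -> list[str]:
    """Command that prints the currently highlighted (primary) selection."""
    return ["xclip", "-o", "-selection", "primary"]


def tts_command(text: str, out_path: str, voice: str = "en-US-AriaNeural") -> list[str]:
    """Command that writes synthesized speech for text to out_path."""
    return ["edge-tts", "--voice", voice, "--text", text, "--write-media", out_path]


def play_command(audio_path: str) -> list[str]:
    """Command that plays an audio file without opening a video window."""
    return ["mpv", "--no-video", "--really-quiet", audio_path]


def read_selection_aloud() -> bool:
    """Speak the highlighted text once; return False if nothing is selected."""
    result = subprocess.run(selection_command(), capture_output=True, text=True)
    text = result.stdout.strip()
    if result.returncode != 0 or not text:
        return False
    with tempfile.TemporaryDirectory() as tmp:
        audio = str(Path(tmp) / "speech.mp3")
        subprocess.run(tts_command(text, audio), check=True)
        subprocess.run(play_command(audio), check=True)
    return True
```

Because `read_selection_aloud` runs only when called, binding it to the middle-click trigger keeps the behaviour on-demand, as described above.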
Quick Start
# 1. Install dependencies
uv sync
# 2. Setup dictation service
./scripts/setup-keybindings.sh
systemctl --user enable --now dictation.service
# 3. Setup read-aloud (optional)
./scripts/setup-middle-click-reader.sh
# 4. Use dictation
# Press Alt+D, speak, press Alt+D again
# 5. Use read-aloud
# Highlight text, middle-click
See docs/README.md for detailed documentation.
Requirements
- Linux (tested on GNOME/Wayland)
- Python 3.12+
- Microphone
- System packages: portaudio19-dev, ydotool, xclip, mpv, GTK libraries
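A quick way to check these requirements before running the setup scripts is a small preflight script. The sketch below is an assumption-laden example, not part of the repository: it only checks that the command-line tools named above are on `PATH` and that the interpreter is new enough. It does not verify portaudio19-dev or the GTK libraries, and the names `REQUIRED_TOOLS`, `missing_tools`, `python_ok`, and `report` are hypothetical.

```python
import shutil
import sys

# Executables from the Requirements list that can be found on PATH.
REQUIRED_TOOLS = ("ydotool", "xclip", "mpv")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which) -> list[str]:
    """Return the tools that cannot be located on PATH."""
    return [tool for tool in tools if which(tool) is None]


def python_ok(version=sys.version_info) -> bool:
    """True when the interpreter meets the Python 3.12+ requirement."""
    return tuple(version[:2]) >= (3, 12)


def report() -> str:
    """Summarize any unmet requirements as a single line of text."""
    problems = [f"missing tool: {name}" for name in missing_tools()]
    if not python_ok():
        problems.append("Python 3.12 or newer is required")
    return "; ".join(problems) if problems else "all checked requirements found"
```

Calling `print(report())` gives a one-line summary. The `which` parameter exists only so the lookup can be replaced in tests.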
License
[Your License]