Kade Heyborne cca6bd2aee
Refactor middle-click reader to read-aloud service with Alt+R hotkey and Piper TTS
- Rename middle-click-reader to read-aloud service
- Change hotkey from Ctrl+middle-click to Alt+R
- Replace edge-tts with Piper TTS for local neural voices
- Update desktop and service files
- Add piper-tts dependency
- Update tests and setup scripts
2025-12-10 20:10:44 -07:00

Dictation Service

A Linux voice dictation service with a system tray icon and on-demand text-to-speech.

Features

🎤 Dictation Mode (Alt+D)

  • Real-time voice-to-text transcription
  • Text automatically typed into focused application
  • System tray icon for visual status (no notifications)
  • Toggle on/off via Alt+D or tray icon click
  • High accuracy using Vosk speech recognition
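
The recognize-then-type flow described above can be sketched roughly as follows. This is an illustrative sketch, not the service's actual code. It assumes the recognizer hands back a Vosk-style JSON string with a "text" field, and that typing is done by invoking `ydotool type` (ydotool is listed under Requirements). The helper names `extract_text` and `build_type_command` are hypothetical.

```python
import json


def extract_text(result_json: str) -> str:
    """Return the recognized text from a Vosk-style JSON result string."""
    payload = json.loads(result_json)
    # Final results carry a "text" field; treat a missing field as silence.
    return payload.get("text", "").strip()


def build_type_command(text: str) -> list[str]:
    """Build the argv for typing text with ydotool (assumed typing backend)."""
    return ["ydotool", "type", text]
```

A caller would typically skip empty results and pass the command to `subprocess.run(build_type_command(text), check=True)` so the text lands in whichever window has focus.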

🔊 Read-Aloud (Alt+R)

  • Highlight text anywhere
  • Press Alt+R to read it aloud
  • Local neural voice via Piper TTS
  • Works in all applications
  • On-demand only (no automatic reading)
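
One plausible way to pick up the highlighted text is to read the primary selection with xclip, which is listed under Requirements. The sketch below only shows that step plus whitespace cleanup; handing the text to the speech engine is left out. The function names are hypothetical, and whether the service reads the selection this way is an assumption.

```python
import subprocess


def normalize_selection(raw: str) -> str:
    """Collapse runs of whitespace so wrapped lines are spoken as one passage."""
    return " ".join(raw.split())


def read_primary_selection() -> str:
    """Return the highlighted text via xclip, or "" if it cannot be read."""
    try:
        result = subprocess.run(
            ["xclip", "-o", "-selection", "primary"],
            capture_output=True,
            text=True,
            check=False,
        )
    except FileNotFoundError:
        # xclip is not installed; report an empty selection instead of crashing.
        return ""
    return normalize_selection(result.stdout)
```

Returning an empty string on failure keeps the on-demand behavior simple: if nothing is highlighted, there is nothing to read.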

Quick Start

# 1. Install dependencies
uv sync

# 2. Set up dictation service
./scripts/setup-keybindings.sh
systemctl --user enable --now dictation.service

# 3. Set up read-aloud (optional)
./scripts/setup-middle-click-reader.sh

# 4. Use dictation
# Press Alt+D, speak, press Alt+D again

# 5. Use read-aloud
# Highlight text, press Alt+R

See docs/README.md for detailed documentation.

Requirements

  • Linux (GNOME/Wayland tested)
  • Python 3.12+
  • Microphone
  • System packages: portaudio19-dev, ydotool, xclip, mpv, GTK libraries
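
Before running the setup scripts, it can help to confirm the command-line tools and Python version listed above are present. The following is a minimal sketch of such a check, not part of the project; the names `REQUIRED_TOOLS`, `missing_tools`, and `python_ok` are hypothetical. It only covers executables, so portaudio19-dev and the GTK libraries would still need to be checked through the system package manager.

```python
import shutil
import sys

# Executables from the requirements list (libraries are not checked here).
REQUIRED_TOOLS = ("ydotool", "xclip", "mpv")
MIN_PYTHON = (3, 12)


def missing_tools(tools=REQUIRED_TOOLS) -> list[str]:
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def python_ok(version=None) -> bool:
    """Check that the interpreter meets the minimum major.minor version."""
    version = sys.version_info if version is None else version
    return tuple(version[:2]) >= MIN_PYTHON


if __name__ == "__main__":
    print("Python OK:", python_ok())
    print("Missing tools:", missing_tools() or "none")
```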

License

[Your License]

Description
AI Dictation Service with voice-to-text and AI conversation capabilities