Speech Interface (Faster Whisper)

Speech Interface (Faster Whisper)

kvadratni

Adds voice interaction to AI models using local speech recognition and text-to-speech. Lets you talk to AI assistants instead of typing, with real-time audio processing and 54+ voice options.

81454 views13Local (stdio)

What it does

  • Convert speech to text using faster-whisper
  • Generate speech from text with 54+ voice options
  • Transcribe audio and video files with timestamps
  • Create multi-speaker narrations for stories
  • Process real-time voice input with silence detection
  • Display audio visualization in modern UI

Best for

Developers wanting voice interfaces for AI assistantsCreating audio content and narrationsAccessibility for hands-free AI interactionTranscribing media files locally
Fully local processingModern PyQt UI with visualizationRemembers voice preferences

Alternatives