Skip to content

MaxAFriedrich/whisper_dictate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Whisper Dictation

⚠️ This is a very early version that should be considered an MVP. ⚠️

This is a simple tool that is designed to make it feasible to dictate text using the whisper speech to text model from open AI. Whisper is not designed for dictation, so it requires quite a bit of fiddling to make it work nicely.Whisper is only really designed for transcription and translation.

Usage

Run the script and then start talking. The text should appear in the application and emulate your keyboard.Note that there is significant amounts of latency due to the architectureof the Whisper Machine Learning Model.

TODO

  • Reduce Latency
  • Add cli arguments
  • Add a better UX