The
rtst
is a Python package for comprehensive speech processing. It integrates speech recognition, language detection, translation, and text-to-speech functionalities into a single, easy-to-use function.
- Unified function for speech recognition, language detection, translation, and speaking the translation.
- Support for multiple speech recognition services: Google, Azure, AWS, and Hugging Face.
- Automatic translation of recognized speech to English.
- Vocalization of translated text.
Install the package using pip:
pip install rtst
Here’s how to use rtst
to recognize, translate, and vocalize speech:
The package provides a rtst
function, which takes care of the entire speech processing workflow:
import rtst
# Call the function with your preferred service and language code
rtst.process_speech()
This function will:
- Capture speech from the microphone.
- Recognize the speech using the specified service.
- Detect the language of the recognized speech.
- Translate the speech to English if it's in a different language.
- Vocalize the translated text.
For more advanced usage, refer to the package documentation. You can customize various aspects such as selecting different speech recognition services or handling different languages.
MIT License
Contributions to rtst
are welcome. Please follow the guidelines provided in the repository for contributing.