Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
-
Updated
Jun 5, 2024 - C++
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Pybind11 bindings for Whisper.cpp
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
The "Audio to Text Transcription with AssemblyAI and Streamlit" project is a web application that allows users to upload audio files and convert them into text using the AssemblyAI API.
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Raiha Discord Accessibility Bot
Transcription and annotation interface for recorded audio or video files
Streamlined GUI for effortless audio transcription.
WhisperAudioTranscriber is an asynchronous audio recording and transcription tool built using Python. It utilizes the Hugging Face API, specifically leveraging the powerful capabilities of OpenAI's Whisper model
Automate Audio Transcription with OpenAI: Fast, Accurate, and Easy!
ClearSpeak is a real-time audio transcription application using Google's Speech-to-Text API. It features a Tkinter-based GUI, filtering background noise, and providing clear speech transcription.
Transcribe m4a audio file into a transcript text file
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
Deepgram Transcription Processor is a Python program designed to process transcription output obtained from Deepgram's transcription service. It extracts key information such as conversation, summary, and paragraphs from the transcription output JSON and writes them to separate text files for further analysis and reference.
cloud audio transcription with whisper or whisperX
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Speech-To-Text (STT) project
Add a description, image, and links to the audio-transcription topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcription topic, visit your repo's landing page and select "manage topics."