FitVoice/Speech2Diet is an application that allows people to track their food intake by voice recording what they eat throughout the day.
-
Updated
May 16, 2024 - TypeScript
FitVoice/Speech2Diet is an application that allows people to track their food intake by voice recording what they eat throughout the day.
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
OpenAI .NET sdk - Azure OpenAI, ChatGPT, Whisper, and DALL-E
Batch Local Transcribe Audio/Movie To Text With Whisper AI Model. Keep Privacy Safe!
Interviewee is a Java application that uses OpenAI API to provide audio transcribes, answers to interview questions and translating questions and answers to another language. The application captures audio during interviews, transcribes the conversations, and displays subtitles in real time.
llm server using outlines for json/regex/cfg formatted generation
Batch Multi-Media Transcribe - Transcript using (CPU-CUDA-API)
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Automatically generate, translate and overlay subtitles for any video.
AI-powered study companion for visually impaired students. Developed by Edumakers, from Tecnológico de Monterrey
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Transcribe, diarize, annotate and subtitle audio and video with Whisper ... fast!
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
Interactive web tool for automatically ⚙️ transcribing and subtitling videos from URL or file uploads in your chosen language. The transcript appears alongside the video player, complete with embedded subtitles.
A python vlc player that transcribes subtitles on the Intel NPU
WhisperAPI is a fast and reliable API that transcribes video and audio files into text with support for all models and languages. It offers time-stamped results and translation to English.
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Using OpenAI's whisper or whisper-faster and ffmpeg take a list of video and audio files and provide subtitles
🌬️ Automatic Speech Recognition (ASR) system, for efficient and accurate voice transcription.
Speech to Text. How do you broadcast text transcribing using Assembly.ai and broadcast to browser? Python, Assembly.AI, websocket
Add a description, image, and links to the whisper-ai topic page so that developers can more easily learn about it.
To associate your repository with the whisper-ai topic, visit your repo's landing page and select "manage topics."