speech-translation

Here are 47 public repositories matching this topic...

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated May 25, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 25, 2024
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 23, 2024
Python

microsoft / SpeechT5

Star

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

speech-synthesis speech-recognition speech-translation speech-pretraining speecht5 speech2c speechlm speechut speech-text-pretraining vatlm vallex

Updated Apr 24, 2024
Python

double22a / speech_dataset

Star

The dataset of Speech Recognition

audio text-to-speech deep-neural-networks deep-learning speech tts speech-synthesis dataset wav speech-recognition automatic-speech-recognition speech-to-text voice-conversion asr speech-separation speech-enhancement speech-segmentation speech-translation speech-diarization

Updated Mar 7, 2023

Dadangdut33 / Speech-Translate

Star

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

python translate whisper tkinter-python speech-translation speech-transcription

Updated Jan 18, 2024
Python

kahne / SpeechTransProgress

Star

Tracking the progress in end-to-end speech translation

natural-language-processing machine-translation artificial-intelligence natural-language-generation speech-processing spoken-language-processing speech-translation spoken-language-translation

Updated Oct 25, 2023

bzhangGo / zero

Star

Zero -- A neural machine translation system

transformer neural-machine-translation average-attention-network aan speech-translation depth-scaled-initialization deep-transformer l0drop adaptive-feature-selection massively-multilingual-translation opus-100 fast-bidirectional-decoder

Updated May 8, 2023
Python

echogarden-project / echogarden

Star

Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

text-to-speech speech language-detection speech-synthesis speech-recognition speech-to-text source-separation language-identification forced-alignment speech-translation speech-alignment

Updated May 15, 2024
TypeScript

JeffWang0325 / Microsoft-Azure-Cognitive-Services

Star

🖍️ This project combines multiple operations in Microsoft Azure Cognitive Services into one GUI, including QnA Maker, LUIS, Computer Vision, Custom Vision, Face, Form Recognizer, Text To Speech, Speech To Text and Speech Translation. It's very user-friendly for users to implement any operation mentioned above.

microsoft text-to-speech translation computer-vision azure speech-synthesis speech-recognition face face-recognition face-detection luis speech-to-text cognitive-services qna-maker qnamaker customvision luis-ai speech-translation formrecognizer

Updated Nov 2, 2021
C#

ictnlp / STEMM

Star

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

machine-translation speech-to-text speech-translation

Updated Oct 25, 2023
Python

ReneeYe / ConST

Star

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

translation machine-translation pytorch transformer neural-machine-translation spoken-language-processing speec speech-translation contrastive-learning naacl2022

Updated May 25, 2022
Python

zhangshaolei1998 / Awesome-Simultaneous-Translation

Star

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

nlp natural-language-processing streaming awesome paper machine-translation text-translation paperlist speech-translation simultaneous-translation simultaneous-machine-translation

Updated Jan 14, 2024

liamdugan / speech-to-speech

Star

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Sep 1, 2023
Python

ictnlp / DASpeech

Star

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

machine-translation speech-translation speech-to-speech speech-to-speech-translation

Updated Jan 16, 2024
Python

mt-upc / ZeroSwot

Star

Pushing the Limits of Zero-shot End-to-End Speech Translation

translation speech-translation

Updated Mar 30, 2024
Python

George0828Zhang / torch_cif

Star

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.