Tracking the progress in end-to-end speech translation
-
Updated
Oct 25, 2023
Tracking the progress in end-to-end speech translation
A PyPI package for fast word/character error rate (WER/CER) calculation
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
10 digits recognition system based on DTW, HMM and GMM
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)
Example codes for my PhD work on recognizing dimensional emotions in spoken dialogue
software that analyzes speech utterances
Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917
Speech subtask of the 2017 NLI Shared Task
This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for understanding its meaning. The model operates on human-annotated corpus of word importance for its training and evaluation. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018
Convex combination of phonotactics for large-scale spoken language identification
🚧The Internet + project YiLuYuBan.The project is too messy, has moved to https://github.com/wanghao15536870732/ChatWithChinese
The Ruby Programming Language
A guide to spoken language processing
RNN for Spoken Language Understanding
All NLP related courses on DataCamp
Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"
Add a description, image, and links to the spoken-language-processing topic page so that developers can more easily learn about it.
To associate your repository with the spoken-language-processing topic, visit your repo's landing page and select "manage topics."