interspeech

Star

Here are 21 public repositories matching this topic...

BakerBunker / FreeV

Star

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

speech speech-synthesis vocoder interspeech

Updated Jun 12, 2024
Python

mariateleki / Comparing-ASR-Systems

Star

Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.

speech-recognition automatic-speech-recognition speech-to-text interspeech disfluency disfluency-detection disfluency-detector whisperx interspeech2024 google-asr

Updated Jun 4, 2024
Jupyter Notebook

DmitryRyumin / INTERSPEECH-2023-Papers

Star

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

Updated May 18, 2024

DmitryRyumin / NewEraAI-Papers

Star

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

natural-language-processing computer-vision deep-learning text-classification signal-processing image-processing artificial-intelligence video-processing neural-networks emnlp cvpr iccv icassp ismir interspeech mashine-learning

Updated May 18, 2024
Python

doheejin / SB_loss_PA

Star

This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).

nlp apa language-learning pronunciation assessment loss-functions scoring-functions interspeech pronunciation-scoring balanced-loss interspeech2023 score-balanced-loss automatic-pronunciation-assessment

Updated Apr 29, 2024
Python

soham97 / awesome-sound_event_detection

Star

Reading list for research topics in Sound AI

representation-learning audio-processing zero-shot-learning icassp sound-event-detection interspeech acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Apr 28, 2024

Nexdata-AI / Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

Star

Interspeech2020 Accented English Speech Recognition Competition Data

audio deep-neural-networks recognition deep-learning speech dataset speech-recognition speech-to-text asr interspeech asr-model

Updated Apr 18, 2024

INTERSPEECH-2024 / MER

Star

Official repo for "Multi-Corpus Emotion Recognition Method based on Cross-Modal Gated Attention Fusion" in INTERSPEECH 2024

transformers computational-linguistics human-computer-interaction interspeech multimodal-emotion-recognition interspeech2024 gated-feature-fusion

Updated Mar 13, 2024
Python

KarelianSpeech / AnKaS

Star

AnKaS: Development and Analysis of the Database of Livvi-Karelian Speech Annotations [INTERSPEECH 2024]

interspeech interspeech2024 ankas

Updated Mar 13, 2024
JavaScript

gabrielmittag / NISQA

Star

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

text-to-speech deep-learning pytorch tts speech-synthesis voice-conversion icassp speech-quality quality-of-experience interspeech

Updated Mar 8, 2024
Python

FrenchKrab / IS2023-powerset-diarization

Star

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

speaker-diarization interspeech pyannote

Updated Oct 18, 2023
Jupyter Notebook

Lhx94As / PHO-LID

Star

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

pytorch interspeech spoken-language-identification

Updated Aug 24, 2023
Python

cmu-mlsp / Learning_from_weak_labels

Star

[Interspeech 2022] Tutorial - Learning from Weak Labels

interspeech weak-label

Updated Sep 18, 2022
MATLAB

hechmik / voxceleb_enrichment_age_gender

Star

Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021

machine-learning deep-learning sound gender-recognition age age-regression age-prediction interspeech voxceleb asru2021 voxceleb-enrichment

Updated Dec 18, 2021
Jupyter Notebook

doerlbh / MiniVox

Star

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

paper speaker-recognition online-learning speaker-diarization contextual-bandits bandit-algorithms interspeech self-supervised-learning acml interspeech2020 online-speaker-diarization

Updated Sep 20, 2021
Cuda

coolEphemeroptera / AESRC2020

Star

a deep accent recognition network

keras resnet speaker-recognition asr ctc mtl crnn arcface netvlad interspeech cosface ghostvlad circle-loss accent-recognition

Updated Aug 25, 2021
Python

ChingtingC / Code-Switching-Sentence-Generation-by-GAN

Star

Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)

generative-adversarial-network code-switching interspeech

Updated Mar 29, 2021
Python

jlinear / ReMASC_Exp

Star

Baseline Experiments for ReMASC dataset.

vcs replay-attack interspeech remasc

Updated Mar 14, 2020
C

whydinkov / interspeech-2019

Star

Interspeech 2019 experiments

nlp sklearn keras audio-processing interspeech

Updated Aug 28, 2019
Python

ronggong / interspeech2018_submission01

Star

Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

hmm keras cnn forced-alignment hsmm beijing-opera singing-voice interspeech

Updated Aug 8, 2018
Python

Improve this page

Add a description, image, and links to the interspeech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the interspeech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

interspeech

Here are 21 public repositories matching this topic...

BakerBunker / FreeV

mariateleki / Comparing-ASR-Systems

DmitryRyumin / INTERSPEECH-2023-Papers

DmitryRyumin / NewEraAI-Papers

doheejin / SB_loss_PA

soham97 / awesome-sound_event_detection

Nexdata-AI / Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

INTERSPEECH-2024 / MER

KarelianSpeech / AnKaS

gabrielmittag / NISQA

FrenchKrab / IS2023-powerset-diarization

Lhx94As / PHO-LID

cmu-mlsp / Learning_from_weak_labels

hechmik / voxceleb_enrichment_age_gender

doerlbh / MiniVox

coolEphemeroptera / AESRC2020

ChingtingC / Code-Switching-Sentence-Generation-by-GAN

jlinear / ReMASC_Exp

whydinkov / interspeech-2019

ronggong / interspeech2018_submission01

Improve this page

Add this topic to your repo