#

audio-segmentation

Here are 20 public repositories matching this topic...

radadiavasu / AudioAnalysis

Whole Audio Analysis with Python

python feature-extraction audio-classification audio-segmentation diarization pyaudio-processing pyaudio-analysis

Updated Jun 14, 2024
Python

nianlonggu / WhisperSeg

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

transformer whisper audio-segmentation voice-activity-detection icassp2024 animal-sound-detection whisperseg

Updated Jun 12, 2024
Python

ina-foss / InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

huzaifakhan04 / music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing

This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.

music spotify data-science machine-learning big-data music-recommendation lsh web-application music-information-retrieval flask-application locality-sensitive-hashing ann cosine-distance audio-segmentation audio-processing audio-recommendation music-recommendation-system approximate-nearest-neighbors

Updated Mar 1, 2024
Jupyter Notebook

nuvita97 / music-source-separation

Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke

css python deep-neural-networks audio-segmentation unet-model fastapi streamlit

Updated Feb 2, 2024
Jupyter Notebook

dangrebenkin / speech_audio_separator

A useful tool to split speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).

audio-segmentation audio-processing

Updated Jan 28, 2024
Python

mt-upc / SegAugment

SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

data-augmentation audio-segmentation speech-translation

Updated Dec 21, 2023
Python

autosub

BingLingGroup / autosub

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

subtitles substation-alpha audio-segmentation xfyun cloud-speech-api voice-activity-detection baidu-api xunfei-api

Updated Dec 21, 2023
Python

dangrebenkin / wav2vec2_speech_markuper

Automatic generation of speech dataset markup using Wav2Vec2 ASR models

speech-recognition speech-to-text audio-segmentation forced-alignment wav2vec2

Updated Sep 20, 2023
Python

dangvansam / pyannote-onnx

PyAnnote Voice Activity Detection (ONNX version)

vad audio-segmentation speech-separation onnx speech-activity-detection audio-split audio-splitter pyannote voice-ac

Updated Sep 9, 2023
Jupyter Notebook

Metiu-Metiu / Neural-Texture-Sound-synthesis---data-sets

Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

data-augmentation audio-segmentation synthetic-dataset-generation audio-datasets synthetic-dataset real-dataset audio-dataset-for-machine-learning

Updated Aug 30, 2023

boromir674 / music-album-creator

Build a digital music library by downloading and segmenting youtube videos.

music cli metadata automation youtube music-library youtube-downloader command-line-tool audio-segmentation audio-processing music-metadata

Updated Aug 14, 2023
Python

amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Mar 30, 2023
Python

Appen / UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-recognition speech-processing audio-segmentation gender-classification speaker-diarization synthetic-speech-detection topic-detection speech-seperation speaker-identification accent-detection speech-transcription speech-annotation

Updated Mar 25, 2023
Forth

0x7o / PyanNet

Training and using audio segmentation

audio-segmentation

Updated Feb 27, 2023

mt-upc / SHAS

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

speech speech-to-text audio-segmentation speech-translation wav2vec2

Updated Feb 9, 2023
Python

ElHaban3ro / AsegTool

AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵

video-processing audio-segmentation audio-processing video-segmentation

Updated Nov 27, 2022
JavaScript

LIMUNIMI / labelSignal

Automatic annotation of timbre variation for monophonic musical instruments

audio signal-processing audio-analysis audio-segmentation timbre sound-and-music-computing

Updated Feb 22, 2022
MATLAB

luuil / Tools

Our Little Tools

docker dockerfile tensorflow grpc locust audio-segmentation tensorflow-serving svg2png savedmodel

Updated Jul 26, 2021
Stylus

yxlijun / solfege-segmentation

pitch detection,CNN

cnn audio-segmentation f0-detection solfege-segmentation

Updated Sep 21, 2018
Python

Improve this page

Add a description, image, and links to the audio-segmentation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-segmentation topic, visit your repo's landing page and select "manage topics."