Machine Hearing

Machine Hearing, or Machine Listening, is the use of Machine Learning and audio sensors to derive meaningful information from sound. This include listening for and diagnosing problems in machinery, understanding events and activities that cause noise, and estimation of how humans perceive certain sounds.

Here you can find some notes on the topic compiled by Jon Nordby.

This research is sponsored by Soundsensing, a provider of IoT audio sensors with built-in Machine Learning, used for Noise Monitoring and Condition Monitoring. The sensors are ideal for continious monitoring of audible noises and events, and can perform tasks such as Audio Classification, Audio Event Detection and Acoustic Anomaly Detection. Their sensors can transmit compressed and privacy-preserving spectrograms, allowing Machine Learning to be done in the cloud using familiar tools like Python. Or models can be deployed onto the sensor itself, for a highly efficient on-edge ML solution.

Pages

Some information is found in sub-pages.

Audio Quality

Recent work

EuroPython 2021: Sound Event Detection with Machine Learning

Youtube: Sound Event Detection with Machine Learning (EuroPython 2021)

July 26, 2021. Presented at EuroPython 2021. Video recording, slides, notes.

TinyML EMEA 2021: Perfect coffee roasting with TinyML sound sensing

June 7, 2021. Presented at tinyML EMEA Technical Forum 2021. Video recording coming, slides, notes.

TinyML Summit 2021: Environmental Sound Classification on microcontrollers

March 25, 2021. Video recording, slides, notes.

Classifying sound using Machine Learning

At KnowIt Oslo, 2020. Video recording, slides, notes

Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

Master thesis. Report and code available in the Github repository.

EuroPython2019: Audio Classification using Machine Learning

Youtube: Audio Classification using Machine Learning by Jon Nordby, EuroPython 2019

Presentation at EuroPython2019. Video recording, notes

PyCode2019: Recognizing sounds with Machine Learning and Python

Presentation at PyCode Conference 2019 in Gdansk. Slides, notes

Video recording. Coming, maybe in November.

SenseCamp2019: Classification of Environmental Sound using IoT sensors

Presentation at SenseCamp 2019 hosted by FORCE Technology Senselab. Slides: web, .PDF

NMBU lecture on Audio Classification

Report and lecture at NMBU Data Science.

Report | Slides

Stack Overflow answers

With example code in Python

Notes

Rough notes on various topics.

Applications. Practical applications of Machine Hearing
Tasks. Established problem formulations
Audio Quality. Metrics for measuring audio quality
Explainable models for Audio.
Features. Feature representations
Preprocessing. Preprocessing techniques
DCASE2018. Notes from DCASE2018 challenge and conference
Commercial solutions. Companies and products in Machine Hearing
Speech. Speech-specific techniques and applications
Music. Music-specific techniques and applications
Compressive Sensing.

Resources

Useful resources to learn more.

Presentations

Audio Event Detection w/Deep Learning. By Robert Coop, Ph.D, Head of AI and ML @ Stanley B&D. From Data Science Connect, 2028.

Books

Computational Analysis of Sound Scenes and Events. Tuomas Virtanen, Mark D. Plumbley, Dan Ellis. 2018.
Human and Machine Hearing - Extracting Meaning from Sound. Richard F. Lyon. 2017, revised 2018.
An Introduction to Audio Content Analysis - Applications in Signal Processing and Music Informatics. Alexander Lerch. 2012. Companion website: https://www.audiocontentanalysis.org/
Machine Learning for Audio, Image and Video Analysis: Theory and Applications (Advanced Information and Knowledge Processing). Francesco Camastra, 3 sections. From Perception to Computation, Machine Learning, Applications.

Online courses

CSC 83060: Speech and Audio Understanding. http://mr-pc.org/t/csc83060/ Brooklyn College (CUNY).
Deep Learning (for Audio) with Python by Valerio Velardo
PyTorch for Audio + Music Processing by Valerio Velardo

Software

Feature extraction

librosa. The go-to Python module.
essentia. C++ library, with Python bindings. Lots of Music Analysis extractors. Used by FreeSound and Acousticbrainz.
kapre. On-demand GPU computation of melspectrograms, for Keras
torchaudio. Audio processing in PyTorch

Data Augmentation

muda: Python library for augmenting annotated audio data
audiomentations.
scaper. Soundscape synthesis tool with automatic label handling.

Lecture notes

Audio Classification. http://www.cs.tut.fi/~sgn24006/PDF/L04-audio-classification.pdf Covers low-level features, MFCC. Classification by distance metrics. GMM. HMM.
Speech Signal Analysis, Lecture 2. January 2017, Hiroshi Shimodaira and Steve Renals. ! great diagrams of audio discretization, mel filters, wide versus narrow-band spectrograms.

Competions

Kaggle Whale detection
Kaggle FreeSound tagging 2018
Kaggle FreeSound
DCASE2014
DCASE2018
DCASE2019
DCASE2020
DCASE2021

Datasets

Online Communities

https://mircommunity.slack.com/ - Music Information Retrieval
The Sound of AI, Slack Community

Lists

Awesome Deep Learning Music
Fast.ai forums: Deep Learning with Audio. Large lists of resources, both in first post and "popular links". Feb 2019, 315 replies over 4 months.

Name		Name	Last commit message	Last commit date
Latest commit History 305 Commits
.github		.github
AppliedMlDays2022		AppliedMlDays2022
ICSV27		ICSV27
ICSV28		ICSV28
INTAP2021		INTAP2021
PyDataGlobal2021		PyDataGlobal2021
SimulaBI2022		SimulaBI2022
audio-quality		audio-quality
concepts		concepts
dataset-workshop		dataset-workshop
drafts		drafts
environmental-noise		environmental-noise
euronoise2021		euronoise2021
europython2019		europython2019
europython2021		europython2021
explainable		explainable
geekleml2021		geekleml2021
handson		handson
hear2021		hear2021
icassp2022		icassp2022
ideas		ideas
img		img
knowit2020		knowit2020
nora-annual-conference-2021		nora-annual-conference-2021
nordicaimeet2021		nordicaimeet2021
pycode2019		pycode2019
pydataberlin2020		pydataberlin2020
pydataberlin2022		pydataberlin2022
quality-control		quality-control
rise2022		rise2022
sensecamp2019		sensecamp2019
tinyml2021		tinyml2021
tinymlEMEA2021		tinymlEMEA2021
README.md		README.md
applications.md		applications.md
audio-segmentation.md		audio-segmentation.md
braindump.md		braindump.md
commercial.md		commercial.md
compressive-sensing.md		compressive-sensing.md
covid-19.md		covid-19.md
dcase2018.md		dcase2018.md
dcase2019.md		dcase2019.md
dcase2022.md		dcase2022.md
features.md		features.md
music.md		music.md
preprocessing.md		preprocessing.md
spectrogram-compression.md		spectrogram-compression.md
speech.md		speech.md
tasks.md		tasks.md

jonnor/machinehearing

Folders and files

Latest commit

History

Repository files navigation

Machine Hearing

Pages

Recent work

EuroPython 2021: Sound Event Detection with Machine Learning

TinyML EMEA 2021: Perfect coffee roasting with TinyML sound sensing

TinyML Summit 2021: Environmental Sound Classification on microcontrollers

Classifying sound using Machine Learning

Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

EuroPython2019: Audio Classification using Machine Learning

PyCode2019: Recognizing sounds with Machine Learning and Python

SenseCamp2019: Classification of Environmental Sound using IoT sensors

NMBU lecture on Audio Classification

Stack Overflow answers

Notes

Resources

Presentations

Books

Online courses

Software

Lecture notes

Competions

Datasets

Online Communities

Lists

About

Topics

Resources

Stars

Watchers

Forks

Languages