Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
updated at May 5, 2024, 7:12 p.m.
Command-line program to download videos from YouTube.com and other video sites
updated at May 5, 2024, 7:07 p.m.
Python Data Science Handbook: full text in Jupyter Notebooks
updated at May 5, 2024, 6:43 p.m.
Command line utility for forced alignment using Kaldi
updated at May 5, 2024, 6:30 p.m.
A library for audio and music analysis, feature extraction.
updated at May 5, 2024, 4:47 p.m.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
updated at May 5, 2024, 3:44 p.m.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
updated at May 5, 2024, 3:18 p.m.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
updated at May 5, 2024, 3:10 p.m.
Nimfa: Nonnegative matrix factorization in Python
updated at May 5, 2024, 2:05 p.m.
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
updated at May 5, 2024, 1:35 p.m.
🎚️ Open Source Audio Matching and Mastering
updated at May 5, 2024, 11:48 a.m.
Instructional notebooks on music information retrieval.
updated at May 5, 2024, 11:26 a.m.
music21 is a Toolkit for Computational Musicology
updated at May 5, 2024, 11:18 a.m.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
updated at May 5, 2024, 9 a.m.