Command-line program to download videos from YouTube.com and other video sites
updated at May 12, 2024, 12:27 a.m.
Tutorial material on the scientific Python ecosystem
updated at May 12, 2024, 12:24 a.m.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
updated at May 12, 2024, 12:10 a.m.
Python Data Science Handbook: full text in Jupyter Notebooks
updated at May 11, 2024, 11:40 p.m.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
updated at May 11, 2024, 11:35 p.m.
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
updated at May 11, 2024, 11:08 p.m.
Audio processing by using pytorch 1D convolution network
updated at May 11, 2024, 3:16 p.m.
Command line utility for forced alignment using Kaldi
updated at May 11, 2024, 3:06 p.m.
A library for audio and music analysis, feature extraction.
updated at May 11, 2024, 1:15 p.m.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
updated at May 11, 2024, 11:59 a.m.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
updated at May 11, 2024, 11:43 a.m.
The Jupyter Notebooks behind my OReilly report, "A Whirlwind Tour of Python"
updated at May 11, 2024, 10:44 a.m.