Command-line program to download videos from YouTube.com and other video sites
created at Oct. 31, 2010, 2:35 p.m.
Python Data Science Handbook: full text in Jupyter Notebooks
created at Aug. 10, 2016, 2:24 p.m.
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
created at June 2, 2016, 3:04 p.m.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
created at April 23, 2014, 4:53 a.m.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
created at March 7, 2016, 5:26 p.m.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
created at Aug. 27, 2014, 12:43 p.m.
The Jupyter Notebooks behind my O'Reilly report, "A Whirlwind Tour of Python"
created at July 27, 2016, 2:39 p.m.
Tutorial material on the scientific Python ecosystem
created at May 19, 2010, 8:42 p.m.
A library for audio and music analysis and feature extraction.
created at Jan. 16, 2023, 9:53 a.m.
This library provides common speech features for ASR including MFCCs and filterbank energies.
created at Oct. 31, 2013, 2:42 a.m.
music21 is a toolkit for computational musicology
created at Nov. 4, 2013, 7:31 p.m.
Python interface to the WebRTC Voice Activity Detector
created at April 23, 2016, 5:03 a.m.