Exercises for the lecture "Selected Topics in Audio Signal Processing"
created at Sept. 30, 2015, 1:22 p.m.
Command line utility for forced alignment using Kaldi
created at Oct. 26, 2015, 5:02 p.m.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a rapid prototyping platform for beamforming algorithms in indoor scenarios.
created at Dec. 24, 2015, 1:41 p.m.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
created at March 7, 2016, 5:26 p.m.
Python interface to the WebRTC Voice Activity Detector
created at April 23, 2016, 5:03 a.m.
A fast MDCT implementation using SciPy and FFTs
created at May 23, 2016, 10:37 a.m.
A Python wrapper for the high-quality vocoder "World"
created at May 24, 2016, 12:25 p.m.
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
created at June 2, 2016, 3:04 p.m.
Analyze, visualize, and process sound field data recorded by spherical microphone arrays.
created at June 23, 2016, 3:56 p.m.
The Jupyter Notebooks behind my O'Reilly report, "A Whirlwind Tour of Python"
created at July 27, 2016, 2:39 p.m.