DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
updated at May 11, 2024, 11:35 p.m.
Python Data Science Handbook: full text in Jupyter Notebooks
updated at May 11, 2024, 11:40 p.m.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
updated at May 12, 2024, 12:10 a.m.
Tutorial material on the scientific Python ecosystem
updated at May 12, 2024, 12:24 a.m.
Command-line program to download videos from YouTube.com and other video sites
updated at May 12, 2024, 12:27 a.m.