DeepSpeech by mozilla

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

updated at May 11, 2024, 11:35 p.m.

C++

663 +0

24,393 +50

3,881 +5

GitHub
essentia by MTG

C++ library for audio and music analysis, description and synthesis, including Python bindings

updated at May 11, 2024, 3:59 p.m.

C++

108 +0

2,705 +11

521 +0

GitHub
Parselmouth by YannickJadoul

Praat in Python, the Pythonic way

updated at May 6, 2024, 4:09 a.m.

C++

21 +0

1,006 +1

110 +0

GitHub
loudness by deeuu

Audio library for modelling loudness

updated at April 28, 2024, 4:26 p.m.

C++

5 +0

33 +0

11 +0

GitHub
Yaafe by Yaafe

Audio features extraction

updated at March 30, 2024, 11:29 a.m.

C++

17 +0

241 +0

43 +0

GitHub
AudioTK by mbrucher

An audio digital processing toolbox based on a workflow/pipeline principle

updated at Jan. 28, 2024, 6:14 a.m.

C++

21 +0

250 +0

37 +0

GitHub