pyannote-audio by pyannote

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

updated at May 5, 2024, 7:12 p.m.

Jupyter Notebook

62 +0

5,089 +41

698 +6

youtube-dl by ytdl-org

Command-line program to download videos from YouTube.com and other video sites

updated at May 5, 2024, 7:07 p.m.

Python

2,196 +1

128,650 +148

9,675 +17

PythonDataScienceHandbook by jakevdp

Python Data Science Handbook: full text in Jupyter Notebooks

updated at May 5, 2024, 6:43 p.m.

Jupyter Notebook

1,777 -2

41,565 +64

17,570 +19

Montreal-Forced-Aligner by MontrealCorpusTools

Command line utility for forced alignment using Kaldi

updated at May 5, 2024, 6:30 p.m.

Python

35 +0

1,210 +7

240 +0

pydub by jiaaro

Manipulate audio with a simple and easy high level interface

updated at May 5, 2024, 5 p.m.

Python

135 +0

8,359 +13

1,000 +3

audioFlux by libAudioFlux

A library for audio and music analysis and feature extraction.

updated at May 5, 2024, 4:47 p.m.

C

28 +0

2,054 +6

98 +0

pyAudioAnalysis by tyiannak

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

updated at May 5, 2024, 3:44 p.m.

Python

211 +0

5,679 +10

1,177 -1

pyo by belangeo

Python DSP module

updated at May 5, 2024, 3:26 p.m.

Python

65 +0

1,274 +0

125 +1

DeepSpeech by mozilla

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

updated at May 5, 2024, 3:18 p.m.

C++

663 +0

24,343 +47

3,876 -4

speech_recognition by Uberi

Speech recognition module for Python, supporting several engines and APIs, online and offline.

updated at May 5, 2024, 3:10 p.m.

Python

283 +0

8,056 +12

2,366 +6

beets by beetbox

Music library manager and MusicBrainz tagger

updated at May 5, 2024, 2:16 p.m.

Python

403 -1

12,410 +11

1,782 +4

nimfa by mims-harvard

Nimfa: Nonnegative matrix factorization in Python

updated at May 5, 2024, 2:05 p.m.

Python

36 +0

526 +1

133 +0

pyloudnorm by csteinmetz1

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

updated at May 5, 2024, 1:35 p.m.

Python

14 +0

582 +1

53 +0

matchering by sergree

🎚️ Open Source Audio Matching and Mastering

updated at May 5, 2024, 11:48 a.m.

Python

40 +0

1,211 +8

145 +0

Parselmouth by YannickJadoul

Praat in Python, the Pythonic way

updated at May 5, 2024, 11:27 a.m.

C++

21 +0

1,005 +4

110 +0

musicinformationretrieval.com by iranroman

Instructional notebooks on music information retrieval.

updated at May 5, 2024, 11:26 a.m.

Jupyter Notebook

53 +0

1,199 +2

412 -1

music21 by cuthbertLab

music21 is a Toolkit for Computational Musicology

updated at May 5, 2024, 11:18 a.m.

Python

75 +0

2,002 +8

383 +0

pyroomacoustics by LCAV

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

updated at May 5, 2024, 9 a.m.

Python

44 +0

1,329 +3

415 +0

tinytag by devsnd

Python library for reading audio file metadata, duration of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA, Wave, AIFF and a few more

updated at May 5, 2024, 8:49 a.m.

Python

24 +0

667 +2

98 +0

PyAV by PyAV-Org

Pythonic bindings for FFmpeg's libraries.

updated at May 5, 2024, 7:43 a.m.

Cython

60 -1

2,279 +3

343 +0
