audio by pytorch

Data manipulation and transformation for audio signal processing, powered by PyTorch

created at May 5, 2017, 12:38 a.m.

Python

74 +1

2,392 +6

630 +3

GitHub
speechpy by astorfi

speech balloon SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

created at April 5, 2017, 3:35 a.m.

Python

41 +0

879 +0

105 +0

GitHub
persephone by persephone-tools

A tool for automatic phoneme transcription

created at Feb. 13, 2017, 11:41 p.m.

Python

17 +0

154 +0

26 -1

GitHub
kapre by keunwoochoi

kapre: Keras Audio Preprocessors

created at Dec. 14, 2016, 6:36 p.m.

Python

23 +0

915 +4

149 +0

GitHub
PythonDataScienceHandbook by jakevdp

Python Data Science Handbook: full text in Jupyter Notebooks

created at Aug. 10, 2016, 2:24 p.m.

Jupyter Notebook

1,777 +0

41,599 +34

17,583 +13

GitHub
WhirlwindTourOfPython by jakevdp

The Jupyter Notebooks behind my OReilly report, "A Whirlwind Tour of Python"

created at July 27, 2016, 2:39 p.m.

Jupyter Notebook

221 +0

3,644 +0

1,583 +2

GitHub
sound_field_analysis-py by AppliedAcousticsChalmers

Analyze, visualize, and process sound field data recorded by spherical microphone arrays.

created at June 23, 2016, 3:56 p.m.

Python

22 +0

87 +0

15 +0

GitHub
DeepSpeech by mozilla

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

created at June 2, 2016, 3:04 p.m.

C++

663 +0

24,393 +50

3,881 +5

GitHub
Python-Wrapper-for-World-Vocoder by JeremyCCHsu

A Python wrapper for the high-quality vocoder "World"

created at May 24, 2016, 12:25 p.m.

Cython

26 +0

694 +2

118 +2

GitHub
mdct by nils-werner

A fast MDCT implementation using SciPy and FFTs

created at May 23, 2016, 10:37 a.m.

Python

6 +0

49 +0

10 +0

GitHub
pysox by marl

Python wrapper around sox.

created at May 1, 2016, 6:06 p.m.

Python

13 +0

501 +3

79 +1

GitHub
py-webrtcvad by wiseman

Python interface to the WebRTC Voice Activity Detector

created at April 23, 2016, 5:03 a.m.

C

48 +0

1,890 +4

395 +0

GitHub
catchy by jvbalen

Python tools for the corpus analysis of popular music.

created at April 12, 2016, 8:37 p.m.

Python

5 +0

20 +0

2 +0

GitHub
resampy by bmcfee

Efficient sample rate conversion in python

created at April 8, 2016, 2:57 p.m.

Python

10 +0

246 +1

34 +0

GitHub
mutagen by quodlibet

Python module for handling audio metadata

created at April 7, 2016, 5:10 p.m.

Python

38 +0

1,445 +0

156 +0

GitHub
pyannote-audio by pyannote

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

created at March 7, 2016, 5:26 p.m.

Jupyter Notebook

63 +1

5,139 +50

700 +2

GitHub
sed_eval by TUT-ARG

Evaluation toolbox for Sound Event Detection

created at Feb. 25, 2016, 2:15 p.m.

Python

8 +0

131 +0

45 +0

GitHub
pyroomacoustics by LCAV

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

created at Dec. 24, 2015, 1:41 p.m.

Python

44 +0

1,332 +3

417 +2

GitHub
Montreal-Forced-Aligner by MontrealCorpusTools

Command line utility for forced alignment using Kaldi

created at Oct. 26, 2015, 5:02 p.m.

Python

35 +0

1,218 +8

240 +0

GitHub
gentle by lowerquality

gentle forced aligner

created at Oct. 23, 2015, 10 a.m.

Python

45 +0

1,391 +7

290 +0

GitHub