cdec by redpony

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

updated at Oct. 19, 2023, 10:32 a.m.

C++

33 +0

184 +0

78 +0

GitHub
RusPhonetizer by wilpert

Grammar rules and dictionaries for the phonetic transcription of Russian sentences

updated at Sept. 19, 2023, 7:21 a.m.

Python

12 +0

33 +0

2 +0

GitHub
barista by usc-sail

Barista is an open-source framework for concurrent speech processing.

updated at Sept. 1, 2023, 8:40 a.m.

C++

13 +0

35 +0

6 +0

GitHub
sail_align by nassosoassos

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends on freely available software, namely HTK, srilm and sclite.

updated at Aug. 28, 2023, 7:56 a.m.

Perl

14 +0

97 +0

14 +0

GitHub
rnnlm2wfst by glecorve

Conversion of recurrent neural network language models to weighted finite state transducers

updated at Aug. 18, 2023, 3:37 a.m.

C++

6 +0

54 +0

15 +0

GitHub
openfst-utils by benob

Utilities for manipulating finite state transducers with the OpenFst library.

updated at June 8, 2023, 9:35 p.m.

C++

5 +0

30 +0

12 +0

GitHub
juicer by idiap

Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).

updated at June 3, 2023, 1:06 p.m.

C++

18 +0

60 +0

25 +0

GitHub
cpyp by redpony

C++ library for modeling with Pitman-Yor processes

updated at May 23, 2023, 3:35 p.m.

C++

9 +0

34 +0

11 +0

GitHub
cloud-asr by UFAL-DSG

Cloud-based Automatic Speech Recognition (ASR) platform and a public ASR webservice.

updated at April 6, 2023, 10:31 p.m.

Python

17 +0

65 +0

25 +0

GitHub
bigfatlm by jhclark

Hadoop MapReduce training of modified Kneser-Ney smoothed language models

updated at March 23, 2023, 9:16 a.m.

Java

6 +0

30 +0

10 +0

GitHub
openlat by benob

Toolkit for manipulating word lattices built on top of openfst

updated at Jan. 11, 2022, 8:02 a.m.

C++

1 +0

4 +0

0 +0

GitHub
kaldi-nnet-dur-model by alumae

Neural network phone duration model on top of the Kaldi speech recognition framework

updated at Dec. 9, 2021, 3:20 a.m.

Python

5 +0

25 +0

9 +0

GitHub
spectral-learn by ICML14MoMCompare

Collection of three method of moments based algorithms for learning stochastic languages.

updated at June 10, 2021, 11:31 a.m.

C++

2 +0

15 +0

1 +0

GitHub
kleene-lang by krbeesley

a high-level language, based on OpenFst, for finite-state programming

updated at Jan. 8, 2020, 4:23 a.m.

Java

4 +0

14 +0

4 +0

GitHub
openfst by kho

n Shortest Path for PDT

updated at Nov. 9, 2017, 1:50 a.m.

C++

3 +0

4 +0

0 +0

GitHub