Text Processing Book: Speech and Language Processing (3rd ed. draft)

Speech and Language Processing (3rd ed. draft)
Project Website: https://web.stanford.edu/~jurafsky/slp3/
Description: Chapters, with slides and relation to the 2nd ed.
1: Introduction [Ch. 1 in 2nd ed.]
2: Regular Expressions, Text Normalization, and Edit Distance; slides: Text [pptx] [pdf], Edit Distance [pptx] [pdf] …

Open Source Text Processing Project: PyJulius

PyJulius: Python interface to the Julius speech recognition engine
Project Website: https://pypi.python.org/pypi/pyjulius
GitHub Link: https://github.com/Diaoul/pyjulius
Description: pyjulius provides a simple interface for connecting to the Julius module server. First you will need to run julius with the -module option (documentation here or …

Open Source Text Processing Project: Julius

Julius: Open-Source Large Vocabulary Continuous Speech Recognition Engine
Project Website: http://julius.osdn.jp/
GitHub Link: https://github.com/julius-speech/julius
Description: “Julius” is a high-performance, small-footprint large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers. Based on word N-gram and context-dependent HMM, …

Open Source Text Processing Project: Kaldi

Kaldi Speech Recognition Toolkit
Project Website: http://kaldi-asr.org/
GitHub Link: https://github.com/kaldi-asr/kaldi
Description: What is Kaldi? Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers. …

Open Source Text Processing Project: pocketsphinx-ruby

pocketsphinx-ruby: Ruby speech recognition with Pocketsphinx
Project Website: None
GitHub Link: https://github.com/watsonbox/pocketsphinx-ruby
Description: This gem provides Ruby FFI bindings for Pocketsphinx, a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the …

Open Source Text Processing Project: pocketsphinx-python

pocketsphinx-python: Python interface to CMU SphinxBase and PocketSphinx libraries
Project Website: https://pypi.python.org/pypi/pocketsphinx
GitHub Link: https://github.com/bambocher/pocketsphinx-python
Description: Python interface to CMU SphinxBase and PocketSphinx libraries, created with SWIG. The Pocketsphinx packages include Python support; however, it is based on Automake and not …

Open Source Text Processing Project: Sphinx4

Sphinx-4 Speech Recognition System
Project Website: None
GitHub Link: https://github.com/cmusphinx/sphinx4
Description: Sphinx-4 is a state-of-the-art, speaker-independent, continuous speech recognition system written entirely in the Java programming language. It was created via a joint collaboration between the Sphinx group at Carnegie …

Open Source Text Processing Project: PocketSphinx

PocketSphinx
Project Website: None
GitHub Link: https://github.com/cmusphinx/pocketsphinx
Description: PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop. This is PocketSphinx, one of Carnegie Mellon University’s open source …

Open Source Text Processing Project: Sphinxtrain

Sphinxtrain
Project Website: None
GitHub Link: https://github.com/cmusphinx/sphinxtrain
Description: This is SphinxTrain, Carnegie Mellon University’s open source acoustic model trainer. This directory contains the scripts and instructions necessary for building models for the CMU Sphinx Recognizer. This distribution is free software, …

Open Source Text Processing Project: sphinxbase

sphinxbase: CMU Sphinx common libraries
Project Website: None
GitHub Link: https://github.com/cmusphinx/sphinxbase
Description: This package contains the basic libraries shared by the CMU Sphinx trainer and all the Sphinx decoders (Sphinx-II, Sphinx-III, and PocketSphinx), as well as some common utilities for …