Open Source Text Processing Project: UMDHMM

UMDHMM: Hidden Markov Model Toolkit Project Website: http://www.kanungo.com/software/software.html#umdhmm Github Link: None Description Hidden Markov Model (HMM) Software: Implementation of Forward-Backward, Viterbi, and Baum-Welch algorithms. The software has been compiled and tested on UNIX platforms (sun solaris, dec osf and linux) … Continue reading

Open Source Text Processing Project: GHMM

GHMM: The General Hidden Markov Model library Project Website: http://www.ghmm.org/ Github Link: None Description The General Hidden Markov Model library (GHMM) is a freely available C library implementing efficient data structures and algorithms for basic and extended HMMs with discrete … Continue reading

Open Source Text Processing Project: HTK

HTK: The Hidden Markov Model Toolkit Project Website: http://htk.eng.cam.ac.uk/ Github Link: None Description The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. HTK is primarily used for speech recognition research although it … Continue reading

Open Source Text Processing Project: MGIZA

MGIZA++: a multi-threaded word alignment tool based on GIZA++ Project Website: http://www.kyloo.net/software/doku.php/mgiza:overview Github Link: https://github.com/moses-smt/mgiza Description MGIZA++ is a multi-threaded word alignment tool based on GIZA++. It extends GIZA++ in multiple ways: Multi-threading MGIZA++ can make use of multi-core platforms … Continue reading

Open Source Text Processing Project: mkcls

mkcls: Training of word classes Project Website: http://www.fjoch.com/mkcls.html Github Link: https://github.com/moses-smt/giza-pp Description mkcls is a tool to train word classes by using a maximum-likelihood-criterion. The resulting word classes are especially suited for language models or statistical translation models. The program … Continue reading

Open Source Text Processing Project: GIZA++

GIZA++: Training of statistical translation models Project Website: http://www.fjoch.com/GIZA++.html Github Link: https://github.com/moses-smt/giza-pp Description GIZA++ is an extension of the program GIZA (part of the SMT toolkit EGYPT) which was developed by the Statistical Machine Translation team during the summer workshop … Continue reading

Open Source Text Processing Project: KenLM

KenLM: Faster and Smaller Language Model Queries Project Website: http://kheafield.com/code/kenlm/ Github Link: https://github.com/kpu/kenlm Description KenLM Language Model Toolkit benchmark | dependencies | developers | estimation | filter | moses | structures Ken Models with Computer Engineer Barbie KenLM estimates, filters, … Continue reading

Open Source Text Processing Project: IRSTLM

IRSTLM: The IRST Language Modeling Toolkit Project Website: http://hlt-mt.fbk.eu/technologies/irstlm Github Link: https://github.com/irstlm-team/irstlm Description The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models. Our software has been integrated … Continue reading

Open Source Text Processing Project: SRILM

SRILM – The SRI Language Modeling Toolkit Project Website: http://www.speech.sri.com/projects/srilm/ Github Link: None Description SRILM – The SRI Language Modeling Toolkit SRILM is a toolkit for building and applying statistical language models (LMs), primarily for use in speech recognition, statistical … Continue reading

Open Source Text Processing Project: Thot

Thot: a Toolkit for Statistical Machine Translation Project Website: http://daormar.github.io/thot/ Github Link: https://github.com/daormar/thot Description Thot is an open source software toolkit for statistical machine translation (SMT). Originally, Thot incorporated tools to train phrase-based models. The new version of Thot now … Continue reading