Open Source Text Processing Project: Moses

Moses, the machine translation system
Project Website: http://www.statmt.org/moses/
GitHub Link: https://github.com/moses-smt/mosesdecoder
Description: Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair. All you need is a collection of translated texts …

Open Source Text Processing Project: Open Text Summarizer

Open Text Summarizer
Project Website: http://libots.sourceforge.net/
GitHub Link: None
Description: Automatic text summarization is the technique by which a computer program summarizes a document. A text is fed into the computer and a highlighted (summarized) text is returned. The Open Text …

Open Source Text Processing Project: Maximum Entropy Modeling Toolkit

Maximum Entropy Modeling Toolkit for Python and C++
Project Website: http://homepages.inf.ed.ac.uk/lzhang10/maxent_toolkit.html
GitHub Link: https://github.com/lzhang10/maxent
Description: The Maximum Entropy Toolkit provides a set of tools and a library for constructing maximum entropy (maxent) models in either Python or C++. Maximum Entropy Model …

Open Source Text Processing Project: CRF++

CRF++: Yet Another CRF toolkit
Project Website: https://taku910.github.io/crfpp/
GitHub Link: None
Description: CRF++ is a simple, customizable, open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is designed for general-purpose use and will be applied …

Open Source Text Processing Project: GibbsLDA++

GibbsLDA++: A C/C++ Implementation of Latent Dirichlet Allocation
Project Website: http://gibbslda.sourceforge.net/
GitHub Link: None
Description: GibbsLDA++ is a C/C++ implementation of Latent Dirichlet Allocation (LDA) using the Gibbs sampling technique for parameter estimation and inference. It is very fast and is …