Open Source Text Processing Project: Sphinxtrain

Sphinxtrain Project Website: None Github Link: https://github.com/cmusphinx/sphinxtrain Description This is SphinxTrain, Carnegie Mellon University’s open source acoustic model trainer. This directory contains the scripts and instructions necessary for building models for the CMU Sphinx Recognizer. This distribution is free software, … Continue reading

Open Source Text Processing Project: gensim-simserver

gensim-simserver: Document similarity server, using gensim Project Website: http://radimrehurek.com/gensim/simserver.html Github Link: https://github.com/piskvorky/gensim-simserver Description Index plain text documents and query the index for semantically related documents. Simserver uses transactions internally to provide a robust and scalable similarity server. Conceptually, a service … Continue reading

Open Source Text Processing Project: sphinxbase

sphinxbase: CMU Sphinx common libraries Project Website: None Github Link: https://github.com/cmusphinx/sphinxbase Description This package contains the basic libraries shared by the CMU Sphinx trainer and all the Sphinx decoders (Sphinx-II, Sphinx-III, and PocketSphinx), as well as some common utilities for … Continue reading

Open Source Text Processing Project: CMU Sphinx

CMU Sphinx: Project by Carnegie Mellon University Project Website: http://cmusphinx.sourceforge.net/ Github Link: https://github.com/cmusphinx Description CMUSphinx collects over 20 years of the CMU research. All advantages are hard to list, but just to name a few: State of art speech recognition … Continue reading

Open Source Text Processing Project: nlgserv

nlgserv: JSON HTTP wrapper for SimpleNLG Project Website: https://pypi.python.org/pypi/nlgserv Github Link: https://github.com/mnestis/nlgserv Description nlgserv is a simple server that accepts JSON representations of sentences and generates English sentences from those. This was something I cobbled together to act as part … Continue reading

Open Source Text Processing Project: SimpleNLG

SimpleNLG: Java API for Natural Language Generation Project Website: None Github Link: https://github.com/simplenlg/simplenlg Description SimpleNLG is a simple Java API designed to facilitate the generation of Natural Language. It was originally developed at the University of Aberdeen’s Department of Computing … Continue reading

Open Source Text Processing Project: semanticizest

semanticizest: Standalone Semanticizer Project Website: https://semanticize.github.io/semanticizest/ Github Link: https://github.com/semanticize/semanticizest Description Semanticizest is a package for doing entity linking, also known as semantic linking or semanticizing: you feed it text, and it outputs links to pertinent Wikipedia concepts. You can use … Continue reading

Open Source Text Processing Project: causeofwhy

Cause of Why Project Website: None Github Link: https://github.com/bwbaugh/causeofwhy Description The goal of this project is to implement a Question Answering (QA) system that answers causal type questions. We use Wikipedia as a knowledge base, extracting answers to user questions … Continue reading

Open Source Text Processing Project: Quepy

Quepy: A python framework to transform natural language questions to queries in a database query language Project Website: None Github Link: https://github.com/machinalis/quepy Description Quepy is a python framework to transform natural language questions to queries in a database query language. … Continue reading

Open Source Text Processing Project: NLP-Caffe

NLP-Caffe: natural language processing with Caffe Project Website: None Github Link: https://github.com/Russell91/nlpcaffe Description NLP-Caffe is a pull request [1] on the Caffe framework developed by Yangqing Jia and Evan Shelhamer, among other members of the BVLC lab at Berkeley and … Continue reading