Open Source Text Processing Project: pocketsphinx-python

pocketsphinx-python: Python interface to CMU SphinxBase and PocketSphinx libraries
Project Website: https://pypi.python.org/pypi/pocketsphinx
GitHub Link: https://github.com/bambocher/pocketsphinx-python
Description: Python interface to the CMU SphinxBase and PocketSphinx libraries, created with SWIG. The Pocketsphinx packages include Python support; however, it is based on Automake and not …

Open Source Text Processing Project: gensim-simserver

gensim-simserver: Document similarity server, using gensim
Project Website: http://radimrehurek.com/gensim/simserver.html
GitHub Link: https://github.com/piskvorky/gensim-simserver
Description: Index plain text documents and query the index for semantically related documents. Simserver uses transactions internally to provide a robust and scalable similarity server. Conceptually, a service …

Open Source Text Processing Project: nlgserv

nlgserv: JSON HTTP wrapper for SimpleNLG
Project Website: https://pypi.python.org/pypi/nlgserv
GitHub Link: https://github.com/mnestis/nlgserv
Description: nlgserv is a simple server that accepts JSON representations of sentences and generates English sentences from those. This was something I cobbled together to act as part …

Open Source Text Processing Project: semanticizest

semanticizest: Standalone Semanticizer
Project Website: https://semanticize.github.io/semanticizest/
GitHub Link: https://github.com/semanticize/semanticizest
Description: Semanticizest is a package for doing entity linking, also known as semantic linking or semanticizing: you feed it text, and it outputs links to pertinent Wikipedia concepts. You can use …

Open Source Text Processing Project: causeofwhy

Cause of Why
Project Website: None
GitHub Link: https://github.com/bwbaugh/causeofwhy
Description: The goal of this project is to implement a Question Answering (QA) system that answers causal type questions. We use Wikipedia as a knowledge base, extracting answers to user questions …

Open Source Text Processing Project: Quepy

Quepy: A Python framework to transform natural language questions to queries in a database query language
Project Website: None
GitHub Link: https://github.com/machinalis/quepy
Description: Quepy is a Python framework to transform natural language questions to queries in a database query language. …

Open Source Text Processing Project: NLP-Caffe

NLP-Caffe: natural language processing with Caffe
Project Website: None
GitHub Link: https://github.com/Russell91/nlpcaffe
Description: NLP-Caffe is a pull request [1] on the Caffe framework developed by Yangqing Jia and Evan Shelhamer, among other members of the BVLC lab at Berkeley and …

Open Source Text Processing Project: SpeechRecognition

SpeechRecognition: Library for performing speech recognition, with support for several engines and APIs, online and offline
Project Website: https://pypi.python.org/pypi/SpeechRecognition/
GitHub Link: https://github.com/Uberi/speech_recognition
Description: Library for performing speech recognition, with support for several engines and APIs, online and offline. Speech recognition engine/API …

Open Source Text Processing Project: TextRank

Python implementation of the TextRank algorithm
Project Website: None
GitHub Link: https://github.com/davidadamojr/TextRank
Description: This is a Python implementation of TextRank for automatic keyword and sentence extraction (summarization) as done in https://web.eecs.umich.edu/~mihalcea/papers/mihalcea.emnlp04.pdf. However, this implementation uses Levenshtein Distance as the relation between …
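
For readers unfamiliar with the Levenshtein distance mentioned in the TextRank description, here is a minimal sketch of the standard dynamic-programming computation. It is not taken from the TextRank repository; the function name `levenshtein` is an assumption made for illustration, and because the excerpt is truncated, how the project actually applies the distance is not shown here.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, or substitutions needed to turn `a` into `b`.

    Illustrative helper only (hypothetical name, not the project's code).
    """
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] = distance between the empty prefix of `a` and b[:j].
    previous = list(range(len(b) + 1))

    for i, ch_a in enumerate(a, start=1):
        # Distance between a[:i] and the empty string is i deletions.
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            substitution_cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,                      # delete ch_a
                    current[j - 1] + 1,                   # insert ch_b
                    previous[j - 1] + substitution_cost,  # substitute or match
                )
            )
        previous = current

    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

A smaller distance means two strings are closer in spelling, which is one way a graph-based ranker could weight the edge between two text units.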

Open Source Text Processing Project: IEPY

Information Extraction in Python
Project Website: None
GitHub Link: https://github.com/machinalis/iepy
Description: IEPY is an open source tool for Information Extraction focused on Relation Extraction. To give an example of Relation Extraction, if we are trying to find a birth date …