Open Source Text Processing Project: YodaQA

YodaQA: A Question Answering system built on top of the Apache UIMA framework Project Website: http://ailao.eu/yodaqa/ Github Link: https://github.com/brmson/yodaqa Description YodaQA is an open source Factoid Question Answering system that can produce answer both from databases and text corpora using … Continue reading

Open Source Text Processing Project: InformationExtractionSystem

Information Extraction System Project Website: None Github Link: https://github.com/sanjaymeena/InformationExtractionSystem Description Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc. This document is an overview of various modules. For more information, please refer to … Continue reading

Open Source Text Processing Project: TextRank

Python implementation of TextRank algorithm Project Website: None Github Link: https://github.com/davidadamojr/TextRank Description This is a python implementation of TextRank for automatic keyword and sentence extraction (summarization) as done in https://web.eecs.umich.edu/~mihalcea/papers/mihalcea.emnlp04.pdf. However, this implementation uses Levenshtein Distance as the relation between … Continue reading

Open Source Text Processing Project: Serelex

Serelex – a tool for semantic relation extraction Project Website: None Github Link: https://github.com/jgc128/DefVectors Description A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from the Wikipedia articles (other … Continue reading

Open Source Text Processing Project: RelationFactory

End-to-end relation extraction and knowledge base population pipeline Project Website: None Github Link: https://github.com/beroth/relationfactory Description RelationFactory is a relation extraction and knowledge-base population system. It was the top-ranked system in TAC KBP 2013 English Slot-filling (http://www.nist.gov/tac/2013/KBP/index.html). If you want to … Continue reading

Open Source Text Processing Project: IEPY

Information Extraction in Python Project Website: None Github Link: https://github.com/machinalis/iepy Description IEPY is an open source tool for Information Extraction focused on Relation Extraction. To give an example of Relation Extraction, if we are trying to find a birth date … Continue reading

Open Source Text Processing Project: dchest-stemmer

Stemmer package for Go Project Website: None Github Link: https://github.com/dchest/stemmer Description Stemmer package provides an interface for stemmers and includes English and German stemmers as sub-packages: porter2 sub-package implements English (Porter2) stemmer as described in http://snowball.tartarus.org/algorithms/english/stemmer.html german sub-package implements German … Continue reading

Open Source Text Processing Project: golibstemmer

Go (golang) bindings for libstemmer Project Website: None Github Link: https://github.com/rjohnsondev/golibstemmer Description Go bindings for the snowball libstemmer library including porter 2 This simple library provides Go (golang) bindings for the snowball libstemmer library including the popular porter and porter2 … Continue reading

Open Source Text Processing Project: go-porterstemmer

Go Porter Stemmer Project Website: None Github Link: https://github.com/reiver/go-porterstemmer Description A native Go clean room implementation of the Porter Stemming Algorithm. This algorithm is of interest to people doing Machine Learning or Natural Language Processing (NLP). This is NOT a … Continue reading

Open Source Text Processing Project: kljensen-snowball

Go implementation of the Snowball stemmers Project Website: None Github Link: https://github.com/kljensen/snowball Description A Go (golang) implementation of the Snowball stemmer for natural language processing. Status Latest release v0.3.4 (2013-05-19) Latest build status Build Status Go versions tested go1.0.3 Languages … Continue reading