Open Source Text Processing Project: Stanford Log-linear Part-Of-Speech Tagger

Stanford Log-linear Part-Of-Speech Tagger
Project Website: http://nlp.stanford.edu/software/tagger.shtml
GitHub Link: None
Description: A Part-Of-Speech Tagger (POS Tagger) is a piece of software that reads text in some language and assigns parts of speech to each word (and other tokens), such as …

Open Source Text Processing Project: Stanford CoreNLP

Stanford CoreNLP – a suite of core NLP tools
Project Website: http://stanfordnlp.github.io/CoreNLP/
GitHub Link: https://github.com/stanfordnlp/CoreNLP
Description: Stanford CoreNLP provides a set of natural language analysis tools. It can give the base forms of words, their parts of speech, whether they …

Open Source Text Processing Project: Pattern

Pattern
Project Website: http://www.clips.ua.ac.be/pattern
GitHub Link: https://github.com/clips/pattern
Description: Pattern is a web mining module for the Python programming language. It has tools for data mining (Google, Twitter and Wikipedia API, a web crawler, an HTML DOM parser), natural language processing …

Open Source Text Processing Project: MBSP

MBSP for Python
Project Website: http://www.clips.ua.ac.be/pages/MBSP
Description: MBSP is a text analysis system based on the TiMBL and MBT memory-based learning applications developed at CLiPS and ILK. It provides tools for Tokenization and Sentence Splitting, Part of Speech Tagging, …

Open Source Text Processing Project: TextBlob

TextBlob: Simplified Text Processing
Project Website: http://textblob.readthedocs.org/en/dev/
GitHub Link: https://github.com/sloria/textblob
Description: TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech …

Open Source Text Processing Project: spaCy

spaCy: BUILD TOMORROW’S LANGUAGE TECHNOLOGIES
Project Website: https://spacy.io/
GitHub Link: https://github.com/honnibal/spaCy
Description: spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concise API, and great documentation. If you’re a …

Open Source Text Processing Project: NLTK

NLTK: Natural Language Toolkit
Project Website: http://www.nltk.org/
GitHub Link: https://github.com/nltk, https://github.com/nltk/nltk
Description: NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as …