Open Source Text Processing Project: textacy

textacy: higher-level NLP built on spaCy Project Website: https://textacy.readthedocs.io Github Link: https://github.com/chartbeat-labs/textacy Description textacy is a Python library for performing higher-level natural language processing (NLP) tasks, built on the high-performance spaCy library. With the basics — tokenization, part-of-speech tagging, dependency … Continue reading

Open Source Text Processing Project: gensim-simserver

gensim-simserver: Document similarity server, using gensim Project Website: http://radimrehurek.com/gensim/simserver.html Github Link: https://github.com/piskvorky/gensim-simserver Description Index plain text documents and query the index for semantically related documents. Simserver uses transactions internally to provide a robust and scalable similarity server. Conceptually, a service … Continue reading

Open Source Text Processing Project: Gensim

Gensim: Topic Modelling for Humans Project Website: https://radimrehurek.com/gensim/ Github Link: https://github.com/piskvorky/gensim/ Description Gensim is a FREE Python library: Scalable statistical semantics Analyze plain-text documents for semantic structure Retrieve semantically similar documents Gensim started off as a collection of various Python … Continue reading

Open Source Text Processing Project: GibbsLDA++

GibbsLDA++: A C/C++ Implementation of Latent Dirichlet Allocation Project Website: http://gibbslda.sourceforge.net/ Github Link: None Description GibbsLDA++ is a C/C++ implementation of Latent Dirichlet Allocation (LDA) using Gibbs Sampling technique for parameter estimation and inference. It is very fast and is … Continue reading