Getting started with NLTK

About NLTK
Open Source Text Processing Project: NLTK

Install NLTK
1. Install the latest NLTK package on Ubuntu 16.04.1 LTS:

    textprocessing@ubuntu:~$ sudo pip install -U nltk
    Collecting nltk
      Downloading nltk-3.2.2.tar.gz (1.2MB)
        35% |███████████▍ | 409kB 20.8MB/s eta 0:00:0
        ……
        100% …
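After the pip command above finishes, the install can be confirmed from Python. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is hypothetical and not part of NLTK or pip.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent.

    `installed_version` is an illustrative helper, not an NLTK or pip function.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("nltk")
    if version is None:
        print("nltk is not installed")
    else:
        print("nltk", version)
```

Looking up the package metadata avoids importing NLTK itself, so the check still works when the package is missing.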

Open Source Text Processing Project: Wapiti

Wapiti – A simple and fast discriminative sequence labelling toolkit
Project Website: https://wapiti.limsi.fr/
GitHub Link: https://github.com/Jekub/Wapiti
Description: Wapiti is a very fast toolkit for segmenting and labeling sequences with discriminative models. It is based on maxent models, maximum entropy Markov …

Open Source Text Processing Project: segtok

segtok: sentence segmentation and word tokenization tools
Project Website: http://fnl.es/segtok-a-segmentation-and-tokenization-library.html
GitHub Link: https://github.com/fnl/segtok
Description: A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic features. The segtok package provides two modules, segtok.segmenter and segtok.tokenizer. The segmenter provides functionality for …
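To give a feel for what "rule-based" and "orthographic features" mean here, the sketch below splits text after sentence-final punctuation when the next word starts with an uppercase letter. It is a toy illustration using only the standard library. It does not use segtok and is not segtok's rule set, and the name `naive_split_sentences` is hypothetical.

```python
import re

# One orthographic rule (an assumption for illustration, not segtok's rules):
# a boundary is whitespace that follows '.', '!' or '?' and precedes
# an uppercase ASCII letter.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def naive_split_sentences(text):
    """Split `text` into sentences using the single rule above."""
    return [part for part in _BOUNDARY.split(text.strip()) if part]


if __name__ == "__main__":
    for sentence in naive_split_sentences("Hello world. This is a test! Is it? yes."):
        print(sentence)
```

A single rule like this breaks easily: "Dr. Smith arrived." is split after "Dr.", which is the kind of case a fuller rule set has to handle.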

Open Source Text Processing Project: nlp-with-ruby

nlp-with-ruby: Awesome NLP with Ruby
Project Website: None
GitHub Link: https://github.com/arbox/nlp-with-ruby
Description: This curated list comprises awesome resources, libraries, information sources about computational processing of texts in human languages with Ruby. That field is often referred to as NLP, Computational …

Open Source Text Processing Project: textacy

textacy: higher-level NLP built on spaCy
Project Website: https://textacy.readthedocs.io
GitHub Link: https://github.com/chartbeat-labs/textacy
Description: textacy is a Python library for performing higher-level natural language processing (NLP) tasks, built on the high-performance spaCy library. With the basics: tokenization, part-of-speech tagging, dependency …

Open Source Text Processing Project: vivekn sentiment

Sentiment analysis using machine learning techniques
Project Website: http://sentiment.vivekn.com/
GitHub Link: https://github.com/vivekn/sentiment
Description: Sentiment analysis using machine learning techniques. Check info.py for the training and testing code. A demo of the tool is available here. Refer to this paper for more …

Open Source Deep Learning Project: Paddle

Paddle: PArallel Distributed Deep LEarning
Project Website: http://www.paddlepaddle.org/
GitHub Link: https://github.com/baidu/Paddle
Description: PaddlePaddle (PArallel Distributed Deep LEarning) is an easy-to-use, efficient, flexible and scalable deep learning platform, originally developed by Baidu scientists and engineers for the purpose of …

Open Source Text Processing Project: Stanford Temporal Tagger

Stanford Temporal Tagger
Project Website: http://nlp.stanford.edu/software/sutime.html
GitHub Link: None
Description: SUTime is a library for recognizing and normalizing time expressions. That is, it will convert "next wednesday at 3pm" to something like 2016-02-17T15:00 (depending on the assumed current reference time). …
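The dependence on the reference time can be illustrated with a small standard-library sketch. This is not SUTime's API or algorithm. The function `resolve_next_weekday` is hypothetical, and it assumes the simplest reading of "next wednesday": the first Wednesday strictly after the reference date.

```python
from datetime import datetime, timedelta

WEEKDAYS = {
    "monday": 0, "tuesday": 1, "wednesday": 2, "thursday": 3,
    "friday": 4, "saturday": 5, "sunday": 6,
}


def resolve_next_weekday(reference, weekday_name, hour):
    """Return the first `weekday_name` strictly after `reference`, at `hour`:00.

    Hypothetical helper for illustration; SUTime's own rules may differ.
    """
    target = WEEKDAYS[weekday_name.lower()]
    # Days to add, always in the range 1..7 so the result is never the same day.
    days_ahead = (target - reference.weekday() - 1) % 7 + 1
    day = reference + timedelta(days=days_ahead)
    return day.replace(hour=hour, minute=0, second=0, microsecond=0)


if __name__ == "__main__":
    # Assumed reference time: Friday, 12 February 2016.
    reference = datetime(2016, 2, 12, 9, 30)
    resolved = resolve_next_weekday(reference, "wednesday", 15)
    print(resolved.isoformat(timespec="minutes"))  # 2016-02-17T15:00
```

With a different reference time, for example 1 March 2016, the same phrase resolves to 2016-03-02T15:00, which is why the normalized value is only meaningful relative to the assumed reference.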

Open Source Deep Learning Project: dlib

dlib: A toolkit for making real world machine learning and data analysis applications in C++
Project Website: http://dlib.net
GitHub Link: https://github.com/davisking/dlib
Description: Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ …

Open Source Deep Learning Project: torchnet

torchnet: Torch on steroids
Project Website: None
GitHub Link: https://github.com/torchnet/torchnet
Description: torchnet is a framework for torch which provides a set of abstractions aiming at encouraging code re-use as well as encouraging modular programming. At the moment, torchnet provides four …