How to Custom Sentence Segmentation or Sentence Boundary Detection

A lot of NLP tools have sentence segmentation function, such as NLTK Sentence Segmentation, TextBlob Sentence Segmentation, Pattern Sentence Segmentation, spaCy Sentence Segmentation, but sometimes we need to custom the sentence segmentation or sentence boundary detection tool, how to do … Continue reading

A Beginner’s Guide to spaCy

About spaCy Open Source Text Processing Project: spaCy Install spaCy and related data model Install spaCy by pip: sudo pip install -U spacy Collecting spacy Downloading spacy-1.8.2.tar.gz (3.3MB) Downloading numpy-1.13.0-cp27-cp27mu-manylinux1_x86_64.whl (16.6MB) Collecting murmurhash=0.26 (from spacy) Downloading murmurhash-0.26.4-cp27-cp27mu-manylinux1_x86_64.whl Collecting cymem=1.30 (from … Continue reading

Open Source Text Processing Project: textacy

textacy: higher-level NLP built on spaCy Project Website: https://textacy.readthedocs.io Github Link: https://github.com/chartbeat-labs/textacy Description textacy is a Python library for performing higher-level natural language processing (NLP) tasks, built on the high-performance spaCy library. With the basics — tokenization, part-of-speech tagging, dependency … Continue reading

Open Source Text Processing Project: spaCy

spaCy: BUILD TOMORROW’S LANGUAGE TECHNOLOGIES Project Website: https://spacy.io/ Github Link: https://github.com/honnibal/spaCy Description spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concise API, and great documentation. If you’re a … Continue reading