Open Source Text Processing Project: libstemmer

libstemmer: The C version of the libstemmer library
Project Website: http://snowball.tartarus.org/dist/libstemmer_c.tgz
GitHub Link: https://github.com/zvelo/libstemmer
Description: This contains all you need to include the Snowball stemming algorithms in a C project of your own. If you download this, you don’t need …

Open Source Text Processing Project: PyStemmer

Python stemming library using Snowball stemmers
Project Website: https://pypi.python.org/pypi/PyStemmer
GitHub Link: https://github.com/snowballstem/pystemmer
Description: PyStemmer is a Python interface to the stemming algorithms from the Snowball project (http://snowball.tartarus.org/). A stemming algorithm (or stemmer) is a process for removing the commoner morphological …
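To make the idea of a stemmer concrete, here is a minimal sketch of suffix stripping in plain Python. It is a toy illustration only: the suffix list and the `toy_stem` helper are hypothetical, and they are not the Snowball or Porter rules, nor the PyStemmer API. Real stemmers apply many ordered, condition-guarded rules.

```python
# Hypothetical suffix list chosen for illustration; NOT the Snowball/Porter rule set.
SUFFIXES = ("ing", "ed", "es", "s")


def toy_stem(word: str) -> str:
    """Lowercase the word and strip the first matching suffix,
    provided at least three characters of stem remain."""
    lowered = word.lower()
    for suffix in SUFFIXES:
        if lowered.endswith(suffix) and len(lowered) - len(suffix) >= 3:
            return lowered[: -len(suffix)]
    return lowered


words = ["connected", "connecting", "connects", "connect"]
stems = [toy_stem(w) for w in words]
print(stems)  # ['connect', 'connect', 'connect', 'connect']
```

Even this crude version maps related word forms to a common stem, which is what makes stemming useful for search and indexing. It also shows why real algorithms need more care: `toy_stem("running")` gives `runn`, since nothing here handles doubled consonants.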

Open Source Text Processing Project: Snowball

Snowball
Project Website: http://snowballstem.org/
GitHub Link: https://github.com/snowballstem/snowball
Description: Snowball is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This site describes Snowball, and presents several useful stemmers which have been implemented using it. …

Open Source Text Processing Project: The Porter Stemming Algorithm

The Porter Stemming Algorithm
Project Website: http://tartarus.org/martin/PorterStemmer/
GitHub Link: None
Description: This is the ‘official’ home page for distribution of the Porter Stemming Algorithm, written and maintained by its author, Martin Porter. The Porter stemming algorithm (or ‘Porter stemmer’) is …