Open Source Text Processing Project: PTStemmer

PTStemmer – A Stemming toolkit for the Portuguese language Project Website: https://code.google.com/archive/p/ptstemmer/ Github Link: None Description FEATURES Java, Python, and .NET C# implementations of Orengo, Porter, and Savoy stemmers Fast: can stem more than 1.5M words/second on a normal desktop … Continue reading

Open Source Text Processing Project: OleanderStemmingLibrary

Oleander C++ stemming library Project Website: http://www.oleandersolutions.com/stemming/stemming.html Github Link: https://github.com/OleanderSoftware/OleanderStemmingLibrary Description Stemming is a normalization process used to reduce words down to their root. Stemming removes inflectional suffixes so that morphological variants of the same word can be compared more … Continue reading

Open Source Text Processing Project: The Porter Stemming Algorithm

The Porter Stemming Algorithm Project Website: http://tartarus.org/martin/PorterStemmer/ Github Link: None Description This is the ‘official’ home page for distribution of the Porter Stemming Algorithm, written and maintained by its author, Martin Porter. The Porter stemming algorithm (or ‘Porter stemmer’) is … Continue reading