Open Source Text Processing Project: Lingua::Stem

Lingua::Stem – Stemming of words Project Website: http://search.cpan.org/~snowhare/Lingua-Stem-0.84/ Github Link: None Description This routine applies stemming algorithms to its parameters, returning the stemmed words as appropriate to the selected locale. You can import some or all of the class methods. … Continue reading

Open Source Text Processing Project: PTStemmer

PTStemmer – A Stemming toolkit for the Portuguese language Project Website: https://code.google.com/archive/p/ptstemmer/ Github Link: None Description FEATURES Java, Python, and .NET C# implementations of Orengo, Porter, and Savoy stemmers Fast: can stem more than 1.5M words/second on a normal desktop … Continue reading

Open Source Text Processing Project: OleanderStemmingLibrary

Oleander C++ stemming library Project Website: http://www.oleandersolutions.com/stemming/stemming.html Github Link: https://github.com/OleanderSoftware/OleanderStemmingLibrary Description Stemming is a normalization process used to reduce words down to their root. Stemming removes inflectional suffixes so that morphological variants of the same word can be compared more … Continue reading