Open Source Text Processing Project: stemmify

Ruby module that converts a word to its approximate root form with the Porter stemmer

Project Website: None

Github Link:

Description

Ruby module that converts a word to its approximate root form with the Porter stemmer. For example, observing and observation reduce to observ.

This is a Rails gem for reducing words to their roots. For example, all the following words to are stemmed to “observ”, which is not a real word in this case: observance observances observancy observant observants observation observe observed observer observers observing observingly

The algorithm used here is based on the Porter stemmer. You can read more about Martin Porter’s stemmer at

tartarus.org/~martin/PorterStemmer/

Martin Porter explains:

The Porter stemming algorithm (or ‘Porter stemmer’) is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems.


Leave a Reply

Your email address will not be published. Required fields are marked *