Open Source Text Processing Project: dchest-stemmer

Stemmer package for Go

Project Website: None

Github Link:

Description

Stemmer package provides an interface for stemmers and includes English and German stemmers as sub-packages:

porter2 sub-package implements English (Porter2) stemmer as described in http://snowball.tartarus.org/algorithms/english/stemmer.html

german sub-package implements German stemmer as described in http://snowball.tartarus.org/algorithms/german/stemmer.html

Installation

English stemmer:

go get github.com/dchest/stemmer/porter2
German stemmer:

go get github.com/dchest/stemmer/german
This will also install the top-level stemmer package.

Example

import (
“github.com/dchest/stemmer/porter2”
“github.com/dchest/stemmer/german”
)

// English.
eng := porter2.Stemmer
eng.Stem(“delicious”) // => delici
eng.Stem(“deliciously”) // => delici

// German.
ger := german.Stemmer
ger.Stem(“abhängen”) // => abhang
ger.Stem(“abhängiger”) // => abhang
Tests

Included test_output.txt and test_voc.txt are from the referenced original implementations, used only when running tests with go test.

License

2-clause BSD-like (see LICENSE and AUTHORS files).


Leave a Reply

Your email address will not be published. Required fields are marked *