Open Source Text Processing Project: nlp-with-ruby

nlp-with-ruby: Awesome NLP with Ruby
Project Website: None
Github Link: https://github.com/arbox/nlp-with-ruby
Description: This curated list comprises awesome resources, libraries, and information sources about computational processing of texts in human languages with Ruby. That field is often referred to as NLP, Computational …

Open Source Text Processing Project: pocketsphinx-ruby

pocketsphinx-ruby: Ruby speech recognition with Pocketsphinx
Project Website: None
Github Link: https://github.com/watsonbox/pocketsphinx-ruby
Description: This gem provides Ruby FFI bindings for Pocketsphinx, a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the …

Open Source Text Processing Project: Akiva

Akiva is a simple natural-language-processing, question-answering, artificial intelligence
Project Website: None
Github Link: https://github.com/thebigdb/akiva
Description: Akiva is a simple natural-language-processing, question-answering, artificial intelligence. Its main role is to take a question, deconstruct it in order to query the collaborative database …

Open Source Text Processing Project: stemmify

Ruby module that converts a word to its approximate root form with the Porter stemmer
Project Website: None
Github Link: https://github.com/raypereda/stemmify
Description: Ruby module that converts a word to its approximate root form with the Porter stemmer. For example, observing …
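
To give a rough feel for what reducing a word to an approximate root form means, here is a minimal sketch in plain Ruby. It is a toy suffix stripper, not the Porter algorithm and not stemmify's code; the method name `naive_stem` and the suffix list are assumptions made up for illustration.

```ruby
# Hypothetical helper: strips one common English suffix.
# This is NOT the Porter stemmer, which applies several ordered rule phases.
def naive_stem(word)
  suffixes = %w[ations ation ingly ing edly ed ers er es s]
  w = word.downcase
  suffixes.each do |suffix|
    # Strip the first matching suffix, but keep a stem of at least 3 letters.
    if w.end_with?(suffix) && w.length - suffix.length >= 3
      return w[0...-suffix.length]
    end
  end
  w
end

puts naive_stem("observing")  # prints "observ"
puts naive_stem("walked")     # prints "walk"
```

A real stemmer handles many more cases (doubled consonants, vowel checks, chained suffixes), which is why a tested library is preferable in practice.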

Open Source Text Processing Project: fast-stemmer

Fast Porter stemmer based on a C version of the algorithm
Project Website: None
Github Link: https://github.com/romanbsd/fast-stemmer
Description: Fast-stemmer is simply a wrapper around a multithreaded Porter stemming algorithm. This gem adds a String#stem method, and it conflicts with the stemmer …

Open Source Text Processing Project: Ruby-Stemmer

Ruby-Stemmer exposes the Snowball API to Ruby
Project Website: http://locknet.ro/archive/2009-10-29-ann-ruby-stemmer.html
Github Link: https://github.com/aurelian/ruby-stemmer
Description: This package includes the libstemmer_c library, released under the BSD licence and available for free here. Support for the Latin language is also included and it has been generated with …

Open Source Text Processing Project: SWING

SWING: An Open-Source Text Summarizer from WING
Project Website: http://wing.comp.nus.edu.sg/downloads/swing/
Github Link: https://github.com/WING-NUS/SWING
Description: The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization system. It produces informative summaries from …

Open Source Text Processing Project: WhatLanguage

WhatLanguage: A language detection library for Ruby that uses bloom filters for speed.
Project Website: None
Github Link: https://github.com/peterc/whatlanguage
Description: Text language detection. Quick, fast, memory efficient, and all in pure Ruby. Uses Bloom filters for aforementioned speed and memory …
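
As background on why a Bloom filter helps here, the following is a minimal pure-Ruby sketch of the data structure: a fixed-size bit array plus several hash functions, which answers "probably present" or "definitely absent" without storing the words themselves. It is not WhatLanguage's implementation; the class `TinyBloomFilter`, the helper `hits`, the sizes, and the five-word list are all assumptions for illustration.

```ruby
require 'digest'

# Hypothetical minimal Bloom filter; not taken from any library.
class TinyBloomFilter
  def initialize(size: 1024, hashes: 3)
    @size = size
    @hashes = hashes
    @bits = Array.new(size, false)
  end

  def add(item)
    positions(item).each { |i| @bits[i] = true }
    self
  end

  # May report a false positive, but never a false negative.
  def include?(item)
    positions(item).all? { |i| @bits[i] }
  end

  private

  # Derive one bit position per hash by salting an MD5 digest with an index.
  def positions(item)
    (0...@hashes).map do |seed|
      Digest::MD5.hexdigest("#{seed}:#{item}").to_i(16) % @size
    end
  end
end

# Count how many words of a text the filter claims to know.
def hits(filter, text)
  text.downcase.scan(/[a-z]+/).count { |w| filter.include?(w) }
end

english = TinyBloomFilter.new
%w[the and of to in].each { |w| english.add(w) }
puts english.include?("the")  # prints "true"
```

One plausible way to use such filters for language detection is to build one per language from a word list and pick the language whose filter reports the most hits for the input text; whether WhatLanguage scores exactly this way is not stated in the excerpt above.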

Text Processing Book: Text Processing with Ruby

Text Processing with Ruby: Extract Value from the Data That Surrounds You
Description: Text is everywhere. Web pages, databases, the contents of files: for almost any programming task you perform, you need to process text. Cut even the most complex text-based …