Open Source Text Processing Project: Open Text Summarizer

Open Text Summarizer

Project Website:

Github Link: None

Description

Automatic text summarization is the technique, where a computer program summarizes a document. A text is put into the computer and a highlighted (summarized) text is returned. The Open Text Summarizer is an open source tool for summarizing texts. The program reads a text and decides which sentences are important and which are not. It ships with Ubuntu, Fedora and other linux distros. OTS supports many (25+) languages which are configured in XML files.
Several academic publications have benchmarked it and praised it.

OTS is both a library and a command line tool. Word processors such as AbiWord and KWord can link to the library and summarize documents while the command line tool lets you summarize text on the console. The program can either print the summarized text as text or HTML. If HTML, the important sentences are highlighted. The program is multi lingual and works with UTF-8 encoding.

The Open Text Summarizer summarizes texts in English, German, Spanish, Russian, Hebrew, Esperanto and other languages. To Support more languages or tweak existing languages can be done by simply editing an XML file of rules.


Leave a Reply

Your email address will not be published. Required fields are marked *