Open Source Text Processing Project: gensim-simserver

gensim-simserver: Document similarity server, using gensim

Project Website:
Github Link:

Description

Index plain text documents and query the index for semantically related documents.

Simserver uses transactions internally to provide a robust and scalable similarity server.

Conceptually, a service that lets you :

train a semantic model from a corpus of plain texts (no manual annotation and mark-up needed)
index arbitrary documents using this semantic model
query the index for similar documents (the query can be either an id of a document already in the index, or an arbitrary text)


Leave a Reply

Your email address will not be published. Required fields are marked *