gensim-simserver: Document similarity server, using gensim
Project Website:
Github Link:
Description
Index plain text documents and query the index for semantically related documents.
Simserver uses transactions internally to provide a robust and scalable similarity server.
Conceptually, a service that lets you :
train a semantic model from a corpus of plain texts (no manual annotation and mark-up needed)
index arbitrary documents using this semantic model
query the index for similar documents (the query can be either an id of a document already in the index, or an arbitrary text)