relevanced is a multithreaded C++ server providing efficient document similarity scoring and classification as a networked service.
It is built on the well-established vector space model and helps you answer questions like:
- How similar are these two documents?
- How similar is this new document to a collection of other documents?
- Which of several groups of documents is most like this new document?
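
To make the vector space model concrete, here is a minimal Python sketch of how each of the three questions above can be framed as a cosine similarity between term vectors. It is an illustration of the general technique only, not relevanced's implementation or client API: the function names (`term_vector`, `cosine`, `group_vector`, `most_similar_group`), the regex tokenizer, and the use of raw term counts as weights are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Map a document to raw term counts (a bag-of-words vector).

    Assumption: lowercase alphanumeric tokens with unweighted counts.
    A real system would likely add stemming, stop words, or TF-IDF.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def group_vector(texts):
    """Sum the term vectors of a group of documents into one vector.

    Cosine similarity ignores vector length, so the sum points in the
    same direction as the mean (centroid) of the group.
    """
    total = Counter()
    for text in texts:
        total.update(term_vector(text))
    return total


def most_similar_group(text, groups):
    """Return the name of the group whose combined vector is closest to text.

    `groups` maps a group name to a list of document strings.
    """
    doc = term_vector(text)
    return max(groups, key=lambda name: cosine(doc, group_vector(groups[name])))
```

With these helpers, `cosine(term_vector(a), term_vector(b))` compares two documents, `cosine(term_vector(new), group_vector(collection))` compares a new document to a collection, and `most_similar_group(new, groups)` picks the closest of several groups.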