-
Notifications
You must be signed in to change notification settings - Fork 79
Open
Description
Have you considered speed improvements that might result from computing distances between two corpora (queries and document collection)? With cosine similarity, this is simply a dot product between two term-document matrices. With network flows, perhaps a large network, where the same words in different documents would be distinct nodes, could be constructed? Running a for cycle over nearest_neighbors is not very fast even with the heuristics you implemented.
Metadata
Metadata
Assignees
Labels
No labels