You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When dealing with the scenario of personalized recommendation, the user profile vector set usually are very large, for example ~10m vectors or even above. We take 10m vectors as a baseline, because it still possible to store all the 10m data into one physical machine.
When dealing with the scenario of personalized recommendation, the user profile vector set usually are very large, for example ~10m vectors or even above. We take 10m vectors as a baseline, because it still possible to store all the 10m data into one physical machine.
10m vectors * 2048 dimensions * 4 byte float = 80 G memory
Current solution does not fit into the level, because write latency would be ~30s which is not acceptable.
One idea is that: we do not recommend for a single users, but for a cluster of similar users.
Two choices: online KMeans or SimHash?
The text was updated successfully, but these errors were encountered: