Consider revisiting pkg download algorithms #358
Comments
Thanks, that's an interesting idea. We are indeed using the 'same' method as Arun in his For reference, the code is here: We'll look into Matt's idea
Thanks for looking into it Ludovic. I did not know RDocumentation-app was here with an Issues tracker so this is great. I had emailed Datacamp last year and had been following up over email. Much better here on GitHub instead.
I believe the proposed adjustment is almost trivial. It's just a Here's that plot where
@ludov04 isn't this long addressed by now?
@filipsch If it's resolved, that's great. I just assumed
The algorithm was indeed revisited sometime in August to exclude repeated downloads from the same IP on the same day. However, as @mattdowle mentioned, it's unlikely that viridisLite, pillar and R6 are the top downloaded packages, so I guess something else is biasing that data.
A series of emails from Matt Dowle. He thinks that the current algorithm overweights automatic downloads, and that counting multiple downloads from a single IP address in one day as a single download would give a more accurate representation of how many people are using which packages.
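For illustration only, here is a minimal Python sketch of the counting rule described above: several downloads of one package from the same IP on the same day are collapsed into one. The record layout `(date, ip_id, package)` and the function name `count_unique_daily_downloads` are assumptions made for this example; this is not the code RDocumentation actually runs.

```python
from collections import Counter


def count_unique_daily_downloads(records):
    """Count downloads per package, treating repeated downloads of the
    same package from the same IP on the same day as a single download.

    `records` is an iterable of (date, ip_id, package) tuples.
    This layout is hypothetical and chosen only for the example.
    """
    # A set keeps one copy of each distinct (date, ip_id, package) triple.
    unique_records = set(records)
    return Counter(package for _, _, package in unique_records)


# Hypothetical log: ip3 fetches R6 three times on one day (for example,
# an automated build), while data.table is fetched by two different IPs.
log = [
    ("2018-03-01", "ip1", "data.table"),
    ("2018-03-01", "ip1", "data.table"),
    ("2018-03-01", "ip2", "data.table"),
    ("2018-03-02", "ip1", "data.table"),
    ("2018-03-01", "ip3", "R6"),
    ("2018-03-01", "ip3", "R6"),
    ("2018-03-01", "ip3", "R6"),
]

raw_counts = Counter(package for _, _, package in log)
unique_counts = count_unique_daily_downloads(log)

print(raw_counts["data.table"], unique_counts["data.table"])
print(raw_counts["R6"], unique_counts["R6"])
```

In this toy log the raw count for R6 is inflated by one IP fetching it repeatedly, while the deduplicated count treats that as a single download. As the later comment notes, this rule alone may not remove every source of bias.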