Feature request: generalize gscholar.rb #2

Open
yannickwurm opened this issue Jul 23, 2013 · 4 comments
@yannickwurm

Hello,

your setup for finding and attaching PDFs to BibDesk entries seems very useful.
However, it also seems a bit laborious to go through all the keyboard shortcuts for each entry.
I wonder if you could automate it and remove the AppleScript dependencies, along the lines of the following pseudocode:

for each publication in mystuff.bib
     find relevant PDFs using google scholar
     if one PDF is clearly much more relevant than the others (e.g. exact title match)
           download the PDF
           put it in the appropriate place
           add it to the .bib file
     if several PDFs seem relevant 
           don't download or edit the .bib file
           instead just output the possibilities in stdout
     end
end 
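
The decision step in the pseudocode above (download on a single clear match, otherwise only list the options) could be sketched in Ruby roughly as follows. This is a minimal sketch under stated assumptions, not code from gscholar.rb: `Candidate`, `normalize`, and `choose_pdf` are hypothetical names, "clearly much more relevant" is reduced to "exactly one candidate whose normalized title equals the entry's title", and the search, download, and .bib editing steps are left out entirely.

```ruby
# Hypothetical record for one search result: its title and the URL of its PDF.
Candidate = Struct.new(:title, :pdf_url)

# Lowercase the title and collapse punctuation and whitespace, so that
# differences in case or punctuation do not prevent an "exact" title match.
def normalize(title)
  title.downcase.gsub(/[^a-z0-9]+/, ' ').strip
end

# Decide what to do for one bibliography entry.
# Returns [:download, candidate] when exactly one candidate matches the title,
# [:ambiguous, candidates] when there are candidates but no single clear match,
# and [:none, []] when there are no candidates at all.
def choose_pdf(entry_title, candidates)
  return [:none, []] if candidates.empty?

  wanted = normalize(entry_title)
  exact = candidates.select { |c| normalize(c.title) == wanted }
  return [:download, exact.first] if exact.size == 1

  [:ambiguous, candidates]
end
```

A caller would download and attach the PDF only for `:download`, and print the candidates to stdout for `:ambiguous`, leaving the .bib file untouched.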
@houshuang
Owner

I had thought about something similar, for example making it possible to
automatically search Google Scholar for the title of the currently selected
publication, find the PDF, download and attach it, etc. The problem is that
Google Scholar does not provide any official API, and in fact quite aggressively
resists being scraped (even when it's just downloading the first page of results).
I've argued several times that we need to build an alternative to Google
Scholar (as OpenStreetMap is to Google Maps), with open APIs, etc.,
exactly for this kind of purpose.

Stian

http://reganmian.net/blog -- Random Stuff that Matters

@yannickwurm
Author

Hi Stian,

What do you mean by "aggressively"? What if you put a 10-second sleep between requests?

Cheers,
Yannick

@houshuang
Owner

Literally stopping me from grabbing even one page without the proper
cookies, referer, browserid, etc. Feel free to experiment though, all my
code is totally open ;)
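
For anyone who does want to experiment, here is a minimal sketch of what attaching those headers to a request looks like with Ruby's standard `net/http`. Everything specific in it is an assumption for illustration: the `build_scholar_request` helper, the search URL, the header values, and the cookie string are placeholders, and nothing here has been checked against what Google Scholar actually accepts. The block only builds the request object and does not send anything.

```ruby
require 'net/http'
require 'uri'

# Build (but do not send) a GET request carrying browser-like headers.
# The URL and all header values are illustrative placeholders.
def build_scholar_request(title, cookie:, user_agent:)
  # Assumed search endpoint; the title is URL-encoded as the "q" parameter.
  uri = URI('https://scholar.google.com/scholar')
  uri.query = URI.encode_www_form(q: title)

  req = Net::HTTP::Get.new(uri)
  req['User-Agent'] = user_agent                     # browser identification
  req['Referer']    = 'https://scholar.google.com/'  # page we claim to come from
  req['Cookie']     = cookie                         # session cookie copied from a browser
  req
end

req = build_scholar_request('some paper title',
                            cookie: 'PLACEHOLDER=1',
                            user_agent: 'Mozilla/5.0 (placeholder)')
puts req.path
```

Sending it would go through `Net::HTTP.start(req.uri.host, req.uri.port, use_ssl: true) { |http| http.request(req) }`, with a `sleep` between successive requests as suggested above.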


@yannickwurm
Author

OK, thanks for the info.
Unfortunately my time is too limited these days!

cheers
yannick
