Checking and save mode #10
Comments
I may consider adding an option to skip URLs that have already been captured, but I probably wouldn't include any behavior that depends on Lynx, since it isn't available by default on the macOS command line. In the meantime, you could filter the list of URLs before running spn.sh, which might be faster anyway, since you can usually get results for a whole domain with the CDX API.
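As a rough illustration of the pre-filtering idea, here is a minimal sketch that is not part of spn.sh. It assumes the public CDX endpoint at `https://web.archive.org/cdx/search/cdx` with its `url`, `matchType`, `fl` and `collapse` parameters; the function names `fetch_captured` and `filter_uncaptured` are hypothetical. URLs are compared after dropping the scheme and a trailing slash, which is a simplification and will miss other variants (such as `www.` prefixes or explicit ports).

```shell
# Print every URL the Wayback Machine has captured under a domain, one per line.
# Assumption: the CDX API accepts these query parameters and returns plain text.
fetch_captured() {
    curl -s --get "https://web.archive.org/cdx/search/cdx" \
        --data-urlencode "url=$1" \
        -d "matchType=domain" -d "fl=original" -d "collapse=urlkey" < /dev/null
}

# Print the lines of the candidate file ($1) that have no match in the
# captured file ($2). Matching ignores the scheme and a trailing slash.
filter_uncaptured() {
    awk -v captured="$2" '
        # Reduce a URL to a comparison key (simplified normalization).
        function key(u) { sub(/^https?:\/\//, "", u); sub(/\/$/, "", u); return u }
        # Load the captured URLs first, so an empty file is handled correctly.
        BEGIN { while ((getline line < captured) > 0) seen[key(line)] = 1 }
        # Keep only candidates whose key was not seen among the captures.
        !(key($0) in seen)
    ' "$1"
}

# Example usage (performs a network request, so it is left commented out):
#   fetch_captured example.com > captured.txt
#   filter_uncaptured urls.txt captured.txt > to_capture.txt
```

The filtered list (`to_capture.txt` in the commented example) would then be the input for spn.sh. One CDX request covers the whole domain, which is why this can be faster than checking each URL individually.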
Can't you get this behavior by using |
Yes, in theory you should be able to just do that. Sometimes when I use it, though, it causes the script to incorrectly mark the capture as failed because it gets the info in the initial JSON response when submitting the URL, which the script isn't expecting. I should probably fix this.
Hi, overcast07
I borrowed part of your code and combined it with lynx to make a script. The script collects all the URLs on a website in a loop and queries the Wayback Machine's API for each one. It submits a capture request only when the Wayback Machine has no existing backup of that link. This makes my backups much faster, and maybe this mode could be added to your scripts ~
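The check-before-save loop described here could look roughly like the sketch below. This is not the script mentioned above, which isn't shown in this thread. It assumes the Wayback Machine availability endpoint at `https://archive.org/wayback/available`, whose JSON response contains `"available": true` when a snapshot exists. The function names are hypothetical, and the JSON is matched with `grep` rather than a real parser to avoid extra dependencies.

```shell
# Fetch the availability JSON for one URL.
# Assumption: the endpoint takes a single "url" query parameter.
check_url() {
    curl -s --get "https://archive.org/wayback/available" \
        --data-urlencode "url=$1" < /dev/null
}

# Succeed if the availability JSON on stdin reports an existing snapshot.
has_snapshot() {
    grep -q '"available": *true'
}

# Read URLs from stdin and print only those with no snapshot yet.
urls_needing_capture() {
    while IFS= read -r url; do
        if ! check_url "$url" | has_snapshot; then
            printf '%s\n' "$url"
        fi
    done
}

# Example usage (performs network requests, so it is left commented out):
#   urls_needing_capture < urls.txt > to_capture.txt
```

This makes one request per URL, so for a large site the per-domain CDX query is likely to be quicker. The per-URL check is simpler when the list spans many unrelated domains.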