You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Unfortunately this is not possible with wget itself, so the only point of such an option would be for the script to delete the directories afterwards in order to make the ZIM file smaller.
I am not sure if that would be so helpful. What do you think?
It is somewhat far-fetched, but you could set up Privoxy with "https-inspection" enabled and put "--no-check-certificate -e use_proxy=yes -e http_proxy=127.0.0.1:8118" into the wget command. This way the proxy would be able to read your HTTP requests and you could set it up to block the URL paths you want.
I suppose this is a much more desirable result than deleting the folders afterwards, if you really need it.
For example in my run of http://www.someweb.com I would like to exclude all of http://www.someweb.com/boringnotes/ from being crawled/written since there is nothing of interest to me there.
The text was updated successfully, but these errors were encountered: