You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Verified during a gleaner run that not all sitemap urls would transfer to S3 in summoned/ subdirectory. Upon inspection from the following error it appears that internal/summoner/sitemaps/sitemap_ng.go:DomainSitemap() for some reason appends bad characters (i.e. \u003nil\0003e) to URL causing gleaner to be unable to fetch with following error (example):
Verified during a gleaner run that not all sitemap urls would transfer to S3 in summoned/ subdirectory. Upon inspection from the following error it appears that internal/summoner/sitemaps/sitemap_ng.go:DomainSitemap() for some reason appends bad characters (i.e. \u003nil\0003e) to URL causing gleaner to be unable to fetch with following error (example):
logs/gleaner-2023-12-20-20-44-34.log:{"file":"/home/runner/work/gleaner/gleaner/internal/summoner/acquire/acquire.go:299","func":"github.com/gleanerio/gleaner/internal/summoner/acquire.getDomain.func2","level":"error","msg":"#112 bad url https://geoconnex.us/ca-gage-assessment/gages/LMN\u003cnil\u003e","time":"2023-12-20T20:44:48Z"}
The code attempts to fix this extraneous trailing whitespace by performing strings.Trimpspace but it doesn't remove these characters evidently.
The text was updated successfully, but these errors were encountered: