Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Beautiful Soup Warning Making it to stdout #618

Closed
liquidsec opened this issue Jul 20, 2023 · 3 comments
Closed

Beautiful Soup Warning Making it to stdout #618

liquidsec opened this issue Jul 20, 2023 · 3 comments
Assignees
Labels
bug Something isn't working

Comments

@liquidsec
Copy link
Collaborator

liquidsec commented Jul 20, 2023

The following message is occasionally making it into the scan stdout:

/home/****/.cache/pypoetry/virtualenvs/bbot-IFSyk-JB-py3.10/lib/python3.10/site-packages/bs4/builder/__init__.py:545: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument `features="xml"` into the BeautifulSoup constructor.
  warnings.warn(
@liquidsec liquidsec added the bug Something isn't working label Jul 20, 2023
@liquidsec
Copy link
Collaborator Author

Also spotted, Possibly related:

/opt/bbot/bbot/core/helpers/web.py:348: MarkupResemblesLocatorWarning: The input looks more like a filename than markup. You may want to open this file and pass the filehandle into Beautiful Soup.
  soup = BeautifulSoup(html, "html.parser")

@TheTechromancer
Copy link
Collaborator

This is also an open issue in Wappalyzer:

chorsley/python-Wappalyzer#85

@TheTechromancer
Copy link
Collaborator

Fixed in a90dca0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants