Wappalyzer boefje detects less software instances than before #3037
Labels
boefjes
Issues related to boefjes
regression
Something is broken that explicitly worked before
😸 Review/QA feedback
Review/QA feedback provided
Describe the bug
With the replacement to the Wappalyzer boefje (#2727) some of the findings from the original Wappalyzer boefje disappeared. This is due to changes in the 'technologies.json' files. A result from this is less Software oois identified and thus less findings relating to outdated software components.
In the old version various jQuery objects were identified. With the new version these objects are not identified.
Expected behavior
At minimum, the wappalyzer boefje should be able to identify all common software instances identified on websites, including the software versions.
OpenKAT version
commit 233dc34 (HEAD -> main, origin/main, origin/HEAD)
Current solution idea
As discussed with @underdarknl - the current wappalyzer boefje is installed using the requirements.txt. The idea is to fork the wappalyzer branch and add more/newer technologies.json files from other projects. This should hopefully solve the reduced number of findings observed by wappalyzer.The python implementation of our Wappalyzer uses a different format in the technologies file than the 'original' Wappalyzer version. There is a fork of the original Wappalyzer that seems to be active by HTTArchive. This would require adjusting the parsing of the Wappalyzer boefje to match these formats. See: https://github.com/HTTPArchive/wappalyzer
The text was updated successfully, but these errors were encountered: