Hawkeye is a project security, vulnerability and general risk highlighting tool. It has a few goals:
- Designed to be entirely extensible by just adding new modules with the correct signature to lib/modules
- Modules return results via a common interface, which permits consolidated reporting and artefact generation
- Should be very easy to run regardless of the type of project that you're scanning
- As of version
1.0.0
many of the modules have had their identifiers changed and prefixed for langues added to them, for examplensp
is nownode-nsp
. This means you will need to udpate your.hawkeyerc
files, and any commands where you explicitly specify modules eghawkeye scan -m thing
.
Modules are basically little bits of code that either implement their own logic, or wrap a third party tool and standardise the output. They only run if the required criteria are met, for example; the nsp
module would only run if a package.json
is detected in the scan target - as a result, you don't need to tell Hawkeye what type of project you are scanning. The modules implemented so far are:
- File Names (files): Scan the file list recursively, looking for patterns as defined in data.js, taken from gitrob. We're looking for things like
id_rsa
, things that end inpem
, etc. - File Content Patterns (contents): Looks for patterns as defined in data.js within the contents of files, things like 'password: ', and 'BEGIN RSA PRIVATE KEY' will pop up here.
- File Content Entropy (entropy): Scan files for strings with high (Shannon) entropy, which could indicate passwords or secrets stored in the files, for example: 'kwaKM@£rFKAM3(a2klma2d'
- Credit Card Numbers (ccnumber): Scan for credit card numbers in files, validated using luhn.
- Node Security Project (node-nsp): Wraps Node Security Project to check your package.json for known vulnerabilities.
- NPM Check Updates (node-ncu): Wraps NPM Check Updates to check your package.json for outdated modules.
- CrossEnv (node-crossenv): See Node Cross-Env Malware. Checks your package.json for known malicious modules which contain this malware.
- Constant Hash Tables (node-chs): See Node Constant Hashtable. Checks if your package.json can be run against vulnerable versions of node.
- Bundler Audit (ruby-bundler-scan): Wraps Bundler Audit to check your Gemfile/Gemfile.lock for known vulnerabilities.
- Brakeman (ruby-brakeman): Wraps Brakeman for static analysis security vulnerability scanner for Ruby on Rails applications.
- Safety (python-safety): Wraps Safety to check your requirements.txt for known vulnerabilities. Unfortunately, safety does not provides a risk level classification of the vulneravilities, so every vulnerability is logged as high.
- Piprot (python-piprot): Wraps Piprot to check your requirements.txt for outdated dependencies.
- Bandit (python-bandit): Wraps Bandit to find common security issues in python code.
- FindSecBugs (java-find-secbugs): Wraps FindSecBugs to find common security issues in Java Projects. It analyzes the jar generated after performing
mvn package
orgradle stage
. - OWASP Dependency Check (java-owasp): Wraps OWASP Dep Check to find common security issues in Java Project Dependencies as noted by the National Vulnerability Database. It analyzes the jar generated after performing
mvn package
orgradle stage
.
I really, really do welcome people writing new modules so please check out lib/modules/example-shell/index.js as an example of how simple it is, and send me a pull request.
- Entropy is disabled by default because it can return a lot of results, which are mostly misses, to run it please use the
-m entropy
switch. Personally I use this manually checking over code bases I have inherited. - We only look inside the contents of files up to 1Mb, I plan to add configuration options in the future to allow you to change this.
I wanted Hawkeye to be as flexible as possible, as a result it supports numerous methods of execution.
As Hawkeye wraps a lot of other dependencies, the easiest way to run it is using docker. Simply type docker run --rm -v $PWD:/target stono/hawkeye
.
Lets say you have project which has a Dockerfile
, with lines like this in:
COPY . /app
VOLUME /app
You could add hawkeye to your compose file like this:
services:
app:
build: .
hawkeye:
image: stono/hawkeye
command: scan -t /app
volumes_from:
- app
You can simply do docker-compose run --rm --no-deps hawkeye
. Woo hoo.
If you want, you can do npm install -g hawkeye-scanner
, and type hawkeye scan
instead. However, this will require you to have all of the other dependencies installed on your host for the modules that you're wanting to scan with.
This is an example of running Hawkeye against one of your projects in GoCD:
<pipeline name="security-scan">
<stage name="Hawkeye" cleanWorkingDir="true">
<jobs>
<job name="scan">
<tasks>
<exec command="docker">
<arg>pull</arg>
<arg>stono/hawkeye</arg>
<runif status="passed" />
</exec>
<exec command="bash">
<arg>-c</arg>
<arg>docker run --rm -v $PWD:/target stono/hawkeye</arg>
<runif status="passed" />
</exec>
</tasks>
</job>
</jobs>
</stage>
</pipeline>
This is an example of running Hawkeye from an npm package.json against a local repository before commit, failing the commit if high or critical issues are found:
{
"name": "demoproj",
"version": "1.0.0",
"description": "demo",
"main": "app.js",
"dependencies": {
"express": "4.16.2"
},
"devDependencies": {
"pre-commit": "^1.2.2"
},
"scripts": {
"hawkeye:pre-commit": "hawkeye scan -t ./src -m contents -m files -f high"
},
"pre-commit": [
"hawkeye:pre-commit"
],
"author": "",
"license": "ISC"
}
As of version 0.9.0
, you can use the familiar .hawkeyerc
and .hawkeyeignore
pattern in your project root.
This file takes all the same options as hawkeye scan --help
. In this example, we'll run the contents
, entropy
, files
, ncu
and nsp
{
"modules": ["contents", "entropy", "files", "node-ncu", "node-nsp"],
"failOn": "medium"
}
This file should be a collection of patterns to exclude from the scan, and is equivalent to running --exclude
.
^test/
README.md
There are a few options available:
Hawkeye by default will attempt to detect a .git folder in your target, if it is there it will only scan git tracked files. Further to that, if a .git-crypt folder is detected, we will also exclude files which are GPG encrypted. If there is no .git in the target directory, then all files will be scanned.
You can override this behaviour with the --all
flag, which will scan all files regardless.
From a pipeline perspective, the --fail-on
command is useful, you might not wish for low
items to break your build, so you could use --fail-on medium
.
By default Hawkeye will look in your current working directory. You can override this behaviour though by specifying a --target
If you want to run specific modules only, you can use the --module
flag, which can be specified multiple times. For example hawkeye scan -m nsp -m ncu
would run just the nsp and ncu modules.
The --json
paramter allows you to write a much more detailed report to a file. See the Json section below for more information
-s, --sumologic http://sumologic-http-collector: Send the results to SumoLogic
This will post the results to a SumoLogic HTTP collector. See the SumoLogic section below for more information.
This parameter (which can be specified multiple times) allows you to specify patterns you wish to be excluded from the scan. For example hawkeye scan -e "^test/"
would exclude all your test files. All paths are relative to the --target
.
There are some global exclusions in place, and those are "^.git", "^.git-crypt" and "^node_modules".
The --file-limit
allows you to set a higher file limit thab the default (1000). This is useful when the target directory includes more files.
You can view the module status with hawkeye modules
. As previously mentioned you can see that entropy is disabled by default. If you want to run it, use the -m entropy
flag.
$ hawkeye modules
[info] Welcome to Hawkeye v0.11.0!
[info] Bundler Scan dynamically loaded
[info] File Contents dynamically loaded
[info] Entropy dynamically loaded
[info] Example Module dynamically loaded
[info] Secret Files dynamically loaded
[info] Node Check Updates dynamically loaded
[info] Node Security Project dynamically loaded
Module Status
[info] Enabled: Bundler Scan (bundlerScan)
Scan for Ruby gems with known vulnerabilities
[info] Enabled: File Contents (contents)
Scans files for dangerous content
[info] Disabled: Entropy (entropy)
Scans files for strings with high entropy
[info] Disabled: Example Module (example)
Example of how to write a module and shell out a command
[info] Enabled: Secret Files (files)
Scans for known secret files
[info] Enabled: Node Check Updates (ncu)
Scans a package.json for out of date packages
[info] Enabled: Node Security Project (nsp)
Scans a package.json for known vulnerabilities from NSP
At the moment, Hawkeye supports three output writers.
The default summary output to your console looks something like this. The log information is written to stdout
and the errors found to stderr
in a console parsable tablular output
$ hawkeye scan
[info] Welcome to Hawkeye v0.11.0!
[info] File Contents dynamically loaded
[info] Entropy dynamically loaded
[info] Example Module dynamically loaded
[info] Secret Files dynamically loaded
[info] Node Check Updates dynamically loaded
[info] Node Security Project dynamically loaded
[info] git repo detected, will only use git tracked files
[info] git-crypt detected, excluding files covered by GPG encryption
[info] -> git-crypt status -e
[info] Files excluded by git-crypt: 0
[info] -> git ls-tree --full-tree --name-only -r HEAD
[info] Files included in scan: 62
[info] Target for scan: /Users/kstoney/git/stono/hawkeye
[info] Fail at level: low
[info] Running module File Contents
[info] Running module Secret Files
[info] Running module Node Check Updates
[info] -> /Users/kstoney/git/stono/hawkeye/node_modules/npm-check-updates/bin/ncu -j
[info] Running module Node Security Project
[info] -> /Users/kstoney/git/stono/hawkeye/node_modules/nsp/bin/nsp check --reporter json
[info] scan complete, 16 issues found
[info] Doing writer: console
level description offender mitigation
-------- ----------------------------------------------------------------- -------------------------------- -------------------------------------------------
critical Incorrect Handling of Non-Boolean Comparisons During Minification uglify-js https://nodesecurity.io/advisories/39
critical Private SSH key regex_rsa Check contents of the file
critical Private SSH key id_rsa Check contents of the file
critical Potential cryptographic private key cert.pem Check contents of the file
critical Private key in file some_file_with_private_key_in.md Check line number: 1
high Regular Expression Denial of Service negotiator https://nodesecurity.io/advisories/106
high Module is one or more major versions out of date nodemailer Update to 4.0.1
high GNOME Keyring database file keyring Check contents of the file
medium Regular Expression Denial of Service uglify-js https://nodesecurity.io/advisories/48
medium Module is one or more minor versions out of date express Update to 4.15.2
medium Rubygems credentials file gem/credentials Might contain API key for a rubygems.org account.
medium Module is one or more minor versions out of date morgan Update to 1.8.1
medium Module is one or more minor versions out of date serve-favicon Update to 2.4.2
medium Module is one or more minor versions out of date body-parser Update to 1.17.1
medium Module is one or more minor versions out of date debug Update to 2.6.3
low Contains words: private, key some_file_with_private_key_in.md Check contents of the file
I plan to add options to supress log outputs etc in the future, but for now if you want to parse this output, you can supress the logs and just output the table like this:
$ (hawkeye scan >/dev/null) 2>&1 | tail -n +3
critical Incorrect Handling of Non-Boolean Comparisons During Minification uglify-js https://nodesecurity.io/advisories/39
critical Private SSH key regex_rsa Check contents of the file
critical Private SSH key id_rsa Check contents of the file
critical Potential cryptographic private key cert.pem Check contents of the file
critical Private key in file some_file_with_private_key_in.md Check line number: 1
high Regular Expression Denial of Service negotiator https://nodesecurity.io/advisories/106
high Module is one or more major versions out of date nodemailer Update to 4.0.1
high GNOME Keyring database file keyring Check contents of the file
medium Regular Expression Denial of Service uglify-js https://nodesecurity.io/advisories/48
medium Module is one or more minor versions out of date express Update to 4.15.2
medium Rubygems credentials file gem/credentials Might contain API key for a rubygems.org account.
medium Module is one or more minor versions out of date morgan Update to 1.8.1
medium Module is one or more minor versions out of date serve-favicon Update to 2.4.2
medium Module is one or more minor versions out of date body-parser Update to 1.17.1
medium Module is one or more minor versions out of date debug Update to 2.6.3
low Contains words: private, key some_file_with_private_key_in.md Check contents of the file
Here are some other handy examples:
(hawkeye scan >/dev/null) 2>&1 | tail -n +3 | grep critical
- output just critical items
Another option is for you to use a different output writer, for example...
You can output much more information in the form of a JSON artefact that groups by executed module.
Check out a sample here
The output of Hawkeye can be sent to a SumoLogic HTTP collector of your choice. In this example, I have a collector of hawkeye
, with a single HTTP source.
hawkeye scan --sumo https://collectors.us2.sumologic.com/receiver/v1/http/your-http-collector-url
...
[info] Doing writer: sumologic
[info] sending 16 results to SumoLogic
And in sumo logic, search for _collector="hawkeye" | json auto
:
The idea is that this project should be super extensible, I want people to write new handlers with ease. Simply create a handler in lib/modules
which exposes the following signature:
- key: A short alphanumeric key for your module
- name: The name of your module
- description: The description of your module
- enabled: True or Fale as to if this module should run by default, or if it needs to be specified with
--module
- function handles(path): A function to decide if this handler should run against the target path
- function run(results, done): The function which is called if handles returns true
The first argument passed is results
, this is where the module should send its results to, it exposes four methods for each 'level' of issue found, critical
, high
, medium
and low
. Those methods expect you to pass something like this:
results.critial('offender', 'description', 'extra', { additional: 'data' });
The best used of Hawkeye is in your pipeline. However, you now you can use this hosted version to get scanning even faster!. Some of the benefits of the hosted version are:
- Free (obviously)
- Links to GitHub and scans both Public and Private repositories
- GitHub "Push" scanning
- On-demand one-click scanning
- Scheduled scanning (great for continually checking for new issues in your deployed applications)
- Email notifications when we find new issues
- Consolidated dashboards
It's worth noting that as the SaaS version is not a build server, any modules which require builds will not run (for example; java vulnerability checkers that scan JAR files). For those, you should be putting it in your pipeline.