Resistance Lab Datasets

This repository contains both datasets we have collected as well as scripts used to collect them.

Layout of this repository

Main folders: contain the source, a sensible cleanup, and then a folder for each distinct output.

source_data: raw data sources from government and other state agency websites
cleaned_data: source files tidied into cleaner formats for easier comparison
analysis: a set of folders with workspace environments for each specific output
- analysis/0001-use-of-force: One directory per use case, first one given as an example.
  - analysis/0001-use-of-force/README.md: Description of this analysis, where it's used, author info, etc.
  - analysis/0001-use-of-force/Makefile: A makefile for generating this analysis (make)
  - analysis/0001-use-of-force/force-mappings.csv: Mappings to make source metadata more descriptive and easier to read
  - analysis/0001-use-of-force/use-of-force.py: Script to create the outputs in this directory
  - analysis/0001-use-of-force/**/*: Outputs generated by script

Utility folders

.github: actions to test the pipelines
bibliography: BibTeX files
pipelines: populates source directory, cleans data (run make pipelines to run them all)

Contributing

If you have relevant datasets then we would like to include them here. We expect datasets to:

Be automated where possible, with a script in the scripts directory
Come with Great Expectations test suites
Be well documented with README files

Feel free to open a ticket or email [email protected] with any questions.

Testing

Tests are provided using Great Expectations. You will need a recent version of Python installed to use this. The rest of the dependencies can then be installed with:

Run python3 -m venv venv && source venv/bin/activate to create a virtual environment
Install the dependencies with pip3 install -r requirements.txt
Run great_expectations init to create any missing directories

To create a test suite for your new dataset run great_expectations suite new

To edit a test suite run great expectations suite edit police-population.warning

To run the tests and show the results run great_expectations docs build

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
.github/workflows		.github/workflows
_includes		_includes
analysis		analysis
assets		assets
bibliography/0001-taser-report		bibliography/0001-taser-report
cleaned_data		cleaned_data
great_expectations		great_expectations
pipelines		pipelines
source_data		source_data
tests		tests
.flake8		.flake8
.gitignore		.gitignore
CNAME		CNAME
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
Makefile		Makefile
README.md		README.md
_config.yml		_config.yml
_headers		_headers
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Resistance Lab Datasets

Layout of this repository

Contributing

Testing

About

Releases

Packages

Contributors 3

Languages

Resistance-Lab/resistancelab_data

Folders and files

Latest commit

History

Repository files navigation

Resistance Lab Datasets

Layout of this repository

Contributing

Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages