(c) 2013-2015 Open Knowledge Foundation (trading as Open Knowledge). Licensed under the AGPL
This is the code that powers the Open Data Index, an Open Knowledge initiative.
The Open Data Index displays a snapshot of data collected in an Open Data Survey.
This new Index codebase was written for the 2014 Global Survey, and can be used with any survey powered by the survey codebase.
The code generates a static site using Pelican, a Python-based static site generator.
Getting setup with the code is easy if you have some familiarity with Python and virtualenv.
- Setup a virtualenv for the project
- Install the dependencies:
pip install -r requirements.txt
- Install the CLI:
cd cli && python setup.py install && cd ../
- Prepare the data (download from the database):
odi prepare
- Populate the content sources:
odi populate
- Use
odi populate --limited
if you have a large amount of data (like the Global Index), and want a smaller set for local development (see Configuration below)
- Use
./develop_server.sh start
to run a server that watches and builds
Pelican documentation for more information
- Add a
config_instance.py
file to the root of the directory (this will not be version-controlled)- see
config_instance.example.py
for reference
- see
- All special config for the Open Data Index project is found on the ODI object
- Override any settings you wish in this file
- For example to use local database:
from config_default import *
ODI['database'] = {
'entries': 'http://global.dev.census.org:5000/api/entries/{year}.cascade.json',
'places': 'http://global.dev.census.org:5000/api/places/score/{year}.cascade.json',
'datasets': 'http://global.dev.census.org:5000/api/datasets/score/{year}.cascade.json',
'questions': 'http://global.dev.census.org:5000/api/questions.json',
}
- If you have a large dataset, rebuilding and watching on your local development server can be a pain. You can limit the build in this case wth the following settings, and run with
odi populate --limited
:
from config_default import *
ODI['limited']: {
'places': ['au'], # any place identifier
'datasets': ['timetables'] # any dataset identifier
}
Data is prepared by the python script. This pulls data
from the Open Data Index Survey (Census), processes it in various ways and then
writes it to the data
directory.
To run the script do:
odi prepare
Once the data has been prepared via the process.py
script, there is an additional step to create the source files that will be used to generate the static site. All source files for rendering pages live under content/pages
, and in this location, everything under datasets
, historical
and places
is generated by running the following command:
odi populate
If your your census data is sizable, or if you have many languages, Pelican's build times can be considerable, and this is a pain in development. To counter this, there is a --limited
flag for populate that will generate a subset of places
and datasets
(configured on the ODI object in the settings file):
odi populate --limited
Steps to create a snapshot for deployment::
odi populate
odi deploy
Data is found in the data
directory. This both powers the site build and is
usuable in its own right (as a Tabular Data Package).
The Open Data Index has a small set of tools and patterns for implementing visualisations. The first visualisation is the Choropleth map (described below), and this is the reference implementation for other visualisations to follow.
Visualisations have the following features:
- A permalink for the visualisation (e.g.: /vis/map/)
- An embeddable version of the visualisation (e.g.: /vis/map/embed/)
- Tools for sharing the visualisation to social networks
- Tools to filter data in the visualisation
- An interface to pass state to a visualisation via URL params
Displays Open Data Index data via a map interface.
This visualisation is embedded in the project home page, and also available via its permalink:
- /vis/map/
Or its embeddable version:
- /vis/map/embed/
The state of the map is configurable: internally, via the uiState object, and via query params passed on the URL to the map.
The use case for this is to enable embedding of a particular state. The uiState object has the following defaults:
uiStateDefaults = {
embed: {
width: '100%', // min-width for the map is 400px
height: '100%',// min-height for the map is 360px
title: 'Open Data Index'
},
filter: {
year: currentYear,
dataset: 'all'
},
panel: {
name: true,
tools: true,
help: true,
legend: true,
},
map: {
place: undefined
},
asQueryString: undefined
}
These defaults can be customized on initialisation of the map via URL params (a subset of the full uiState object):
filter_year
filter_dataset
panel_logo
panel_name
panel_tools
panel_help
panel_legend
map_place
embed_width
embed_height
embed_title
- 2014 / All datsets
http://staging.index.okfn.org/vis/map/embed/?embed_height=360
- 2014 / Government spending
http://staging.index.okfn.org/vis/map/embed?filter_dataset=spending&embed_height=360
Displays the Index data in a tabular format. This visualisation is not currently embeddedable.
There are two types of tables:
- Overview
- Slice
Open Data Index exposes a (simple) API for programmatic access to data. Currently, the API is available in both JSON and CSV formats.
- {format} refers to either
json
orcsv
Returns all entries in the database.
Returns entries sliced by year.
Available years:
- 2014
- 2013
Returns all datasets in the database.
Returns all datasets sliced by category, where category
is a slugified string of the dataset category.
Available categories:
- civic-information
- environment
- finance
- geodata
- transport
Returns all places in the database.
Returns all questions in the database.
There are two types of Open Data Index content to translate: strings in the codebase, and strings in the source data.
Translations are managed with the Transifex client:
http://docs.transifex.com/developer/client/
-
We use Babel for translations, and the commands are nicelt wrapped up in our CLI:
-
odi trans init
-
odi trans extract
-
odi trans update
-
odi trans compile
If migrating from a previous Index site to the new static site, you'll need to implement aliases from some old URLs to the new ones. For all common URLs related to the Open Data Index, this is taken care of via the alias
meta data per page. For the content sources that are not auto-generated (such as about, methodology, etc.), you can simply add a list of comma-separated URL paths to alias
.
The site features integration with Google Forms for feedback on the download page.