Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

merge pull request #39

Open
wants to merge 111 commits into
base: clean-up-contributions
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
111 commits
Select commit Hold shift + click to select a range
2415521
add basic trello models
Feb 25, 2016
c98e066
move datatype coersions and field renaming
Feb 25, 2016
d89fb46
test fixes
Feb 25, 2016
cd84456
Merge branch 'master' into feature/trello
jthandy Mar 1, 2016
ff29b89
remove trello boilerplate, add card_location model
Mar 5, 2016
bee418c
Merge branch 'feature/trello' of github.com:analyst-collective/models…
Mar 5, 2016
9045346
fix whitespace
Mar 5, 2016
9c8d17c
Merge branch 'interface-investigation' into feature/trello
Mar 5, 2016
fba2613
trello card location model
Mar 5, 2016
72c51f4
use commentCard actions for location updates
Mar 5, 2016
1314047
change trello test namespace
Mar 5, 2016
251f415
add trello model tests and have runner handle all .sql files
Mar 5, 2016
6927f01
basic email interface model for pardot
jthandy Mar 8, 2016
1490bfd
Merge pull request #13 from analyst-collective/clean-up-contributions
Mar 8, 2016
11b77fe
Merge branch 'master' into feature/trello
Mar 9, 2016
12de9d2
Merge pull request #14 from analyst-collective/feature/trello
Mar 9, 2016
3c7f282
lots of work on email analysis
jthandy Mar 9, 2016
8f95341
Merge branch 'master' into interface/email
jthandy Mar 9, 2016
f754016
Updated email analysis.
jthandy Mar 9, 2016
ccdacc8
drop and create schema in first step of runner
Mar 10, 2016
fa1afd9
throw error if schema already exists
Mar 11, 2016
3c210a9
Merge pull request #15 from analyst-collective/drop-create-schema
Mar 11, 2016
fd5d6be
ignore config
jthandy Mar 15, 2016
9651caf
Merge pull request #18 from analyst-collective/interface/email
Mar 16, 2016
98445a3
zuora models v1
Mar 16, 2016
958fdd2
zuora models v1
Mar 16, 2016
eafbbb0
zuora models v1
Mar 16, 2016
97d3ea2
zuora v2
Mar 16, 2016
2901e7f
updates for all files to use dbt syntax
jthandy Mar 16, 2016
559dc02
remove old shit
jthandy Mar 17, 2016
09e7cca
Merge branch 'remove-old-shit' into models/stripe
jthandy Mar 17, 2016
96dedda
Merge pull request #21 from analyst-collective/remove-old-shit
Mar 17, 2016
4d9017d
initial stripe model and analysis
jthandy Mar 17, 2016
2cdc0c8
minor changes to update plans
jthandy Mar 17, 2016
7fc4ab0
Merge branch 'master' of github.com:analyst-collective/models
Mar 17, 2016
1956334
zuora v1
Mar 17, 2016
a54b7d3
zuora v1
Mar 17, 2016
b36f605
zuora v1
Mar 17, 2016
aaa2248
Merge pull request #19 from 3mei/master
Mar 17, 2016
e4e2769
Merge remote-tracking branch 'origin/master'
jthandy Mar 17, 2016
c9b8c19
Magento Hello World.
mattmonihan Mar 18, 2016
a21d445
clean up
jthandy Mar 19, 2016
70dcf7e
update dbt project to reference correct models dir
jthandy Mar 19, 2016
1046a21
Renamed column references and changed schema from 'magento' to 'sampl…
mattmonihan Mar 20, 2016
2b1006b
Adjusted analysis query for magento. Got it running successfully.
mattmonihan Mar 20, 2016
fe770d5
Merge pull request #25 from mattmonihan/magento_analysis
Mar 21, 2016
c1548c7
lots of stripe debugging
jthandy Mar 21, 2016
a4b10c1
updates to analysis
jthandy Mar 22, 2016
7b613d3
Merge pull request #27 from analyst-collective/spring-cleaning
Mar 22, 2016
55f3600
changes after stripe code review
jthandy Mar 22, 2016
62fa124
small updates to analytical query
jthandy Mar 22, 2016
fc9b2ec
major refactor of stripe analysis in response to comments in PR
jthandy Mar 23, 2016
b662986
more stripe changes
jthandy Mar 24, 2016
8b19da9
fix renewal calc
jthandy Mar 24, 2016
077dd7d
moved magento models to correct folder
jthandy Mar 24, 2016
e60b8da
split out pardot models
Mar 25, 2016
5ee44df
add new pardot files
Mar 25, 2016
847bb21
fix pardot defs and split out magento models
Mar 25, 2016
4499b0d
split out emails
Mar 25, 2016
39f4ad8
split out snowplow and segment
Mar 25, 2016
8c8aad2
split out trello model
Mar 25, 2016
3caf061
restructure trello tests
Mar 25, 2016
a43a8a4
split out zuora models
Mar 25, 2016
7c799d8
update dbt project file
Mar 25, 2016
e6dff22
updates based on code review
jthandy Mar 25, 2016
677e586
final CR comments addressed
jthandy Mar 25, 2016
7caf174
Merge pull request #28 from analyst-collective/models/stripe
Mar 25, 2016
2b3ca79
update configs to render to table
Mar 28, 2016
7e52da5
Create License.md
Mar 29, 2016
b81947a
change emails to pardot_emails (for now)
Mar 29, 2016
fa314ee
move stripe models folder
jthandy Mar 29, 2016
18af13e
Merge branch 'master' into one-model-per-file
Mar 29, 2016
f3fef6a
fix master merge (model --> models)
Mar 29, 2016
934c6b2
delete model directory
Mar 29, 2016
dda5bff
dedent stripe code
Mar 29, 2016
ded33b2
specify model and table_or_view in config
Mar 29, 2016
d97835a
add back in test path
Mar 29, 2016
a005f2c
mixpanel v1
Mar 30, 2016
df8ece5
mixpanel v1
Mar 30, 2016
0c75a6a
added retention analysis
Mar 31, 2016
5128e03
* changes
Apr 1, 2016
1c3e80a
added event cohorts
Apr 1, 2016
b496e6d
Merge pull request #29 from analyst-collective/one-model-per-file
drewbanin Apr 3, 2016
7e5e431
remove old configs
Apr 3, 2016
b76a144
update example configs
Apr 3, 2016
5b14dbd
Merge branch 'master' of github.com:analyst-collective/models
Apr 4, 2016
598cd6b
updates to readme based on current workflow
jthandy Apr 5, 2016
3bb3fdf
Merge pull request #31 from analyst-collective/readme-v0.2
Apr 5, 2016
6c2a2c3
updated mixpanel models
Apr 5, 2016
305e67b
mailchimp v1
Apr 6, 2016
6adcd87
removed mixpanel from the mailchimp branch
Apr 6, 2016
3e623c7
added base models for every action and added gains and losses
Apr 8, 2016
97db355
updated gains_losses
Apr 11, 2016
4f10835
Merge pull request #33 from 3mei/mailchimp
Apr 11, 2016
ec70d01
updates to mailchimp analysis for blog post
jthandy Apr 14, 2016
0ff459c
updates based on pr
jthandy Apr 15, 2016
6e96fa5
spacing!
jthandy Apr 15, 2016
78a51da
Merge pull request #35 from analyst-collective/mailchimp-updates
Apr 15, 2016
3af6b34
don't explicity reference env.schema in model defs
May 3, 2016
0f2ca78
replace calls to load() with ref()
May 3, 2016
8ab283a
add package namespace
May 5, 2016
7d09716
make version a string
May 5, 2016
1c945f6
make name and version top-level
May 6, 2016
90eee62
Merge pull request #36 from analyst-collective/feature/namespaced-ref…
drewbanin May 6, 2016
dc67d37
moved all of these to their own individual repos
jthandy Jun 30, 2016
bcbf61a
clean up old code
jthandy Aug 1, 2016
0f9d14f
clean up old code
jthandy Aug 1, 2016
6ab9779
updated to reflect the new purpose of this repo
jthandy Aug 1, 2016
69ee6aa
reflect deprecation
jthandy Aug 1, 2016
b72a3bc
.
jthandy Aug 1, 2016
a5a8c8e
update readme to include new stripe repo
jthandy Sep 23, 2016
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
updated to reflect the new purpose of this repo
  • Loading branch information
jthandy committed Aug 1, 2016
commit 6ab97799889da7da83586efa948507db65ca187a
42 changes: 7 additions & 35 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,39 +1,11 @@
### analyst-collective/models
### analyst-collective/analytics

A collection of data models and corresponding analysis for common data sets in SQL. These models are designed to be portable across organizations with minimal configuration.
This repository serves as an index for various dbt-based analytics packages. Please add additional packages by submitting PRs.

### Design Principles
- [Snowplow](https://github.com/fishtown-analytics/snowplow)
- [Quickbooks](https://github.com/fishtown-analytics/quickbooks)
- [Zendesk](https://github.com/analyst-collective/zendesk)
- [Mailchimp](https://github.com/analyst-collective/mailchimp)

This repository contains two primary types of objects: data models and data analyses.

##### Models
- A model is a table or view built either on top of raw data or other models. Models are not transient; they are materialized in the database.
- Models are composed of a single SQL `select` statement. Any valid SQL can be used. As such, models can provide functionality such as data cleansing, data transformation, etc.
- All models are built to be compiled and run with [dbt](https://github.com/analyst-collective/dbt).
- Models can be configured in dbt to be materialized as either views or tables.
- Model files should go into `/models` and saved with a `.sql` extension.
- Each model should be stored in its own `.sql` file. The file name will become the name of the table or view in the database.
- Other models should be referenced with the `ref` function. This function will resolve dependencies during the `compile` stage. The only tables referenced without this function should be source raw data tables.
- Models should be designed to minimize the selection from raw data tables. This minimizes the amount of mapping end users of models will need to do when configuring them for their local environment.

##### Analysis
- Analyses are `.sql` files that can be executed within a database query tool.
- All analysis should be built on top of models, not raw data.
- All named fields in a given analysis should be named within a given model.
- Confining analysis in this way ensures portability of analysis across multiple environments.

##### Dependencies

- All projects can include dependencies to other projects. Dependencies


### Contributing
All contributions to this repository must be for analytics on top of standardized datasets. The current process for contributing is to:
- fork this repo,
- build a test dataset,
- make and test changes, and
- submit a PR.

PRs without accompanying datasets cannot be tested and therefore will not be accepted. We suggest you use [data-generator](https://github.com/analyst-collective/data-generator) to generate your test datasets.

We do not believe that this is the ideal workflow to facilitate the Analyst Collective vision for open source analytics. In the future, we plan to extend dbt to be a package manager. Once this is accomplished, you can own your own analytics repositories and publish them to a common index that others can use. We will update the contribution guidelines here once this is accomplished.
These packages are all installed and built using dbt. For additional information on dbt, go [here](https://github.com/analyst-collective/dbt).