Skip to content

Scalefree's tool to automatically generate Data Vault models for the dbt package "datavault4dbt" based on metadata.

License

Notifications You must be signed in to change notification settings

ScalefreeCOM/turbovault4dbt

Repository files navigation

Welcome to turbovault4dbt


What is TurboVault4dbt?

TurboVault4dbt is an open-source tool that automatically generates dbt models according to our datavault4dbt-templates. It uses a metadata input of Your Data Vault 2.0 from one of the supported databases and creates ready-to-process dbt-models.

What are the prerequisites to use TurboVault4dbt?

TurboVault4dbt requires a metadata analysis done by hand and stored in supported metadata storage. Furthermore, Python must be installed as TurboVault4dbt is a software written in Python.

To use the generated models, a dbt project is required. Additionally, our dbt package datavault4dbt must be used, because the dbt models are calling macros of this package.

How does my metadata needs to look like?

You can find DDL scripts and templates for the metadata tables and the Excel sheet here]((https://github.com/ScalefreeCOM/turbovault4dbt/tree/main/metadata_ddl)).

Your metadata needs to be stored in the following eight tables/worksheets:

Where can I store my metadata?

Currently, TurboVault4dbt supports metadata input from

  • Snowflake
  • BigQuery
  • Google Sheets
  • Excel
  • SQLite DB Files

Our developers are constantly working on adding new connectors for more databases.

How do I install TurboVault4dbt?

To install Turbovault4dbt, follow the instructions on this page.

How do I connect TurboVault4dbt with my metadata?

You can configure the connection to Your metadata storage in the config.ini. Further explanation for the configuration input can be found here.

How do I execute TurboVault4dbt?

To execute TurboVault4dbt, You need Python installed. Execute the script according to Your database, where Your metadata is stored e.g. Snowflake --> turbovault_snowflake.py, BigQuery --> turbovault_bigquery.py, and so on.

Then, a GUI will open that looks like this:

On the left side, you can select which object types you want to generate. These are:

The right side lists all available source objects inside the connected metadata storage. You can select as many of them as you like.

You can also generate the sources.yml, properties.yml or create a Data Vault Model Visualization by ticking each respective checkbox.

Now you can click on "start" and Turbovault4dbt will generate all necessary dbt models that work with datavault4dbt!

Releases

v1.1.0 (22.05.2023) - Current Version
v1.0.3 (16.02.2023)
v1.0.2 (13.02.2023)
v1.0.1 (30.01.2023)
v1.0.0 (26.01.2023)


Designed for

About

Scalefree's tool to automatically generate Data Vault models for the dbt package "datavault4dbt" based on metadata.

Resources

License

Stars

Watchers

Forks

Packages

No packages published