Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* merge with master * [infra] adds upload header to storage and define more modes (#795) * feat(infra): adds upload header to storage * fix(infra): replace if header exists in storage * fix(infra): fix table_id in storage * feat(infra): add mode `architecture` * feat(infra): adjust mode * feat(infra): add new modes * feat(infra): adjust mode all * feat(infra): adjust mode all * feat(infra): change table-approve bd version * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): trigger table-approve * feat(infra): add the option to use bqstorage api (#847) * [infra] Add validate module [2] (#675) * draft validate.py * adding more validate_features * fix attributes and define some helper functions to cut repetition * improve storage upload exception * swipe dataset_id and table_id in Table * adds __init__ to modules * creates single function to generate metadata * redesign metadata API to be more intuitive * add metadata create to CLI * expose Metadata to users * fix small typos * add yaml generator * improves exception * adds ckanapi to reqs * Fix comment_treatment metadata function * Fix BaseDosDadosException imports * Fix dataset init file checks * Raise Exception in case of inconsistent metadata YAML order * cria configs a partir do Metadata * get rid o validate.py * configs come with data * add columns to table_config.yaml * Add tests to metadata module * Delete dataset_config.yaml * Refactor test_metadata * Improve metadata create tests * Add more table metadata create tests * Update metadata create docstring * Add all test_metadata tests placeholders * Add tests for Metadata is_updated method * First working version of Metadata validate method * Add Metadata validate tests * Improve metadata validate and its tests * Add metadata is_updated CLI entrypoint * Add Metadata validate CLI entrypoint * First metadata publish version * Fix metadata create/is_updated bugs, improve validate tests * Fix metadata's test_create_if_exists_pass * Refactor metadata code, improve validate method * Add metadata publish CLI entrypoint * Fix publish bugs, add resource_patch for bdm_table patches * Add response return value to publish, improve exceptions * Add metadata publish tests * Improve metadata publish and validate docstrings * Add partition_columns option to metadata create * Call is_updated before publish * Fix partitions_writer_condition * Fix ckan_data_dict * Update CKAN_URL * Integrate Table.create and Metadata * Fix YAML generation for array fields * feat(infra): adds _make_publish_sql * fix(infra): add partition columns to be created, fix dataset_id in dataset and autofill type from arq sheet * fix(infra): back patitions to str * fix(infra): enhance organization metadata validation * fix(infra): YAML complex fields are generated even if there is no data available * feat(infra): add extras field to dataset validation * fix(infra): clean spaces and put comma * fix(infra): partitions from string to list in _is_partitioned function * fix(infra): fix table_description.txt for tb.publish() * fix(infra): improve update_columns doc string * fix(infra): point metadata.CKAN_URL to staging website * fix(infra): handle new dataset/table case in Metadata.is_updated * Make CKAN_API_KEY and CKAN_URL come from config.toml * bump pyproject to 1.6.0-a0 * Add ckan config variables builder * Add default ckan config to configs/config.toml * Raise error in case of no CKAN_API_KEY when publishing * fix(infra): update ruamel.yaml and python dependencies * fix(infra): base initiation, migrate ckan_url and api_key to __init__ * fix(infra): handle ckan config None values * fix(infra): handle_complex_fields get correct data * feat(infra): improve update_columns * feat(infra): improve update_columns * fix(infra): change coluna to nome * bump to 1.6.0a4 * fix(infra): bump to 1.6.0a5 * fix(infra): force utf-8 in all open methods * feat(infra): release 1.6.0a6 * fix(infra): fix update_columns encoding * feat(infra): pump version 1.6.0a7 * Add extra dataset metadata fields for validation * Improve metadata validation * fix(infra): refactor metadata's ckan_data_dict * fix(infra): remove input_hints from YAMLs * fix(infra): shrink organization dataset YAML field * feat(infra): bump to version 1.6.0-alpha.8 * feat(infra): add test_create_force_columns_is_true metadata test * feat(infra): refactor metadata tests, add test_force_columns_is_false * feat(infra): refactor metadata tests * feat(infra): add partition_columns tests * fix(infra): refatora o pacote metadata (#826) * fix(infra): refatora o pacote metadata * fix(infra): adiciona parte da refatoração * fix(infra): corrige erros da refatoração * feat(infra): adiciona suporte ao comando 'python -m' * feat(infra): adiciona opção de versão * feat(infra): formata o código com black * fix(infra): corrige uns testes e comenta outros * fix(infra): nullify yaml's partitions in case of not-None empty values * fix(infra): fix Metadata.publish tests, remove debugging code * feat(infra): make creation of table_config.yaml only optional * fix(infra): make Metadata.validate work with new datasets and tables * feat(infra): make Metadata.publish handle new datasets or tables * fix(infra): create all dataset files * fix(infra): draft new dataset_description.txt * fix(infra): make table.py work with new YAML, refactor and fix tests * fix(infra): handle non-defined variables for dataset_description.txt template * refactor(infra): make Table and Dataset use Metadata as a component * fix(infra): add gcloud variables to YAML through config.toml * feat(infra): bump to 1.6.0-a9 * fix(infra): adiciona verificação de organização (#869) * fix(infra): adiciona verificação de organização * fix(infra): formatação com black * fix(infra): altera nome do trigger de data checks * feat(infra): rascunho da action de metadata checks * Revert "fix(infra): adiciona verificação de organização (#869)" This reverts commit c82d70a. * fix(infra): bring back all dataset_config.yaml fields to ckan_data_dict * fix(infra): ordena as bibliotecas * fix(infra): corrige formatação * fix/validate: corrige validate e adiciona actions (#876) * fix(infra): adiciona verificação de organização * fix(infra): formatação com black * fix(infra): altera nome do trigger de data checks * feat(infra): rascunho da action de metadata checks * [dados-fix] Sobe INPC (#879) * feat(docs): clarifications on partitions, temporal_coverage, suffixes. (#846) * fix(infra): inicio das correções dos testes * fix(infra): inicio das correções dos testes * fix(infra): mais alterações nos testes * [dados-bot] br_ms_vacinacao_covid19 (2021-10-18) (#884) Co-authored-by: terminal_name <github_email> * [dados-bot] br_ms_vacinacao_covid19 (2021-10-19) (#888) Co-authored-by: terminal_name <github_email> * [dados-atualizacao] br_anp_precos_combustiveis (#883) * atualiza dados dos preços de combustiveis * corrige erro de português no table_description * fix(infra): corrige ordenação das bibliotecas * fix(infra): corrige sintaxe nova Co-authored-by: Gustavo Aires Tiago <[email protected]> Co-authored-by: Ricardo Dahis <[email protected]> Co-authored-by: Lucas Moreira <[email protected]> * Revert "fix/validate: corrige validate e adiciona actions (#876)" This reverts commit 2d3fa09. * Revert "fix(infra): corrige formatação" This reverts commit cb19f31. * Revert "fix(infra): ordena as bibliotecas" This reverts commit 698db35. * Revert "Merge branch 'python-1.6.0' into add_validate_module_2" This reverts commit 9c305f2, reversing changes made to aee8c2a. * feat(infra): add support for organization metadata * fix(infra): complete all functions and methods docstrings * docs(infra): add metadata entrypoints walkthrough to docs Co-authored-by: hellcassius <[email protected]> Co-authored-by: joaoc <[email protected]> Co-authored-by: d116626 <[email protected]> Co-authored-by: Vinicius Aguiar <[email protected]> Co-authored-by: Gustavo Aires Tiago <[email protected]> Co-authored-by: Ricardo Dahis <[email protected]> Co-authored-by: Lucas Moreira <[email protected]> * fix(infra): use basedosdados-dev for inexistent dataset test * fix(infra): update bases with master files * feat(infra): pump version * fix(infra): update click dependency * fix(infra): force setup.py to use click==8.0.3 * feat(infra): add new modes to cli help * Update colab_data.md * fix(infra): fix none in _load_schema * fix(infra): fix the case when table are added for the first time * feat(infra): pump version * fix(infra): try to fix merge conflicts * fix(infra): fix data-check to master * fix(infra): fix data-check to master * feat(infra): add url and api_key in env action * fix(infra): remove space in env-setup * feat(infra): add metadata validate action * fix(infra): change actions bd version * feat(infra): trigger md validate * fix(infra): change action trigger * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve, pump version * feat(infra): test table-approve, pump version * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): test table-approve * feat(infra): pusblish rais * feat(infra): pusblish rais * feat(infra): pusblish rais * feat(infra): pusblish rais * feat(infra): pusblish rais * fix(infra): fix _load_schema and publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * feat: updates diretorio_escola, closes #921 * fix(infra): publish rais * fix(infra): publish escolas * fix(infra): remove lint check * fix(infra): try to use storage retry policy * fix(infra): tb-approve bd version * fix(infra): pump storage version * fix(infra): add conditional retry * fix(infra): add conditional retry * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais * fix(infra): publish rais vinculos * fix(infra): publish rais vinculos * fix(infra): change metadata-validate trigger' * fix(infra): adjust copy_table * fix(infra): change metadata-validate trigger * fix(infra): change table approve logs * fix(infra): change metadata-validate logs * fix(infra): change tb-app mode order * fix(infra): change tb-app logs * feat(infra): reactivate actions * feat(infra): change action logs * feat(infra): deactivate data-check * feat(infra): change actions logs * feat(infra): change actions logs * feat(infra): republish rais * fix(infra): improve validate metadata tests * tests(infra): add test for invalid organization entry * feat(infra): add --all and --if_exists args to publish * feat(infra): bump to 1.6.0-b20 * fix(infra): prepare data-check * fix(infra):test tb-app * fix(infra): change data-check action * fix(infra): test data-check * fix(infra): test data-check * fix(infra): test data-check * fix(infra): change ci trigger * fix(infra): change ci trigger * fix(infra): change ci trigger * fix(infra): change ci trigger * fix(infra): change ci trigger * fix(infra): test data-check * docs(infra): add --all cli option docs * fix(infra): debug data-check * fix(infra): fix data-check ckan api env variable * debug(infra): verify data-check env variables * debug(infra): fix getenv * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * feat(infra): test data-check * debug(infra): test runtime env variables * debug(infra): try os.environ.get for data-check * debug(infra): test cache for data-check * fix(infra): revert data-check changes * fix(infra): data-check original trigger * fix(infra): data-check original envs * fix(infra): deactivate tb-app branch trigger * fix(infra): update docs folder based on master branch * feat(infra): add update_locally option to metadata publish * feat(infra): add update_locally to metadata publish cli * solves #issue-181 * (feat) infra:COMMIT FINAL CARALHO!!!!! Co-authored-by: d116626 <[email protected]> Co-authored-by: Vítor Mussa <[email protected]> Co-authored-by: hellcassius <[email protected]> Co-authored-by: Vinicius Aguiar <[email protected]> Co-authored-by: Gustavo Aires Tiago <[email protected]> Co-authored-by: Ricardo Dahis <[email protected]> Co-authored-by: Lucas Moreira <[email protected]> Co-authored-by: rdahis <[email protected]>
- Loading branch information