Discrepancies between ddf_utils.create_datapackage (Python) and validate-ddf -i (Node) #548
Comments
Hi @lapidus, thanks for reporting the issue to us! I agree that it's better to unify the behavior for both libraries. Here are my suggestions:
So both ddf_utils and ddf-validation are correct, but using the filename minus the extension is the recommended way. I suggest that ddf-validation change the
So I will add more default fields to create_datapackage. P.S @lapidus If you use
Thanks for answering Semio :)
@lapidus can you give steps for reproduction for issue 4 (version of |
We are using both Python (https://github.com/semio/ddf_utils) and JavaScript tooling to generate the datapackage.json with its ddfSchema property.
When running on the very same dataset, the Python-based generator results in a 50% larger datapackage.json file.
It would be interesting to hear your thoughts (@buchslava, @semio) about harmonising the two libraries. So far we have identified 4 differences in outcome:
1. Resource.name is encoded differently:
validate-ddf
"path": "ddf--entities--jurisdiction.csv",
"name": "jurisdiction"
ddf_utils
"path": "ddf--entities--jurisdiction.csv",
"name": "ddf--entities--jurisdiction"
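To make the difference in point 1 concrete, here is a minimal Python sketch that reproduces the two "name" values shown above for this one file. The function names short_name and stem_name are hypothetical, and this is not the actual code of either ddf_utils or validate-ddf; it only assumes that the short form is the last "--"-separated segment of the filename.

```python
from pathlib import Path


def short_name(path: str) -> str:
    """Hypothetical helper: last '--'-separated segment of the file stem.

    Matches the validate-ddf output shown above for this file only.
    """
    return Path(path).stem.split("--")[-1]


def stem_name(path: str) -> str:
    """Hypothetical helper: filename minus extension (the ddf_utils output above)."""
    return Path(path).stem


path = "ddf--entities--jurisdiction.csv"
print(short_name(path))  # jurisdiction
print(stem_name(path))   # ddf--entities--jurisdiction
```

The stem-based form stays unique as long as filenames are unique, whereas the short form could collide if two files happened to end in the same segment.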
2. The default datapackage.json properties differ
The JavaScript version typically adds more placeholders (such as title, license, author, version), whereas ddf_utils generates a bare minimum (name).
3. Python ddf_utils does not seem to work with multiple measures in one file?
ddf--datapoints--measure--measure--by--country--year.csv
4. Different files are excluded
The Python tools seem to do a better job when it comes to excluding files from ddf creation.
With validate-ddf -i, .DS_Store and .ipynb files were accidentally encoded into the datapackage.json file, whereas ddf_utils skipped over these.
Thanks for any pointers and ideas!
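For point 4, one possible filter is sketched below in Python. It assumes, based only on the filenames quoted in this issue, that a resource is a CSV file whose name starts with "ddf--". The helper is_ddf_resource is hypothetical and is not how either tool actually decides which files to include.

```python
from pathlib import PurePath


def is_ddf_resource(path: str) -> bool:
    """Hypothetical rule (an assumption): keep only CSV files named 'ddf--*'."""
    p = PurePath(path)
    return p.name.startswith("ddf--") and p.suffix == ".csv"


files = [
    ".DS_Store",
    "notes.ipynb",
    "ddf--entities--jurisdiction.csv",
    "ddf--concepts.csv",
]

# Keep only the files that look like DDF resources under the assumed rule.
resources = [f for f in files if is_ddf_resource(f)]
print(resources)
```

An allow-list like this skips .DS_Store and notebook files without having to enumerate every kind of stray file.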