Datasets

This project is part of Datasets toolkit.

Running server (https://github.com/tivvit/datasets_server) is needed for this project to be useful.

Install

pip install datasetstools

CLI

datasets command is provided after the installation

Usage

It is recommended to configure the server address first. You may always provide it with -s.

config

datasets config
Server address (example.com): localhost
Port: 8000

The configuration will be saved to ~/.datasets and will be used also by the python library.

new

Generate new UID for the data set and creates file dataset.yaml with prefilled structure.

scan

Rescan the data sets.

info, usages, chagelog

info shows all the information about the data set. The data set is recognized based on dataset.yaml which is searched bottom-up.

usages shows only usages and changelog only the changelog respectively

Lib

Python library for interacting with the Datasets.

Init

from datasets import Datasets

ds = Datasets()
# Without args the address in ~/.datasets will be used or {"addres": "http:localhost:5000"} may be used

Info

Returns information about the data set identified by the UID. Second param - usage

ds.info("8b88a424-dbd8-4032-8be7-a930a415b9a5", {"user": "tivvit"})

Paths

Returns list of paths where the data set may be found. Second param - usage

ds.paths("8b88a424-dbd8-4032-8be7-a930a415b9a5", {"user": "tivvit"})
# ["/data/a", "/data/b"]

Create

Creates data set in the database. Useful for pragmatical data set creation.

data - dict with the data set attributes
path - path where should the dataset.yaml should be created (optional).

Returns data set UID.

ds.create(data={"name": "Best DS", ...}, path="")
# "8b88a424-dbd8-4032-8be7-a930a415b9a5"

Usage log

Actions are logged to the usage log, the second parameter is optional and will be stored in the usage log.

Development

Feel free to contribute.

Copyright and License

Released under MIT license

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
datasets		datasets
Dockerfile		Dockerfile
LICENSE		LICENSE
example.py		example.py
readme.md		readme.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Datasets

Install

CLI

Usage

config

new

scan

info, usages, chagelog

Lib

Init

Info

Paths

Create

Usage log

Development

Copyright and License

About

Releases

Packages

Contributors 2

Languages

License

datasets-org/datasets

Folders and files

Latest commit

History

Repository files navigation

Datasets

Install

CLI

Usage

config

new

scan

info, usages, chagelog

Lib

Init

Info

Paths

Create

Usage log

Development

Copyright and License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages