Trivial Extensible Job-submission system

Clusters typically come with job-submission and queueing systems. These systems handle a queue of jobs, which might spawn multiple nodes, have a priorities, dependencies, expected runtimes, deadlines...

Tej doesn't aim at doing any of that. It just allows you to submit a job to a single server, that will run it immediately, and allow you to check its status and get its results later on.

Of course, tej is extensible, which allows you to add some queueing and scheduling abilities should you want to.

The goal of tej is to be usable without having to configure the server beforehand; it will setup the structure it needs on the server on the first run if necessary (in its simplest form, a ~/.tej directory on the server, that will contain the jobs).

Usage

Sets up tej on the server (optional, else it gets setup on the first run, with default options):

$ tej setup [email protected] \
    --queue /scratch/tejqueue \
    --make-link ~/.tej \
    --plugin default

This takes a destination to SSH into, the location of tej's directory (there can be several on a server; by default, ~/.tej is used), --make-link creates a link so that future invocations will be redirected to /scract/tejqueue, and --plugin selects which plugins to setup on the server (since tej is extensible, other scheduling/running subsystems might be added in the future).

Submit a simple job:

$ tej submit [email protected] myjobdir
Job submitted as:
myjobdir_user_123456

Here myjobdir is assumed to have the default layout, and no metadata is added. The directory will be uploaded in its entirety, and start.sh will be run.

Submit a job explicitely:

$ tej submit [email protected] --queue=/scratch/tejqueue \
    --id example_job \
    --script bin/jobinit \
    myjobdir
Job submitted as:
example_job

Get the status of a job:

$ tej status [email protected] --id myjobdir_user_123456
Job is still running (1:28:57)
$ tej status [email protected] --queue=/scratch/tejqueue \
    --id example_job
Job is finished (1:30:01)
$ tej status [email protected] --id myjobdir_user_567890
No job 'myjobdir_user_567890'

Download the output from a finished job:

$ tej download [email protected] --id myjobdir_user_123456 \
    output/log.txt
$ tej download [email protected] --id myjobdir_user_123456 \
    results.csv view.png input.bin

Note that there is no need for the file to be an output. The files are downloaded to the current directory.

Kill a running job:

$ tej kill [email protected] --id example_job
Job 'example_job' has already completed
$ tej kill [email protected] --id myjobdir_user_123456
Job 'myjobdir_user_123456' killed
$ tej kill [email protected] --id myjobdir_user_567890
No job 'myjobdir_user_567890'

Cleanup a finished job:

$ tej delete [email protected] --id example_job
Deleted job 'example_job'

Note that this is still alpha software. The command-line interface, in particular, is likely to evolve. Feel free to give me your opinion on it or direct me your feature requests/patches on Github.

Name

"tej" /tɛʒ/ is French slang for throwing/casting. It's intended here to be used as a verb ("let me tej it to the server...", "Is it done yet? I tej'd that yesterday!"). Probably not the best name, but it wasn't taken, and it's short.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.travis		.travis
tej		tej
tests		tests
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
README.rst		README.rst
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trivial Extensible Job-submission system

Usage

Name

About

Releases

Packages

Languages

License

rexissimus/tej

Folders and files

Latest commit

History

Repository files navigation

Trivial Extensible Job-submission system

Usage

Name

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages