Master QLP generation from qlp_parallel script #912

awhoward · 2024-06-04T19:55:40Z

Addresses PR #910

Please keep the branch open. I should have used another branch, but I'd just created this one and switched gears to work on this issue.

awhoward · 2024-06-04T22:26:37Z

@bjfultn -- check out the auto throttling behavior with the --load parameter.

"""
Script Name: qlp_parallel.py

Description:
This script uses the 'parallel' utility to execute the recipe called
'recipes/quicklook_match.recipe' to generate standard Quicklook data
products. The script selects all KPF files based on their
type (L0/2D/L1/L2/master) from the standard data directory using a date
range specified by the parameters start_date and end_date. L0 files are
included if the --l0 flag is set or none of the --l0, --2d, --l1, --l2
flags are set (in which case all data types are included). The --2d,
--l1, and --l2 flags have similar functions. The script assumes that it
is being run in Docker and will return with an error message if not.
If start_date is later than end_date, the arguments will be reversed
and the files with later dates will be processed first.

The --ncpu parameter determines the maximum number of cores used. If the
--load parameter (a percentage, e.g. 90 = 90%) is set to a non-zero value,
this script will be throttled so that no new files will have QLPs
processed until the load is below that value. Note that throttling works
in steady state; it is possible to overload the system with the first set
of jobs if --ncpu is set too way high. Also, the system runs with a
little higher load than commanded, e.g., if you want 90% load, set it for
80%.

Invoking the --print_files flag causes the script to print the file
names, but not compute Quicklook data products.

Arguments:
start_date Start date as YYYYMMDD, YYYYMMDD.SSSSS, or YYYYMMDD.SSSSS.SS
end_date End date as YYYYMMDD, YYYYMMDD.SSSSS, or YYYYMMDD.SSSSS.SS

Options:
--l0 Select all L0 files in date range
--2d Select all 2D files in date range
--l1 Select all L1 files in date range
--l2 Select all L2 files in date range
--master Select all master files in date range
--ncpu Number of cores used for parallel processing; default=10
--load Maximum load (1 min average); default=0 (only activated if !=0)
--print_files Display file names matching criteria, but don't generate Quicklook plots
--help Display this message

Usage:
python qlp_parallel.py YYYYMMDD.SSSSS YYYYMMDD.SSSSS --ncpu NCPU --load LOAD --l0 --2d --l1 --l2 --master --print_files

Examples:
./scripts/qlp_parallel.py 20230101.12345.67 20230101.17 --ncpu 50 --l0 --2d
./scripts/qlp_parallel.py 20240501 20240505 --ncpu 150 --load 90
"""

awhoward · 2024-06-05T03:24:15Z

@bjfultn -- let's chat about this before merging. There's a problem with the automatic load throttling. I tested it outside of Docker, but it doesn't work inside because parallel can't access load information in /proc in the container. I tried a few fixes, but couldn't fix the issue.

awhoward · 2024-06-05T20:44:54Z

Ready to merge as per our discussion, @bjfultn.

awhoward added 4 commits June 4, 2024 12:32

added master processing

3ed7645

added datecode sorting

6f6f7c1

allow do_reversed

ca003ea

autothrottling with --load parameter

c8af53f

awhoward added 3 commits June 4, 2024 15:31

remove print statement

676d395

added 0.5 sec delay between starting jobs

e803799

added 0.5 sec delay between starting jobs

1f5e044

awhoward added the don't merge label Jun 5, 2024

add warning message for --load when run in Docker

5d35b60

awhoward removed the don't merge label Jun 5, 2024

Merge branch 'develop' into AWH_flat_investigation

ed94aed

bjfultn merged commit 72a21a6 into develop Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Master QLP generation from qlp_parallel script #912

Master QLP generation from qlp_parallel script #912

awhoward commented Jun 4, 2024

awhoward commented Jun 4, 2024

awhoward commented Jun 5, 2024

awhoward commented Jun 5, 2024

Master QLP generation from qlp_parallel script #912

Master QLP generation from qlp_parallel script #912

Conversation

awhoward commented Jun 4, 2024

awhoward commented Jun 4, 2024

awhoward commented Jun 5, 2024

awhoward commented Jun 5, 2024