Auto Generate `environment.yml` #339

TimothyWillard · 2024-10-10T18:52:28Z

Describe your changes.

This pull request adds a script called build/create_environment_yml.R to generate a consistent conda environment.yml file that can be used to install the rest of flepiMoP into. It also runs this script in a GitHub action that will automatically regenerate the file when one of the dependencies has changed.

What does your pull request address? Tag relevant issues.

This pull request takes another step towards GH-191 by extracting the conda environment file portion of GH-329's build/hpc_install.sh into it's own thing.

Tag relevant team members.

n/a at the moment, draft PR

Heavily inspired by the original `batch/slurm_init.sh` script. The init script is a run once script that takes care of installation of dependencies and setup whereas prerun sets env vars needed per a run.

Initial version of the HPC install script, some what inspired by the slurm init script.

* Changed how the R arrow version is formatted for readability. * Changed the final output command to print diagnostic info correctly.

Added slurm's --partition flag to the `batch/inference_job_launcher.py` script for usage on UNC's Longleaf cluster.

The longleaf specific init/pre-run scripts are now surpassed by the generic `build/hpc_install.sh` script.

Remove the --partition flag for the slurm partition to use from the inference job launcher script. This will be handled in a new flepiscripts script.

* Changed `flepiMoP` git clone to use ssh instead of http to allow for edits from HPC. * Add `set -e` to error clearly on a command failure. * Install `gempyor` from cloned `flepiMoP` repo directly, yet to do the same for R packages.

Swapped out manual install from GitHub with the `build/setup.R` script which handles a few dependencies and installs CLIs as executables.

In light of disucssion about how directories are structured on longleaf split out the project path into a work directory. Still need to do the same for rockfish, for now assuming work and user dirs are the same there.

* Keep the work directory as where project is supposed to go. * Move flepiMoP source and conda env to $HOME.

Out sourced the per run setup into `build/flepi_init.sh` so users are not forced to update/reinstall just to run.

Use `devtools::install` for `install.packages` for better handling of source package installs.

The install script throws unexpected warnings about being unable to install arrow, even though arrow is already installed by conda.

The `inference::install_cli` function now installs to the bin folder provided by the conda environment. Co-authored-by: Carl A. B. Pearson <[email protected]>

Users now must call the init script themselves.

Moved R dependencies install from individually for each of the custom R packages into the conda environment. This should alleviate warnings relating to arrow install and streamline the install of R dependencies.

Add an explicit call to `inference::install_cli()` after flepiMoP custom R packages.

The covidcast package is not available through conda-forge, so has to be installed through CRAN.

* Downgrade R to 4.3 to resolve r-MMWRweek. * Correct directory change and missing repo errors.

Custom R script to generate a consistent conda `environment.yml` file.

* Added dnachun to the channels for osx-arm64 builds of r-truncnorm and r-ggraph. * Added an explicit r-sf dependency for covidcast that has to be installed manually outside of the environment file. * Bug fix to check if file exists before comparing.

Custom GitHub action to run the `build/create_environment_yml.R` script and then add those changes to a pull request if there are any.

Change the ref from a particular commit to the branch of the PR so the HEAD is attached.

Make it clear that the commit is made by a GitHub action by putting that in the commit message title.

jcblemai · 2024-10-11T10:54:21Z

Any reason for PR close while keeping the branch ? Hard to do ? or something for later

TimothyWillard · 2024-10-11T12:05:26Z

No, just forgot to do so while working on this yesterday.

TimothyWillard and others added 30 commits September 13, 2024 12:27

Copy slurm_init.sh to slurm_init_longleaf.sh

aa0c077

Restore slurm_init.sh

22cb130

Merge branch 'copy-file' into GH-191/longleaf-batch-submission

90e29b2

Added UNC Longleaf Specific Init/Prerun Scripts

c1d54ef

Heavily inspired by the original `batch/slurm_init.sh` script. The init script is a run once script that takes care of installation of dependencies and setup whereas prerun sets env vars needed per a run.

Draft implementation of HPC install script

e238d66

Initial version of the HPC install script, some what inspired by the slurm init script.

Minor changes to hpc_install.sh

2436640

* Changed how the R arrow version is formatted for readability. * Changed the final output command to print diagnostic info correctly.

Added slurm --partition flag to inference script

bba583a

Added slurm's --partition flag to the `batch/inference_job_launcher.py` script for usage on UNC's Longleaf cluster.

Initial pass at HPC install on rockfish

2c8c952

Remove longleaf specific slurm scripts

194d2e1

The longleaf specific init/pre-run scripts are now surpassed by the generic `build/hpc_install.sh` script.

Remove --partion flag

f76d68a

Remove the --partition flag for the slurm partition to use from the inference job launcher script. This will be handled in a new flepiscripts script.

Minor updates to hpc_install.sh

8af3698

* Changed `flepiMoP` git clone to use ssh instead of http to allow for edits from HPC. * Add `set -e` to error clearly on a command failure. * Install `gempyor` from cloned `flepiMoP` repo directly, yet to do the same for R packages.

initial tweaks to make flepimop-inference-* runnable

23d9de1

further install scripts fixes

9ac9cf0

fix reinvocation of inference-slot

d32a58c

initial installation for ubuntu re-org

9f2c085

updates addressing use of installed r scripts

e2bc41f

README revs

3095ae4

add arrow installation

1b33dc3

Switch R pkg install to use build/setup.R

ec0c479

Swapped out manual install from GitHub with the `build/setup.R` script which handles a few dependencies and installs CLIs as executables.

Add $WORKDIR to hpc_install.R

e754364

In light of disucssion about how directories are structured on longleaf split out the project path into a work directory. Still need to do the same for rockfish, for now assuming work and user dirs are the same there.

Add missing flepi path arg to setup.R

5e4f398

Force pin arrow version between python and R

0d9f813

Change rockfish default directories

9ca12ed

* Keep the work directory as where project is supposed to go. * Move flepiMoP source and conda env to $HOME.

Split hpc_install.sh into init and install

f78ee75

Out sourced the per run setup into `build/flepi_init.sh` so users are not forced to update/reinstall just to run.

Use devtools::install in setup.R

6d69186

Use `devtools::install` for `install.packages` for better handling of source package installs.

Unset error exit around R pkg install

7adbdfa

The install script throws unexpected warnings about being unable to install arrow, even though arrow is already installed by conda.

Add set +e an exit to flepi_init.sh

ae6666f

install_cli installs to conda bin

f3fb1a7

The `inference::install_cli` function now installs to the bin folder provided by the conda environment. Co-authored-by: Carl A. B. Pearson <[email protected]>

Remove init call from install script

e443626

Users now must call the init script themselves.

Remove old version restrictions, add optparse

480bfd7

TimothyWillard and others added 14 commits October 9, 2024 13:23

Move R deps install into conda environment

385eeb5

Moved R dependencies install from individually for each of the custom R packages into the conda environment. This should alleviate warnings relating to arrow install and streamline the install of R dependencies.

Readd inference CLI install

f0571ae

Add an explicit call to `inference::install_cli()` after flepiMoP custom R packages.

Update example command to use installed CLI

53f2f49

Manually install covidcast package

ec7978d

The covidcast package is not available through conda-forge, so has to be installed through CRAN.

Downgrade r-base dependency to 4.3

2e6dfca

* Downgrade R to 4.3 to resolve r-MMWRweek. * Correct directory change and missing repo errors.

Remove symlinks if exists on reinstall

42259a2

Script to generate environment.yml

aed8442

Custom R script to generate a consistent conda `environment.yml` file.

GitHub action to generate environment.yml

b0685c5

Custom GitHub action to run the `build/create_environment_yml.R` script and then add those changes to a pull request if there are any.

Remove unneeded comment

a9e9f47

Merge main into GH-191/auto-generate-environment.yml

9acda2c

GitHub action checkout with attached head

ca10838

Change the ref from a particular commit to the branch of the PR so the HEAD is attached.

Update environment.yml

6ad753d

Make clear source of environment.yml commit

27a80f7

Make it clear that the commit is made by a GitHub action by putting that in the commit message title.

TimothyWillard closed this Oct 10, 2024

TimothyWillard deleted the GH-191/auto-generate-environment.yml branch October 11, 2024 12:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto Generate `environment.yml` #339

Auto Generate `environment.yml` #339

TimothyWillard commented Oct 10, 2024

jcblemai commented Oct 11, 2024

TimothyWillard commented Oct 11, 2024

Auto Generate environment.yml #339

Auto Generate environment.yml #339

Conversation

TimothyWillard commented Oct 10, 2024

Describe your changes.

What does your pull request address? Tag relevant issues.

Tag relevant team members.

jcblemai commented Oct 11, 2024

TimothyWillard commented Oct 11, 2024

Auto Generate `environment.yml` #339

Auto Generate `environment.yml` #339