Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) #2102

mbatllem · 2025-02-05T09:15:28Z

Hello,

I'm not sure if this issue is related to the workflow or the AS, but I suspect it is more related to AS.

I'm using AS 4.1.11 and WF 4.2.0 on MN5.

In my experiment a1yv, the last chunk was apparently successfully COMPLETED I can see this from the model logs and also from the file:
/gpfs/scratch/ehpc01/bsc998159/a1yv/LOG_a1yv/a1yv_19900101_fc0_332_SIM_COMPLETED.

However, when I run autosubmit monitor a1yv or check the AS GUI, this same SIM job still appears as RUNNING. The experiment crashed, outputting the following in the nohup:

[CRITICAL] Scheduler is not installed. [eCode=7052]

I would like to continue the experiment ASAP. Would it be okay if I change the job status from RUNNING to WAITING and re-submit? Or would this prevent you from investigating the root cause of the issue?

The text was updated successfully, but these errors were encountered:

LuiggiTenorioK · 2025-02-05T09:18:50Z

I think this is related to Autosubmit and not the API. Transferring the issue ➡➡➡

mbatllem · 2025-02-05T09:23:59Z

Oh, you’re right! I'm sorry, I actually thought I was writing in the AS repo. Thanks @LuiggiTenorioK

dbeltrankyl · 2025-02-05T09:29:25Z

Hello @mbatllem

It is not shown as completed in the GUI/API or autosubmit monitor because the Autosubmit instance is stopped.

If you haven't prompted recovery or setstatus commands yet, just doing the autosubmit run $expid should be enough for Autosubmit to continue the run. And it is the recommended way of doing it

It also should be fine to set it to COMPLETED or even perform an autosubmit recovery $expid -s ( --all is not needed there)

The error is strange, tho. That shows up when the command ( sacct squeue... ) is not found in the remote. Maybe the platform had some weird error in which the slurm was not detected. Just resume the experiment and we'll see if it still happens.

dbeltrankyl · 2025-02-05T09:30:13Z

Also you don't need to resubmit the job as it is completed

dbeltrankyl · 2025-02-05T09:32:38Z

I'll update the issue title to

Scheduler not installed raises an Autosubmit Critical in the middle of the run.

I think we need to change this critical raise to only pop-up when you try to connect to the platforms, if it happens in the middle of the run, it should be an error raise so Autosubmit can reconnect to the platform.

mbatllem · 2025-02-05T09:37:13Z

Thank you for your quick responses!

mbatllem · 2025-02-05T10:09:14Z

Hello again, apparently this also happened here: /gpfs/scratch/ehpc01/bsc998159/a236/LOG_a236/a236_19900101_wf_5_LRA_GENERATOR_COMPLETED

LuiggiTenorioK transferred this issue from BSC-ES/autosubmit-api Feb 5, 2025

dbeltrankyl added this to the 4.1.13 milestone Feb 5, 2025

dbeltrankyl changed the title ~~SIM job COMPLETED but monitored as RUNNING~~ Scheduler not installed raises an Autosubmit Critical in the middle of the run. Feb 5, 2025

dbeltrankyl changed the title ~~Scheduler not installed raises an Autosubmit Critical in the middle of the run.~~ Scheduler not installed raised as an Autosubmit Critical in the middle of the run. Feb 5, 2025

dbeltrankyl changed the title ~~Scheduler not installed raised as an Autosubmit Critical in the middle of the run.~~ Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be Autosubmit Error ) Feb 5, 2025

dbeltrankyl changed the title ~~Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be Autosubmit Error )~~ Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) Feb 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) #2102

Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) #2102

mbatllem commented Feb 5, 2025

LuiggiTenorioK commented Feb 5, 2025

mbatllem commented Feb 5, 2025

dbeltrankyl commented Feb 5, 2025

dbeltrankyl commented Feb 5, 2025 •

edited

Loading

dbeltrankyl commented Feb 5, 2025

mbatllem commented Feb 5, 2025

mbatllem commented Feb 5, 2025

Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) #2102

Scheduler not installed raised as an Autosubmit Critical error in the middle of the run. ( Should be an Autosubmit Error ) #2102

Comments

mbatllem commented Feb 5, 2025

LuiggiTenorioK commented Feb 5, 2025

mbatllem commented Feb 5, 2025

dbeltrankyl commented Feb 5, 2025

dbeltrankyl commented Feb 5, 2025 • edited Loading

dbeltrankyl commented Feb 5, 2025

mbatllem commented Feb 5, 2025

mbatllem commented Feb 5, 2025

dbeltrankyl commented Feb 5, 2025 •

edited

Loading