-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Partitioning script does not complete, ends up producing mostly *.mms.data visibility files #63
Comments
Hi @Sam-Legodi, that's strange. Are you trying to run the pipeline in a directory in which it was already run? You should not need to overwrite the |
Hi @Jordatious . I've ran the pipeline with and without changing the "vis" parameter manually and I still produced similar behaviour as in my query above. For some reason, partitioning doesn't always "finish running" and errors out with the above errors while some partitioned data will be produced successfully. Maybe my resource request via the config may be causing an issue? The "slurm" section of my main config files generally are something like:
... Running "$./findErrors.sh " in the working directory of one of my last attempts to run the master version pipeline gave:
... |
I have been trying to use the master version of the pipeline (via: source /idia/software/pipelines/master/setup.sh) but have been met with errors mostly at the partitioning stage of the pipeline. So far, I've tried running the pipeline on several observations of similar datasets with slightly varying results. The common issue is that the partitioning stage results in *.mms.data visibility files instead of *.mms files in the sub-SPW subdirectories that are specified by the "spw" parameter in the config file. Sometimes I have had successful partitioning of data and sometimes not, leading to my confusion and unsuccessful attempts at a workaround. The partitioning stage error logs report errors like the one near the following log text:
"
2024-03-18 14:10:13 INFO msmetadata_cmpt.cc::open Performing internal consistency checks on /idia/raw/meerkat-cal/EXT-20210318-RT-01/1691421383/1691421383_sdp_l0.ms...
2024-03-18 14:10:16 INFO MSMetaData::_computeScanAndSubScanProperties Computing scan and subscan properties...
2024-03-18 14:10:22 INFO mstransform::::casa ##########################################
2024-03-18 14:10:22 INFO mstransform::::casa ##### Begin Task: mstransform #####
2024-03-18 14:10:22 INFO mstransform::::casa mstransform( vis='/idia/raw/meerkat-cal/EXT-20210318-RT-01/1691421383/1691421383_sdp_l0.ms', outputvis='1691421383_sdp_l0.1299
1350MHz.mms', createmms=True, separationaxis='scan', numsubms=62, tileshape=[0], field='', spw='*:12991350MHz', scan='', antenna='*&', correlation='', timerange='', intent='', array='', uvrange='', observation='', feed='', datacolumn='DATA', realmodelcol=False, keepflags=True, usewtspectrum=True, combinespws=False, chanaverage=False, chanbin=1, hanning=False, regridms=False, mode='channel', nchan=-1, start=0, width=1, nspw=1, interpolation='linear', phasecenter='', restfreq='', outframe='', veltype='radio', preaverage=False, timeaverage=False, timebin='0s', timespan='', maxuvwdistance=0.0, docallib=False, callib='', douvcontsub=False, fitspw='', fitorder=0, want_cont=False, denoising_lib=True, nthreads=4, niter=1, disableparallel=False, ddistart=-1, taql='', monolithic_processing=False, reindex=True )2024-03-18 14:10:23 INFO ParallelDataHelper::::casa Analyzing MS for partitioning
2024-03-18 14:36:52 INFO ParallelDataHelper::::casa 15 subMSs failed to be created. This is not an error, if due to selection when creating a Multi-MS
2024-03-18 14:36:52 WARN ParallelDataHelper::go::casa Error post processing MMS results /idia/raw/meerkat-cal/EXT-20210318-RT-01/1691421383/1691421383_sdp_l0.ms: [Errno 39] Directory not empty: '/idia/projects/meerkat-cal/process/Sam/1691421383_run14/1299
1350MHz/1691421383_sdp_l0.12991350MHz.mms.data/1691421383_sdp_l0.12991350MHz.mms.0007.ms' -> '/idia/projects/meerkat-cal/process/Sam/1691421383_run14/12991350MHz/1691421383_sdp_l0.12991350MHz.mms.data/1691421383_sdp_l0.12991350MHz.mms.0000.ms'2024-03-18 14:36:52 INFO mstransform::::casa Task mstransform complete. Start time: 2024-03-18 16:10:22.129315 End time: 2024-03-18 16:36:52.293223
2024-03-18 14:36:52 INFO mstransform::::casa ##### End Task: mstransform #####
2024-03-18 14:36:52 INFO mstransform::::casa ##########################################
"
OTHER error messages that I've seen are similar to the following:
"
2024-03-18 14:10:30 SEVERE mstransform::::casa::MPIServer-4 Task mstransform raised an exception of class OSError with the following message: Output MS /idia/projects/meerkat-cal/process/Sam/1691421383_run14/880
933MHz/1691421383_sdp_l0.880933MHz.mms.data/1691421383_sdp_l0.880~933MHz.mms.0003.ms already exists - will not overwrite it."
ANOTHER thing I've noticed is that the "vis" parameter in the sub-SPW subdirectory config files is set to the raw data visibility file that is not writable, is this normal behaviour? I have been manually changing these to the name of the .mms file for each specific sub-SPW directory. I do this before submitting my jobs. I've also ran the pipeline without doing this and have seen cases where some of the scripts try to write to the raw data .ms file which is not supposed to happen.
I've attached a typical config file I use for your perusal, if needed.
1691421383_run14-default_config.txt
The text was updated successfully, but these errors were encountered: