Re: Problem regarding submitting job to cluster

Posted: Fri Aug 05, 2022 3:53 am
by biodc172
Yvan Fournier wrote: Mon Aug 01, 2022 5:53 pm
Hello,

I'll check the code. We have never encountered this issue, but I wonder whether it might be due to latency/delays in the filesystem operation leading to a directory not being empty yet. Not sure, just guessing, but I'll see if I can make the code more robust there.

In the meantime, as a workaround, you can use the advanced checkpoint/restart settings in the GUI so as to have only one checkpoint at the end of the computation. This should avoid going through the path that causes the error.

Best regards,

Yvan
Hi,

My cluster doesn't support the GUI. Is there a way to set the checkpoint/restart settings without it?

Best regards.

Re: Problem regarding submitting job to cluster

Posted: Fri Aug 05, 2022 12:44 pm
by Yvan Fournier
Hello,

Yes, you can do this in cs_user_parameters.c by calling the appropriate cs_restart... functions (I'm not sure there are examples for this in the Doxygen documentation, but the API is documented).
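
As a rough sketch of what this could look like, assuming the relevant call is cs_restart_checkpoint_set_interval(nt_interval, t_interval, wt_interval) and that an nt_interval of -1 means "checkpoint at the end of the computation only" (please check both against the cs_restart.h documentation for your version). The typedef and the function body marked as stand-ins below are not part of code_saturne; they are only there so the snippet compiles on its own, and should be replaced by the real code_saturne headers in an actual case.

```c
#include <stdio.h>

/* ---- Stand-ins so this sketch compiles outside code_saturne. ----------
   In a real cs_user_parameters.c, delete this part and include the
   code_saturne headers instead. */

typedef struct cs_domain_t cs_domain_t;  /* opaque placeholder type */

static int sketch_last_nt_interval = 0;  /* hypothetical, for checking only */

static void
cs_restart_checkpoint_set_interval(int     nt_interval,
                                   double  t_interval,
                                   double  wt_interval)
{
  /* The real function is provided by the library; this stub only
     records and prints what was requested. */
  sketch_last_nt_interval = nt_interval;
  printf("checkpoint interval: nt=%d, t=%g, wt=%g\n",
         nt_interval, t_interval, wt_interval);
}

/* ---- User settings (the part that would go in cs_user_parameters.c) --- */

void
cs_user_parameters(cs_domain_t  *domain)
{
  (void)domain;  /* unused in this sketch */

  /* Assumed meaning of nt_interval:
       -2: never, -1: at end of computation only,
        0: default, > 0: every nt_interval time steps.
     Negative t_interval / wt_interval are assumed to disable the
     physical-time and wall-clock-time triggers. */
  cs_restart_checkpoint_set_interval(-1, -1., -1.);
}
```

With the stand-ins removed, the file goes in the SRC directory of the case so that it is compiled with the run.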

But a simpler solution is to use the GUI on a workstation and copy/sync the setup.xml file to the cluster.

Regards,

Yvan