Restart the run when the checkpoint folder is missing
Posted: Tue Dec 13, 2022 4:25 pm
Hi user,
Last night there was uneven breakdown in Curta cluster due to cooling failure. Few of our cases terminated after almost 45 hours of run. The checkpoint file is not saved for these cases, however a temporary restart file is available in the folder. Is there any way to recover the results for further run.
It will be great help for me
With Thanks and Regards
Sofen
Last night there was uneven breakdown in Curta cluster due to cooling failure. Few of our cases terminated after almost 45 hours of run. The checkpoint file is not saved for these cases, however a temporary restart file is available in the folder. Is there any way to recover the results for further run.
It will be great help for me
With Thanks and Regards
Sofen