4.1. Frequently Asked Questions (FAQ)
4.1.1. How do I restart a DEAD task?
On platforms that utilize Rocoto workflow software (including Hera and Orion), if something goes wrong with the workflow, a task may end up in the DEAD state:
rocotostat -w land_analysis.xml -d land_analysis.db
CYCLE TASK JOBID STATE EXIT STATUS TRIES DURATION
=============================================================================
200001030000 prepexp 16779414 SUCCEEDED 0 1 11.0
200001030000 prepobs 16779415 SUCCEEDED 0 1 0.0
200001030000 prepbmat 16779416 SUCCEEDED 0 1 9.0
200001030000 runana 16779434 SUCCEEDED 0 1 68.0
200001030000 runfcst - DEAD 256 1 2186.0
This means that the dead task has not completed successfully, so the workflow has stopped. Once the issue has been identified and fixed (by referencing the log files), users can re-run the failed task using the rocotorewind
command:
rocotorewind -w land_analysis.xml -d land_analysis.db -v 10 -c 200001030000 -t runfcst
where -c
specifies the cycle date (first column of rocotostat
output) and -t
represents the task name
(second column of rocotostat
output). After using rocotorewind
, the next time rocotorun
is used to
advance the workflow, the job will be resubmitted.