Recovering from errors on the checkpoint data set
About this task
If there is a write error on the checkpoint data set, perform the following procedure:
- Stop IBM Z Workload Scheduler.
- Rename the checkpoint data set to a temporary name.
- Allocate a new checkpoint data set.
- Copy the old checkpoint data set into the new data set. This can be done by ISPF COPY or by IDCAMS REPRO.
- Start IBM Z Workload Scheduler again.
If there is a read error on the checkpoint data set, perform the following procedure:
- If a good new-current-plan does not exist:
- Stop IBM Z Workload Scheduler.
- Delete the checkpoint data set and reallocate it.
- Re-create the current plan using the refresh procedure (for details, see Re-creating the current plan from the long-term plan).
- If a good new-current-plan data set exists:
- Stop IBM Z Workload Scheduler.
- Check which job-tracking log is the current one. This can be done by reviewing the messages in the message log, or by browsing the JT log and checking the time stamp in position 13 in the first record of the data set. The data set with the latest time stamp in the first record is current.
- Copy the data from the active job-tracking log into the job-tracking log referenced by the EQQJT01 ddname.
- Determine which JS file was active. If EQQJS1DS defines the current data set, then continue with the next step. Otherwise, either copy the EQQJS2DS to the EQQJS1DS or switch the ddnames in the JCL procedure.
- Delete and reallocate the IBM Z Workload Scheduler checkpoint data set.
- Change the JTOPTS statement to specify JOBSUBMIT(NO) and CURRPLAN(NEW), and start the scheduler.
- Enter the Modify Current® Plan dialog to set correct status for all operations in the current plan.
- When all operations have correct status, enter the SERVICE FUNCTIONS panel and enable job submission again. Restore the JTOPTS statement if you changed it.