Changes between Version 1 and Version 2 of Workshops/JobCheckpointing/Examples/BASH
- Timestamp:
- 03/13/2026 11:58:35 PM (2 days ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Workshops/JobCheckpointing/Examples/BASH
v1 v2 98 98 99 99 {{{ 100 [tulaneID@cypress1 ~]$cat log_3300698. groupID.out100 [tulaneID@cypress1 ~]$cat log_3300698.out 101 101 Info[20260313-22:18:11]: Start on cypress01-066; JOB_ID=3300698; RESTARTS=0 102 102 Info[20260313-22:18:11]: Settings: … … 128 128 Bash SIGTERM: saving i=120 and exiting 99 129 129 Info[20260313-22:20:11]: Program exit code (from timeout wrapper): 124 130 Info[20260313-22:20:11]: Timeout TERM observed; checkpoint advanced (0 →120). Requeueing...130 Info[20260313-22:20:11]: Timeout TERM observed; checkpoint advanced (0->120). Requeueing... 131 131 Info[20260313-22:20:11]: Requeued via scontrol. 132 132 Info[20260313-22:20:27]: Start on cypress01-066; JOB_ID=3300698; RESTARTS=1 … … 159 159 Bash SIGTERM: saving i=240 and exiting 99 160 160 Info[20260313-22:22:27]: Program exit code (from timeout wrapper): 124 161 Info[20260313-22:22:27]: Timeout TERM observed; checkpoint advanced (120 →240). Requeueing...161 Info[20260313-22:22:27]: Timeout TERM observed; checkpoint advanced (120->240). Requeueing... 162 162 Info[20260313-22:22:27]: Requeued via scontrol. 163 163 Info[20260313-22:22:57]: Start on cypress01-066; JOB_ID=3300698; RESTARTS=2 … … 190 190 Bash SIGTERM: saving i=360 and exiting 99 191 191 Info[20260313-22:24:57]: Program exit code (from timeout wrapper): 124 192 Info[20260313-22:24:57]: Timeout TERM observed; checkpoint advanced (240 →360). Requeueing...192 Info[20260313-22:24:57]: Timeout TERM observed; checkpoint advanced (240->360). Requeueing... 193 193 Info[20260313-22:24:57]: Requeued via scontrol. 194 194 Info[20260313-22:25:27]: Start on cypress01-066; JOB_ID=3300698; RESTARTS=3 … … 221 221 Bash SIGTERM: saving i=480 and exiting 99 222 222 Info[20260313-22:27:27]: Program exit code (from timeout wrapper): 124 223 Info[20260313-22:27:27]: Timeout TERM observed; checkpoint advanced (360 →480). Requeueing...223 Info[20260313-22:27:27]: Timeout TERM observed; checkpoint advanced (360->480). Requeueing... 224 224 Info[20260313-22:27:27]: Requeued via scontrol. 225 225 Info[20260313-22:27:57]: Start on cypress01-066; JOB_ID=3300698; RESTARTS=4
