Changes between Version 2 and Version 3 of Workshops/JobCheckpointing/Examples/BASH


Timestamp: 03/20/2026 01:45:57 PM
Author: Carl Baribault
Comment: Added missing done, corrected numbered bullets

  • Workshops/JobCheckpointing/Examples/BASH

    v2 v3  
    6262    exit 0
    6363  fi
     64done
    6465}}}
    6566
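For orientation, the general checkpoint-and-resume pattern can be sketched in a few lines of BASH. This is a minimal illustration, not the workshop's '''checkpoint_signal_iter.sh'''. The names `STATE_FILE`, `TOTAL_ITERS`, `CHECKPOINT_EVERY`, and `save_checkpoint` are hypothetical, and the use of SIGTERM as the checkpoint signal is an assumption. The defaults of 20 and 500 mirror the iteration counts described in the run instructions.

```shell
#!/bin/bash
# Hypothetical sketch of iteration-based checkpointing; all names below are
# illustrative and are not taken from the workshop scripts.
STATE_FILE="${STATE_FILE:-./sketch_checkpoint.state}"
TOTAL_ITERS="${TOTAL_ITERS:-500}"
CHECKPOINT_EVERY="${CHECKPOINT_EVERY:-20}"

# Write the given iteration number to the checkpoint file.
save_checkpoint() {
  echo "$1" > "$STATE_FILE"
}

# Resume from the saved iteration if a checkpoint file already exists.
i=0
if [ -f "$STATE_FILE" ]; then
  i=$(cat "$STATE_FILE")
fi

# Assumption: the scheduler or a wrapper sends SIGTERM before stopping the job.
# On that signal, record the current iteration and exit cleanly.
trap 'save_checkpoint "$i"; exit 0' TERM

while [ "$i" -lt "$TOTAL_ITERS" ]; do
  i=$((i + 1))  # stand-in for one unit of application work
  # Save progress every CHECKPOINT_EVERY iterations.
  if [ $((i % CHECKPOINT_EVERY)) -eq 0 ]; then
    save_checkpoint "$i"
  fi
done
echo "finished at iteration $i"
```

Running the sketch a second time after an interruption picks up from the iteration stored in the state file. Delete that file to start over from zero.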
     
    6869To run the BASH checkpointing job example, defaulting to checkpointing every 20 application iterations and a total of 500 iterations, perform the following.
    6970
    70 1. Edit the files '''checkpoint_runner.sh''' and '''checkpoint_signal_iter.sh''' in your current directory.
     711. Edit the files '''checkpoint_runner.sh''' and '''checkpoint_signal_iter.sh''' as shown above in your current directory.
    7172 For file editing with nano, etc., see [[https://wiki.hpc.tulane.edu/trac/wiki/cypress/FileEditingSoftware/Example|File Editing Example]].
    7273
    73 2. Submit the job via the following command.
     742. Make the BASH application script, '''checkpoint_signal_iter.sh''', executable via the following command.
     75
     76{{{
     77[tulaneID@cypress1 ~]$ chmod u+x checkpoint_signal_iter.sh
     78}}}
     79
     803. Submit the job via the following command.
    7481
    7582{{{
     
    7784}}}
    7885
    79 2. Monitor the job's output via the following command, substituting the job ID for <jobID>.
     864. Monitor the job's output via the following command, substituting the job ID for <jobID>.
    8087
    8188{{{
     
    8390}}}
    8491
    85 3. Here are normal results for the output and error files, '''log_<jobID>.err''' and  '''log_<jobID>.out''', observing that the job cancelled and requeued itself many times.
     925. Here are normal results for the output and error files, '''log_<jobID>.err''' and  '''log_<jobID>.out''', observing that the job cancelled and requeued itself many times.
    8693
    8794{{{