Changes between Version 1 and Version 2 of Workshops/JobCheckpointing/Examples/Python


Timestamp: 03/20/2026 01:48:46 PM
Author: Carl Baribault
Comment: Clarified and renumbered running instructions

  • Workshops/JobCheckpointing/Examples/Python

    v1  v2
    78  78  To run the Python checkpointing job example, defaulting to checkpointing every 20 application iterations and a total of 500 iterations, perform the following.
    79  79
    80      1. Edit the files '''checkpoint_runner.sh''' and '''checkpoint_signal_iter.py''' in your current directory.
        80  1. Edit the files '''checkpoint_runner.sh''' and '''checkpoint_signal_iter.py''' as shown above in your current directory.
    81  81   For file editing with nano, etc., see [[https://wiki.hpc.tulane.edu/trac/wiki/cypress/FileEditingSoftware/Example|File Editing Example]].
    82  82
     
    87  87  }}}
    88  88
    89      2. Monitor the job's output via the following command, substituting the job ID for <jobID>.
        89  3. Monitor the job's output via the following command, substituting the job ID for <jobID>.
    90  90
    91  91  {{{
     
    93  93  }}}
    94  94
    95      3. Here are normal results for the output and error files, '''log_<jobID>.err''' and '''log_<jobID>.out''', observing that the job cancelled and requeued itself many times. (Not all cancellations were captured in the error file.)
        95  4. Here are normal results for the output and error files, '''log_<jobID>.err''' and '''log_<jobID>.out''', observing that the job cancelled and requeued itself many times. (Not all cancellations were captured in the error file.)
    96  96
    97  97  {{{