Changes between Initial Version and Version 1 of Workshops/JobParallelism/AfterYourJobHasCompleted


Ignore:
Timestamp:
01/19/26 14:06:17 (25 hours ago)
Author:
Carl Baribault
Comment:

Setting up for sacct - WIP

Legend:

Unmodified
Added
Removed
Modified
  • Workshops/JobParallelism/AfterYourJobHasCompleted

    v1 v1  
     1[[PageOutline]]
     2= After your job has completed - determining cumulative core efficiency =
     3
     4== Assumptions ==
     5See [wiki:Workshops/JobParallelism/WhileYourJobIsRunning#Assumptions Assumptions] - same as for running jobs.
     6
     7== Preliminary: tools available ==
     8
     9=== LONI clusters ===
     10
     11LONI clusters provide the self-contained commands '''seff''' and '''qshow'''.
     12
     13  * '''seff''' (See [https://github.com/SchedMD/slurm/tree/master/contribs/seff seff on github].)
     14
     15  On LONI QB4 cluster:
     16
     17 {{{
     18[loniID@qbd2 ~]$ seff -h
     19Usage: seff [Options] <Jobid>
     20       Options:
     21       -h    Help menu
     22       -v    Version
     23       -d    Debug mode: display raw Slurm data
     24[loniID@qbd2 ~]$ seff -v
     25seff Version 2.1
     26
     27 }}}
     28
     29   * '''qshow''' (provided by LONI)
     30
     31  On LONI QB4 cluster:
     32
     33 {{{
     34[loniID@qbd2 ~]$ qshow -h
     35** usage: qshow -n <options> <base-name> <begin #> <end #> <command>
     36...
     37Show and optionally kill user processes on remote nodes or execute
     38commands...
     39[loniID@qbd2 ~]$ qshow -v
     40qshow 2.74
     41 }}}
     42
     43=== Cypress ===
     44
     45 In the following we'll need to use the '''sacct''' command for analyzing completed jobs on Cypress. (Cypress uses an older version of SLURM (v14.03.0) with insufficient support for the seff command.)
     46
     47 Here are the relevant outputs that we'll need from '''sacct'''.
     48
     49||='''sacct''' output column=||=Description=||=Format=||
     50||'''TotalCPU'''||Total core hours used||[DD-[hh:]]mm:ss)||
     51||'''CPUTimeRAW'''||Total cores hours allocated||Seconds||
     52
     53== Cumulative core efficency: (total core hours used) / (total core hours allocated ==
     54
     55 foo