Changes between Version 4 and Version 5 of Workshops/JobParallelism

Timestamp:: 01/17/2026 11:42:37 PM (7 weeks ago)
Author:: Carl Baribault
Comment:: Iterated on section Before running...

Workshops/JobParallelism

-              v4
+              v5
 See [wiki:IntroToMulti-Processing2025August Module 2 of 8 - Introduction to Multi-processing] for more information on tools available on Cypress for creating and preparing your jobs at the various levels of parallelism - programming, single job, and multi-job.
 === Considerations before you build and/or run your job ===
+=== Before running your job ===
 Before you run your job you should consider the following in order for your job to run most efficiently on Cypress.
 …
 ==== Review your software provider's information ====
+Before you run your job, review the software provider's information so that your job takes the best advantage of the resources it requests. Look for items such as the following.
+ * Hardware requirements, including memory, number of processors, and/or supported or optimal thread count
+ * Guidelines, best practices, and tips & tricks.
+See [wiki:Workshops/IntroToMulti-Processing2025August#CodesforMulti-CoresMulti-Nodes.Offloading Codes for Multi-Cores, Multi-Nodes. Offloading].
+==== Choosing programming tools ====
+* Application programming level
+==== Choices for application programming tools ====
  Refer to the following table to determine what programming model to use based on the type of algorithm your job requires.
+ ||= Algorithm Type =||= Programming Model =||= Hardware Used =||= Example =||
+ ||Multithreaded (shared memory) ||OpenMP       ||1 Node, >=2 cores ||TBD ||
+ ||Problem domain decomposition  ||MPI          ||>=2 Nodes ||TBD ||
+ ||Massively Parallel Single Instruction Multiple Threads (SIMT) ||GPU kernels  ||GPUs      ||TBD ||
+ ||Hybrid Parallel ||MPI + (OpenMP or GPU kernels)  ||>=2 Nodes + GPUs     ||TBD ||
+ ||= Algorithm Type =||= Programming Model =||= Hardware Used =||= Examples =||
+ ||Single Instruction Multiple Data (SIMD) ||Compiler vectorization       ||Intel Advanced Vector Extensions (AVX), 256-bit vector processor ||See [https://wiki.hpc.tulane.edu/trac/wiki/cypress#MathLibraries Math Libraries] ||
+ ||Multithreaded (shared memory) ||OpenMP       ||1 Node, >=2 cores ||See [wiki:cypress/Programming/OpenMp OpenMP] ||
+ ||Problem domain decomposition  ||MPI          ||>=2 Nodes ||See [wiki:cypress/Programming/Mpi MPI]||
+ ||Massively Parallel, Single Instruction Multiple Threads (SIMT) ||#pragma offload (GPU kernels not available on Cypress)  ||Coprocessors - !XeonPhi (GPUs not available on Cypress)     ||See [wiki:cypress/XeonPhi XeonPhi], [wiki:Workshops/cypress/OffloadingWithOpenMP Offloading to Accelerator] ||
+ ||Hybrid Parallel ||MPI + OpenMP ||>=2 Nodes     ||See [wiki:cypress/using#HybridJobs Hybrid Jobs] job script||
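For the multithreaded (OpenMP) row above, the thread count used at run time should normally match the cores the job requested. The following is a minimal sketch, assuming the scheduler is Slurm and that it exports `SLURM_CPUS_PER_TASK`; the helper `threads_for_job`, the value of 4 cores, and the executable name are hypothetical and not taken from this page.

```shell
#!/bin/bash
#SBATCH --nodes=1            # shared-memory job: a single node
#SBATCH --ntasks=1           # one process
#SBATCH --cpus-per-task=4    # hypothetical core count; check your software's guidance

# Hypothetical helper: report the cores allocated to this task.
# Falls back to 1 when run outside the scheduler (assumption: Slurm
# sets SLURM_CPUS_PER_TASK when --cpus-per-task is given).
threads_for_job() {
    echo "${SLURM_CPUS_PER_TASK:-1}"
}

# OpenMP runtimes read OMP_NUM_THREADS to size the thread team.
OMP_NUM_THREADS="$(threads_for_job)"
export OMP_NUM_THREADS

echo "Running with OMP_NUM_THREADS=${OMP_NUM_THREADS}"
# ./my_openmp_program        # hypothetical executable, left commented out
```

Deriving the thread count from the allocation, rather than hard-coding it, keeps the two in step if the requested core count changes later.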
+* Job scripting level
+==== Choices for Job scripting ====
+ * Many independent tasks
  If you have data to be processed that is already divided - or can be divided - into separate files that can be processed entirely independent of each other, then you should consider using either a job array or GNU Parallel. For more information, see [wiki:IntroToMulti-Processing2025August#RunningManySerialParallelJobs Running Many Serial/Parallel Jobs].
+ See [wiki:IntroToMulti-Processing2025August#RunningManySerialParallelJobs Running Many Serial/Parallel Jobs] if your computational workload can be split easily - or perhaps with some minimal or one-time effort - into many independent tasks requiring minimal communication.
  ||= Task Criteria =||= Job Scripting Model =||
+ ||TBD ||Job Array   ||
  ||TBD ||GNU Parallel  ||
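As an illustration of the job array option, the following is a minimal sketch, assuming the scheduler is Slurm and that input files follow a numbered naming scheme. The file names, the helper `input_for_task`, the array range, and the executable are all hypothetical and not taken from this page.

```shell
#!/bin/bash
#SBATCH --job-name=array_sketch   # hypothetical job name
#SBATCH --array=0-3               # four independent array tasks, indices 0 through 3
#SBATCH --ntasks=1                # each array task is a single serial process
#SBATCH --cpus-per-task=1

# Hypothetical helper: map an array index to an input file name.
input_for_task() {
    printf 'input_%d.dat\n' "$1"
}

# Slurm sets SLURM_ARRAY_TASK_ID for each array task; default to 0 so
# the script can also be dry-run outside the scheduler.
TASK_ID="${SLURM_ARRAY_TASK_ID:-0}"
INPUT_FILE="$(input_for_task "$TASK_ID")"

echo "Task ${TASK_ID} would process ${INPUT_FILE}"
# ./my_program "${INPUT_FILE}"    # hypothetical executable, left commented out
```

With GNU Parallel, a comparable approach inside a single job might be `parallel ./my_program ::: input_*.dat`, which runs one instance per matching file; the program and file pattern are again placeholders.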
+ * Many dependent tasks
+ Otherwise, see [wiki:cypress/Programming/Mpi MPI] if your computational workload includes too many tasks to run on a single node '''and''' the tasks require a significant level of inter-task communication '''during''' the computation.