Changes between Version 1 and Version 2 of cypress/Programming/SpeedupScaling


Ignore:
Timestamp:
08/18/15 12:24:51 (9 years ago)
Author:
fuji
Comment:

Legend:

Unmodified
Added
Removed
Modified
  • cypress/Programming/SpeedupScaling

    v1 v2  
    22= Parallel Computing =
    33==  Moore's law ==
     4[[Image(Moores_law_(1970-2011).png, 25%)]]
     5
    46* 2X transistors/Chip Every 1.5 years Called “Moore’s Law”
    57* Microprocessors have become smaller, denser, and more powerful.
     
    1517* When task switches occur frequently enough the illusion of parallelism is achieved.
    1618
     19[[Image(CPUpowerDensity.tiff)]]
    1720
    1821== Multi-core ==
     
    2629A coprocessor is a computer processor used to supplement the functions of the primary processor (the CPU). Operations performed by the coprocessor may be floating point arithmetic, graphics, signal processing, string processing, encryption or I/O Interfacing with peripheral devices. By offloading processor-intensive tasks from the main processor, coprocessors can accelerate system performance.
    2730
     31[[Image()]]
    2832
    2933=== GPU ===
    3034A graphics processing unit (GPU), also occasionally called visual processing unit (VPU), is a specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display. GPUs are used in embedded systems, mobile phones, personal computers, workstations, and game consoles. Modern GPUs are very efficient at manipulating computer graphics, and their highly parallel structure makes them more effective than general-purpose CPUs for algorithms where processing of large blocks of data is done in parallel. In a personal computer, a GPU can be present on a video card, or it can be on the motherboard.
    31 
     35[[Image()]]
    3236
    3337==== GPU computing ====
     
    3842Intel Many Integrated Core Architecture or Intel MIC (pronounced Mick or Mike) is a coprocessor computer architecture developed by Intel, the Teraflops Research Chip multicore chip research project, and the Intel Single-chip Cloud Computer multicore microprocessor.
    3943
    40 
     44[[Image()]]
    4145----
    4246
    4347
    4448== Speedup Factor ==
    45 
     49[[Image(Speedup_fac1.png)]]
    4650* Suppose you have a code that takes ts~ seconds to run on one processor.
    4751
     
    6569
    6670This is the Amdahl's Law.
     71[[Image(Speedup_fac2.png)]]
     72
     73== Overhead time ==
     74
     75* Since some extra procedures are required for parallelization, there are overheads of each parallel process.
     76* For MPI codes, the communication between processes is a major overhead.
     77   * We can reduce this negative impact by overlapping communication with computations.
     78* For OpenMP, there are overhead time at the fork and join threads.
     79
     80[[Image(Speedup_fac3.png)]]