Parallel computing

Parallel computing is a form of computation in which a problem is broken into parts that run simultaneously across multiple processors, cores, or machines. The motivation is straightforward: single-core clock speeds have largely plateaued because of power and heat limits, so modern CPUs gain performance by adding more cores rather than running each core faster. To take advantage of that, programs have to be written with parallelism in mind.

Not every program benefits equally. Amdahl's law shows that the sequential fraction of a program (the part that cannot be parallelized) caps the achievable speedup for a fixed problem size: if a fraction s of the runtime is serial, the speedup can never exceed 1/s, no matter how many cores you add. Gustafson's law is the more optimistic counterpart: if the problem size grows with the hardware so that the serial part stays roughly constant, the scaled speedup grows approximately linearly with the number of processors. In practice, many HPC workloads operate in Gustafson's regime: you buy more nodes to solve a bigger problem, not just to solve the same one faster.
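To make the contrast between the two laws concrete, here is a minimal Python sketch under the usual textbook formulations: Amdahl's speedup as 1 / (s + (1 - s) / N) and Gustafson's scaled speedup as s + (1 - s) * N, where s is the serial fraction and N the number of workers. The function names and the 5% serial fraction are illustrative assumptions, not taken from any library. Note also that s means slightly different things in the two laws (a fraction of the single-processor runtime for Amdahl, a fraction of the runtime on the parallel machine for Gustafson), so the numbers are indicative rather than directly comparable.

```python
def amdahl_speedup(serial_fraction: float, workers: int) -> float:
    """Fixed problem size: S(N) = 1 / (s + (1 - s) / N)."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / workers)


def gustafson_speedup(serial_fraction: float, workers: int) -> float:
    """Problem size scaled with the machine: S(N) = s + (1 - s) * N."""
    return serial_fraction + (1.0 - serial_fraction) * workers


# Assumed example value: 5% of the work is inherently serial.
s = 0.05
for n in (1, 8, 64, 1024):
    print(f"N={n:5d}  Amdahl={amdahl_speedup(s, n):8.2f}  "
          f"Gustafson={gustafson_speedup(s, n):8.2f}")
```

With a 5% serial fraction, the Amdahl column flattens out below 1 / 0.05 = 20 however large N gets, while the Gustafson column keeps growing roughly in proportion to N.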

Map of parallel computing

introduction-to-parallel-computing
SAXPY
synchronization-primitive
- sync-semaphore
- sync-mutex
- sync-monitor
- sync-linda
- sync-csp
- sync-mbox
Numbers every programmer should know
Amdahl's law
Gustafson's law
cache
- l1-cache
- l2-cache
- l3-cache
- cache-coherence
  - cache-snoopy-protocols
    - wti
    - msi
    - mesi
    - moesi
    - dragon
    - firefly
  - cache-directory-protocols
cuda
OpenMP
MPI
queuing-theory
numa
false-sharing
ABA problem
Trace monoid
hazard-pointer
Roofline model
embarrassingly-parallel
fork-join-model
rcu
lock
lock-convoy
lock-contention
spinlock
smp
flops
- gflops
- tflops
atomics
- atomic-ops
michael-scott-queue
bespoke-algorithm
interconnect
- infiniband
- rdma
