https://haojian.github.io/DSC102SP24/static_files/presentations/9DataParallelismReplication.pdf

3 Paradigms of Multi-Node Parallelism

Shared-Nothing Data Parallelism

Data Partitioning Strategies

Tradeoffs:

Replicating partitions across nodes is common: it enables fault tolerance and can improve parallel runtime
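A minimal sketch of why replication helps with fault tolerance, assuming a simple round-robin placement. The function names, node names, and the `replication_factor` parameter are hypothetical and do not come from the slides; real systems use their own placement policies.

```python
def place_replicas(num_partitions, nodes, replication_factor=2):
    """Assign each partition to `replication_factor` distinct nodes, round-robin.

    Hypothetical placement policy, for illustration only.
    """
    if replication_factor > len(nodes):
        raise ValueError("replication_factor cannot exceed the number of nodes")
    placement = {}
    for p in range(num_partitions):
        # Copy r of partition p goes to the node r positions after the primary.
        placement[p] = [nodes[(p + r) % len(nodes)] for r in range(replication_factor)]
    return placement


def surviving_partitions(placement, failed_nodes):
    """Return the partitions that still have at least one copy on a live node."""
    failed = set(failed_nodes)
    return {p for p, holders in placement.items()
            if any(node not in failed for node in holders)}


nodes = ["node0", "node1", "node2"]
replicated = place_replicas(4, nodes, replication_factor=2)
unreplicated = place_replicas(4, nodes, replication_factor=1)
```

With two copies per partition, losing `node1` leaves every partition readable; with a single copy, the partition stored only on `node1` is lost. The extra copies also give a scheduler more than one node to read a partition from, which is one way replication can help parallel runtime.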

Cluster Architectures

Manager-Worker: Manager tells workers what to do and when to talk to other nodes

Peer-to-Peer: Workers talk to each other directly, with no central coordinator (decentralized)
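The manager-worker pattern can be sketched on a single machine, with threads standing in for worker nodes. This is only an illustration under that assumption: `manager_sum` and `worker_sum` are hypothetical names, and a real cluster would ship partitions over the network rather than share memory.

```python
from concurrent.futures import ThreadPoolExecutor


def worker_sum(partition):
    """Work done by one worker: a local sum over its own partition."""
    return sum(partition)


def manager_sum(data, num_workers=4):
    """Manager: split the data, hand one partition to each worker, combine results."""
    # Round-robin partitioning: element i goes to partition i % num_workers.
    partitions = [data[i::num_workers] for i in range(num_workers)]
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partial_sums = list(pool.map(worker_sum, partitions))
    # The manager alone combines the partial results; workers never talk to each other.
    return sum(partial_sums)


total = manager_sum(list(range(101)))
```

Here all coordination flows through the manager. In a peer-to-peer design the workers would instead exchange partial results among themselves.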

Quantifying Speedup from Parallelism
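The usual textbook definitions can be sketched as follows: speedup is the single-worker runtime divided by the runtime on n workers, and efficiency is speedup divided by n. This is an assumption about what the slide shows, and the function names and timings below are made up for illustration.

```python
def speedup(t_single, t_parallel):
    """Speedup = runtime with 1 worker / runtime with n workers."""
    return t_single / t_parallel


def efficiency(t_single, t_parallel, num_workers):
    """Fraction of the ideal (linear) speedup actually achieved."""
    return speedup(t_single, t_parallel) / num_workers


# Hypothetical timings: 120 s on 1 worker, 40 s on 4 workers.
s = speedup(120.0, 40.0)
e = efficiency(120.0, 40.0, 4)
```

With these made-up numbers the speedup is 3x on 4 workers, which is sublinear: efficiency is 0.75 rather than the ideal 1.0.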
