We need a diverse dataset, and as the number of examples increases, the accuracy / evaluation metrics of the model improve (with respect to the bias variance tradeoff)
Main Idea: Workload takes long for a single processor
Solution: Split workload among processors / machines in a divide & conquer approach
Recall:
each process can have multiple threads
each CPU core can only execute 1 instruction / 1 thread
each thread runs its computation simultaneously
All threads share address space (data)
Special Case: Hyperthreading virtualizes a core to run multiple threads
A directed graph representation of a program with vertices