Figure 2 shows an example multiplying an input activation vector a (of length 8) by a 16×8 weight matrix W, yielding an output activation vector b (of length 16) on N = 4 PEs. The elements of a, b, and W are color coded with their PE assignments. Each PE owns 4 rows of W, 2 elements of a, and 4 elements of b.
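The ownership counts above can be checked with a small sketch. It assumes an interleaved assignment in which index i belongs to PE i mod N; the paragraph gives only the per-PE counts, so this mapping, like the helper names `owner`, `partition`, and `distributed_matvec`, is an illustrative assumption. How a PE obtains the elements of a that it does not own is left out of the sketch.

```python
N_PES = 4            # number of PEs (from the text)
ROWS, COLS = 16, 8   # shape of W (from the text)


def owner(index, n_pes=N_PES):
    """Assumed interleaved mapping: index i is owned by PE (i mod N)."""
    return index % n_pes


def partition(length, n_pes=N_PES):
    """Return {pe: [indices owned by that PE]} for a vector of this length."""
    parts = {k: [] for k in range(n_pes)}
    for i in range(length):
        parts[owner(i, n_pes)].append(i)
    return parts


def distributed_matvec(W, a, n_pes=N_PES):
    """Compute b = W a, with each PE producing only the b_i it owns."""
    b = [0] * len(W)
    rows_of = partition(len(W), n_pes)
    for pe in range(n_pes):
        for i in rows_of[pe]:
            # This PE holds row i of W and accumulates output element b_i.
            b[i] = sum(W[i][j] * a[j] for j in range(len(a)))
    return b


w_rows = partition(ROWS)   # rows of W per PE
a_elems = partition(COLS)  # elements of a per PE
b_elems = partition(ROWS)  # elements of b per PE

# Small deterministic integer example (values are arbitrary).
W = [[i + 2 * j for j in range(COLS)] for i in range(ROWS)]
a = list(range(COLS))
b = distributed_matvec(W, a)
```

Under this assumed mapping, PE 0 holds rows 0, 4, 8, and 12 of W and elements 0 and 4 of a, so every PE ends up with 4 rows of W, 2 elements of a, and 4 elements of b, matching the counts stated for the figure.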