Slide 32 of 49
cmchiang

If we fuse those three computations, we no longer need to store each temporary result to memory and load it back again. The same arithmetic is done with fewer memory accesses, so the arithmetic intensity increases.
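To make the point concrete, here is a minimal sketch of what fusing could look like. The slide's code is not reproduced in this thread, so the expression `E = D + (A + B) * C`, the function names `add`, `mul`, `unfused`, and `fused`, and the buffers `tmp1` and `tmp2` are all assumptions chosen for illustration. The access counts in the comments assume arrays too large for the temporaries to stay in cache.

```c
#include <stddef.h>

/* Hypothetical element-wise kernels: 2 loads, 1 store, 1 arithmetic op per element. */
static void add(size_t n, const float *a, const float *b, float *out) {
    for (size_t i = 0; i < n; i++) out[i] = a[i] + b[i];
}

static void mul(size_t n, const float *a, const float *b, float *out) {
    for (size_t i = 0; i < n; i++) out[i] = a[i] * b[i];
}

/* Unfused: three passes over the data. tmp1 and tmp2 are each stored
 * and then loaded again, so every element costs 6 loads + 3 stores
 * for 3 arithmetic ops (intensity 3/9 = 1/3). */
void unfused(size_t n, const float *A, const float *B, const float *C,
             const float *D, float *tmp1, float *tmp2, float *E) {
    add(n, A, B, tmp1);
    mul(n, tmp1, C, tmp2);
    add(n, tmp2, D, E);
}

/* Fused: one pass. The intermediates stay in registers, so every
 * element costs 4 loads + 1 store for the same 3 arithmetic ops
 * (intensity 3/5). */
void fused(size_t n, const float *A, const float *B, const float *C,
           const float *D, float *E) {
    for (size_t i = 0; i < n; i++) {
        E[i] = D[i] + (A[i] + B[i]) * C[i];
    }
}
```

Both versions compute the same result. The fused one trades the reusable `add` and `mul` building blocks for a single specialized loop, which is the usual cost of this optimization.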

SebL

Fusing the computations is also a way to implement and improve parallelism.
