GPU