CUDA Tutorials, Samples, and Analysis
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Introduction In this post, I will outline what is needed to install and run several of Nvidia's profiling…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Introduction The following post goes over a simple demonstration of CUDA graphs, using the vector add code from…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Typical cuRAND Usage The code on this page demonstrates one common approach to generating random numbers on GPU…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Introduction The following code demonstrates peer-to-peer memory copy in CUDA. This is intended as a simple example. For…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Introduction This is a continuation of my posts on CUDA programming, for the previous post on thread indexing…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Thread Indexing In the part 1 post we ended with looking at launching kernels with multiple threads; however,…
Author: Greg GutmannAffiliation: Tokyo Institute of Technology, Nvidia University Ambassador, Nvidia DLI Prerequisites: some C/C++ familiarity and technical background, or good Googling skills. Introduction Many have discussed the benefits of…