Table of Contents

Course Slides

Introduction to the Course

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/lecture-001.ppt (1.148 Mb) Overview of the CUDA execution model, and of a typical NVIDIA GPU containing system. Also, overview of the course.

Introduction to CUDA Programming

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/002 - Introduction to CUDA Programming_1.ppt (412.5 Kb)

CUDA Hardware

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/003 - Hardware.ppt (1,000 Kb)

Optimizing for CUDA

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/004 - Optimizing code.ppt (2.088 Mb)

Reductions

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/005 - Reduction.ppt (1.236 Mb)

Histograms and Sparse Matrix Multiplication

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/006 - Histogram and Sparse Arrays.ppt (242.5 Kb)

Scans and Streams

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/007 - Scans.ppt (1.708 Mb)

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/Scans Explained_1.ppt (211 Kb)

Added slides on explaining the balanced tree algorithm for calculating scans.

Textures

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/008 - Textures.ppt (1.742 Mb)

hiCUDA

Lecturer: Prof. Tarek Abdelrahman http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/hicuda.pdf (1.273 Mb)

Profiler, Assembly, and Floating Point

http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/009 - FP and Assembly.ppt (2.734 Mb)