====== Course Slides ====== ==== Introduction to the Course ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/lecture-001.ppt]] (1.148 Mb) Overview of the CUDA execution model, and of a typical NVIDIA GPU containing system. Also, overview of the course. ==== Introduction to CUDA Programming ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/002 - Introduction to CUDA Programming_1.ppt]] (412.5 Kb) ==== CUDA Hardware ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/003 - Hardware.ppt]] (1,000 Kb) ==== Optimizing for CUDA ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/004 - Optimizing code.ppt]] (2.088 Mb) ==== Reductions ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/005 - Reduction.ppt]] (1.236 Mb) ==== Histograms and Sparse Matrix Multiplication ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/006 - Histogram and Sparse Arrays.ppt]] (242.5 Kb) ==== Scans and Streams ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/007 - Scans.ppt]] (1.708 Mb) [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/Scans Explained_1.ppt]] (211 Kb) Added slides on explaining the balanced tree algorithm for calculating scans. ==== Textures ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/008 - Textures.ppt]] (1.742 Mb) ==== hiCUDA ==== **Lecturer: Prof. Tarek Abdelrahman** [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/hicuda.pdf]] (1.273 Mb) ==== Profiler, Assembly, and Floating Point ==== [[http://www.eecg.toronto.edu/~moshovos/CUDA08/slides/009 - FP and Assembly.ppt]] (2.734 Mb)