The CUDA Toolkit includes 100+ code samples, utilities, whitepapers, and additional documentation to help you get started developing, porting, and optimizing your applications for the CUDA architecture. You can get quick access to many of the toolkit resources on this page, CUDA documentation, or download the complete toolkit.
Please note that you may need to install the latest NVIDIA drivers and CUDA Toolkit to compile and run the code samples.
Refer to the samples release notes for more information.
|
||
![]() This sample shows how to copy CUDA image back to OpenGL using the most efficient methods. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() A CUDA program that demonstrates how to compute a stereo disparity map using SIMD SAD (Sum of Absolute Difference) intrinsics. Requires Compute Capability 2.0 or higher. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates 3D Volumetric Filtering using 3D Textures and 3D Surface Writes. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Variational optical flow estimation example. Uses textures for image operations. Shows how simple PDE solver can be accelerated with CUDA. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates how to effectively use the CUDA Video Encoder API encode H.264 video. Video input in YUV formats are taken as input (either CPU system or GPU memory) and video output frames are encoded to an H.264 file |
![]() ![]() |
|
|
||
![]() Bilateral filter is an edge-preserving non-linear smoothing filter that is implemented with CUDA with OpenGL rendering. It can be used in image recovery and denoising. Each pixel is weight by considering both the spatial distance and color distance between its neighbors. Reference:"C. Tomasi, R. Manduchi, Bilateral Filtering for Gray and Color Images, proceeding of the ICCV, 1998, http://users.soe.ucsc.edu/~manduchi/Papers/ICCV98.pdf" |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Simple example that demonstrates the use of 2D surface references (Write-to-Texture) |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample illustrates how to use function pointers and implements the Sobel Edge Detection filter for 8-bit monochrome images. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Use of Pitch Linear Textures |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Simple example that demonstrates use of Textures in CUDA. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Simple example that demonstrates use of Textures in CUDA. This sample uses the new CUDA 4.0 kernel launch Driver API. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Simple example that demonstrates use of 3D Textures in CUDA. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates how Discrete Cosine Transform (DCT) for blocks of 8 by 8 pixels can be performed using CUDA: a naive implementation by definition and a more traditional approach used in many libraries. As opposed to implementing DCT in a fragment shader, CUDA allows for an easier and more efficient implementation. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Discrete Haar wavelet decomposition for 1D signals with a length which is a power of 2. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Naturally(Hadamard)-ordered Fast Walsh Transform for batching vectors of arbitrary eligible lengths that are power of two in size. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates efficient implementation of 64-bin and 256-bin histogram. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Fast image box filter using CUDA with OpenGL rendering. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample shows how to post-process an image rendered in OpenGL using CUDA. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() High Quality DXT Compression using CUDA. This example shows how to implement an existing computationally-intensive CPU compression algorithm in parallel on the GPU, and obtain an order of magnitude performance improvement. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates two adaptive image denoising techniques: KNN and NLM, based on computation of both geometric and color distance between texels. While both techniques are implemented in the DirectX SDK using shaders, massively speeded up variation of the latter technique, taking advantage of shared memory, is implemented in addition to DirectX counterparts. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample implements the Sobel edge detection filter for 8-bit monochrome images. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample implements a Gaussian blur using Deriche's recursive method. The advantage of this method is that the execution time is independent of the filter width. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates how to efficiently use the CUDA Video Decoder API to decode MPEG-2, VC-1, or H.264 sources. YUV to RGB conversion of video is accomplished with CUDA kernel. The output result is rendered to a D3D9 surface. The decoded video is not displayed on the screen, but with -displayvideo at the command line parameter, the video output can be seen. Requires a Direct3D capable device and Compute Capability 1.1 or higher. |
![]() ![]() |
|
|
||
![]() This sample demonstrates how to efficiently use the CUDA Video Decoder API to decode video sources based on MPEG-2, VC-1, and H.264. YUV to RGB conversion of video is accomplished with CUDA kernel. The output result is rendered to a OpenGL surface. The decoded video is black, but can be enabled with -displayvideo added to the command line. Requires Compute Capability 1.1 or higher. |
![]() ![]() |
|
|
||
![]() This sample demonstrates how to efficiently implement a Bicubic B-spline interpolation filter with CUDA texture. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates how 2D convolutions with very large kernel sizes can be efficiently implemented using FFT transformations. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample implements a separable convolution filter of a 2D signal with a gaussian kernel. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() Texture-based implementation of a separable 2D convolution with a gaussian kernel. Used for performance comparison against convolutionSeparable. |
![]() ![]()
Download - Windows (x86) |
|
|
||
![]() This sample demonstrates basic volume rendering using 3D Textures. |
![]() ![]()
Download - Windows (x86) |