SLI Zone
NVIDIA.com Developer Home

NVIDIA OpenCL SDK - Linear Algebra

The GPU Computing SDK provides examples with source code, utilities, and white papers to help you get started writing GPU Computing software. The full SDK includes dozens of code samples covering a wide range of applications.

Refer to the following README for related SDK information ( README )

The latest NVIDIA display drivers are required to run code samples. Please obtain the latest display driver here.

The NVIDIA CUDA Toolkit is required to compile code samples. Please obtain the CUDA Toolkit from CUDA Zone.

Select the category to view:

OpenCL Vector Addition For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Element by element addition of two 1-dimensional arrays. Implemented in OpenCL for CUDA GPU's, with functional comparison against a simple C++ host CPU implementation.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac


OpenCL Dot Product For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Dot Product (scalar product) of set of input vector pairs. Implemented in OpenCL for CUDA GPU's, with functional comparison against a simple C++ host CPU implementation.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac


OpenCL Matrix Vector Multiplication For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Simple matrix-vector multiplication example showing increasingly optimized implementations.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac


OpenCL Matrix Transpose For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Efficient matrix transpose.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac


OpenCL Matrix Multiplication For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements matrix multiplication and is exactly the same as Chapter 6 of the programming guide. It has been written for clarity of exposition to illustrate various OpenCL programming principles, not with the goal of providing the most performant generic kernel for matrix multiplication. CUBLAS provides high-performance matrix multiplication.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac


OpenCL Separable Convolution For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements convolution filter of a 2D image with arbitrary separable kernel.
  Minimum Required GPU
Minimum Required GPUor later
Minimum Required GPU




Download - Windows (x86)
Download - Windows (x64)
Download - Linux/Mac

Last Update: 2/28/2010
NVPerfHUD 4