--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
NVIDIA CUDA
Windows
Release Notes
Version 1.0
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------

--------------------------------------------------------------------------------
New Features
--------------------------------------------------------------------------------

  Hardware Support
  o Additional hardware support added
    - Quadro FX 5600
    - Quadro FX 4600
    - GeForce 8800 Ultra
    - GeForce 8600 GTS
    - GeForce 8600 GT
    - GeForce 8500 GT

  Compiler and Toolchain
  o PTX ISA support
  o 64-bit integer support
  o maxrregcount option added to NVCC

  Mathematical Functions
  o Additional functions added
    - sincos()
    - rsqrt()
    - exp10()
  o Improved accuracy of mathematical functions

  Miscellaneous
  o Asynchronous launches
  o Asynchronous device to device memory copy

  CUFFT Library
  o Real to Complex and Complex to Real FFT support
  o Increased maximum 1-D FFT size to 8 million elements

  CUBLAS Library
  o Additional functions added
    - cublasIsamin()
    - cublasIcamin()

--------------------------------------------------------------------------------
Major Performance Improvements
--------------------------------------------------------------------------------

  o Improved device to device memory copy bandwidth
  o Reduced launch overhead

--------------------------------------------------------------------------------
Major Bug Fixes
--------------------------------------------------------------------------------

  o Dynamic loading of CUDA, CUDA runtime, and CUDA libraries now supported
  o Fixed a memory leak that required a reboot
  o Exception handling fixed for GPUs without a display attached
  o D3D interoperability no longer limited to a single VBO

--------------------------------------------------------------------------------
Known Issues
--------------------------------------------------------------------------------

  o Individual GPU program launches are limited to a run time of less than
    5 seconds on a GPU with a display attached. Exceeding this time limit
    usually causes a launch failure reported through the CUDA driver or the
    CUDA runtime. GPUs without a display attached are not subject to the
    5 second run time restriction. For this reason it is recommended that
    CUDA be run on a GPU that is NOT attached to a display and does not have
    the Windows desktop extended onto it. In this case, the system must
    contain at least one NVIDIA GPU that serves as the primary graphics
    adapter.

  o Context creation is not thread safe. Applications must take care that
    only one thread creates a context at a time.

  o Launches that use textures are synchronous.

  o When two 8800GTX GPUs are run in SLI mode, cudaGetDeviceCount() correctly
    reports a single device, but cudaGetDeviceProperties() only returns the
    amount of memory on the first card, instead of the total amount of
    memory installed on both cards used for SLI.

  o The compiler does not emit error messages in a format that can be
    processed by the Microsoft Visual Studio environment.

  o OpenGL interoperability may not function correctly on some systems with
    multiple displays enabled. To avoid this issue, set the following option
    in the NVIDIA Control Panel:
      3D Settings -> Manage 3D settings ->
      Multi-display/mixed-GPU acceleration -> Single display performance mode

  o For graphics interoperability, D3D or OpenGL must be running on the same
    GPU as the compute context. As a result, graphics interoperability does
    not work on systems with multiple GPUs installed.
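
  Regarding the context creation issue listed above, one possible way for an
  application to ensure that only one thread creates a context at a time is
  to guard the creation call with a single process-wide lock. The following
  is a minimal host-side sketch under stated assumptions, not NVIDIA's
  recommended implementation. The names create_context_stub(),
  create_context_serialized() and run_workers() are hypothetical, and the
  stub only stands in for the real creation call so that the sketch can be
  compiled without a GPU or the CUDA toolkit.

```cpp
#include <mutex>
#include <thread>
#include <vector>

// Bookkeeping used only by this sketch. All three counters are touched
// exclusively while g_context_creation_mutex is held.
static int g_active_creators = 0;  // threads currently inside the stub
static int g_max_concurrent  = 0;  // highest overlap ever observed
static int g_contexts_made   = 0;  // total number of "contexts" created

// Hypothetical stand-in for the real context-creation call. It records how
// many callers are inside it at once instead of talking to a device.
static void create_context_stub() {
    ++g_active_creators;
    if (g_active_creators > g_max_concurrent)
        g_max_concurrent = g_active_creators;
    std::this_thread::yield();     // invite other threads to interleave
    ++g_contexts_made;
    --g_active_creators;
}

// One lock shared by every thread that wants to create a context.
static std::mutex g_context_creation_mutex;

// Serializes context creation: at most one thread is inside the creation
// call at any time. Replace the stub with the application's real call.
void create_context_serialized() {
    std::lock_guard<std::mutex> lock(g_context_creation_mutex);
    create_context_stub();
}

// Starts num_threads workers that each create one context, waits for all of
// them, and returns the largest number of threads that were ever inside the
// creation call together.
int run_workers(int num_threads) {
    std::vector<std::thread> workers;
    for (int i = 0; i < num_threads; ++i)
        workers.emplace_back(create_context_serialized);
    for (auto& t : workers)
        t.join();
    return g_max_concurrent;
}
```

  With the lock in place, run_workers(8) creates eight contexts while never
  letting two threads overlap inside the creation call. The lock only needs
  to cover context creation itself; work done by each thread afterwards
  does not have to hold it.
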
--------------------------------------------------------------------------------
Revision History
--------------------------------------------------------------------------------

  06/2007 - Version 1.0
  06/2007 - Version 0.9
  02/2007 - Version 0.8 - Initial public Beta

--------------------------------------------------------------------------------
Open64 Sources
--------------------------------------------------------------------------------

  The Open64 source files controlled under terms of the GPL license are
  located via anonymous ftp at download.nvidia.com in the CUDAOpen64
  directory.

--------------------------------------------------------------------------------
More Information
--------------------------------------------------------------------------------

  For more information and help with CUDA, please visit
  http://www.nvidia.com/cuda