--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
NVIDIA CUDA 
Windows Release Notes
Version 1.0
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------

--------------------------------------------------------------------------------
New Features
--------------------------------------------------------------------------------

  Hardware Support
  o  Additional hardware support added
     - Quadro FX 5600
     - Quadro FX 4600
     - GeForce 8800 Ultra 
     - GeForce 8600 GTS
     - GeForce 8600 GT
     - GeForce 8500 GT 

  Compiler and Toolchain
  o  PTX ISA support
  o  64-bit integer support
  o  maxrregcount option added to NVCC

  Mathematical Functions
  o  Additional functions added
     - sincos()
     - rsqrt()
     - exp10()
  o  Improved accuracy of mathematical functions     

  Miscellaneous
  o  Asynchronous Launches
  o  Asynchronous device to device memory copy

  CUFFT Library
  o  Real to Complex and Complex to Real FFT support
  o  Increased maximum 1-D FFT size to 8 million elements

  CUBLAS Library
  o  Additional functions added
     - cublasIsamin()
     - cublasIcamin()

--------------------------------------------------------------------------------
Major Performance Improvements
--------------------------------------------------------------------------------

  o  Improved device to device memory copy bandwidth
  o  Improved launch overhead

--------------------------------------------------------------------------------
Major Bug Fixes
--------------------------------------------------------------------------------

  o  Dynamic loading of CUDA, CUDA runtime, and CUDA Libraries now supported
  o  Fixed memory leak that required reboot
  o  Exception handling fixed for GPUs without a display attached
  o  D3D interopability no longer limited to a single VBO

--------------------------------------------------------------------------------
Known Issues
--------------------------------------------------------------------------------

o Individual GPU program launches are limited to a run time 
  of less than 5 seconds on a GPU with a display attached.
  Exceeding this time  limit usually causes a launch failure
  reported through the CUDA driver or the CUDA runtime.  GPUs
  without a display attached are not subject to the 5 second 
  run time restriction.  For this reason it is recommeded that 
  CUDA is run on a GPU that is NOT attached to a display and 
  does not have the Windows desktop extended onto it.  In this
  case, the system must contain at least one NVIDIA GPU that
  serves as the primary graphics adapter.

o Context creation is not thread safe.  Applications must take
  care that only one thread creates a context at a time.

o Launches that use texture are synchronous.

o When two 8800GTX GPUs are run in SLI mode, cudaGetDeviceCount()
  correctly reports a single device, but cudaDeviceProperties() 
  only returns the amount of memory on the first card, instead
  of the total amount of memory installed on both cards used for
  SLI.

o The compiler does not emit error messages in a format that 
  can be processed by the Microsoft Visual Studio environment.

o OpenGL interopability may not function correctly on some
  systems with multiple displays enabled.  To avoid this issue,
  set the following option in the NVIDIA Control Panel:
  3D Settings -> Manage 3D settings ->
  Multi-display/mixed-GPU acceleration -> 
  Single display performance mode

o For graphics interoperability, D3D or OpenGL must be running on the
  same GPU as the compute context.  As a result, graphics interopability 
  does not work on systems with multiple GPUs installed.  

--------------------------------------------------------------------------------
Revision History
--------------------------------------------------------------------------------

  06/2007 - Version 1.0
  06/2007 - Version 0.9
  02/2007 - Version 0.8 - Initial public Beta 

--------------------------------------------------------------------------------
Open64 Sources
--------------------------------------------------------------------------------

The Open64 source files controlled under terms of the GPL license are
located via anonymous ftp at download.nvidia.com in the CUDAOpen64 directory.

--------------------------------------------------------------------------------
More Information
--------------------------------------------------------------------------------

  For more information and help with CUDA, please visit
  http://www.nvidia.com/cuda