Tuesday, March 23, 2010

CUDA Toolkit 3.0 now available

Good news!
There're some really delicious things in it:
  • Multiple Copy Engine support
  • Concurrent Kernel Execution
  • C++ Class Inheritance and Template Inheritance support for increased programmer productivity
  • CUDA Driver / Runtime Buffer Interoperability, which allows applications using the CUDA Driver API to also use libraries implemented using the CUDA C Runtime such as CUFFT and CUBLAS.
  • CUBLAS now supports all BLAS1, 2, and 3 routines including those for single and double precision complex numbers
  • New CUDA Memory Checker reports misalignment and out of bounds errors, available as a stand-alone utility and debugging mode within cuda-gdb
  • On Linux, use cuda-gdb and cuda-memcheck, and check out the solutions from Allinea and TotalView that will be available soon.
  • OpenCL Images support, for better/faster image filtering
C++ class inheritance is what I already was needed in my project. Higher performance and new debugging abilities also sounds great.

No comments:

Post a Comment