cuda 245

  1. What is the canonical way to check for errors using the CUDA runtime API?
  2. How do I choose grid and block dimensions for CUDA kernels?
  3. How to get the NVIDIA driver version from the command line?
  4. CUDA incompatible with my gcc version
  5. Can I run CUDA on Intel's integrated graphics processor?
  6. Error message: Cannot find or open the PDB file
  7. CUDA vs FPGA?
  8. Compression library using Nvidia's CUDA
  9. CUDA Driver API vs. CUDA runtime
  10. CUDA apps time out & fail after several seconds - how to work around this?
  11. CUDA for .NET?
  12. Is branch divergence really so bad?
  13. Is it possible to run CUDA on AMD GPUs?
  14. What is the purpose of using multiple “arch” flags in Nvidia's NVCC compiler?
  15. CUDA gridDim and blockDim
  16. Why can't libcudart.so.4 be found when compiling the CUDA samples under Ubuntu?
  17. How do I start a CUDA app in Visual Studio 2010?
  18. Revert Apple Clang Version For NVCC
  19. How do I make an already written concurrent program run on a GPU array?
  20. Graph algorithms on GPU
  21. CUDA Block and Grid size efficiencies
  22. How to dynamically allocate arrays inside a kernel?
  23. Have you successfully used a GPGPU?
  24. Any Lisp extensions for CUDA?
  25. Efficient layout and reduction of virtual 2d data (abstract)
  26. “invalid configuration argument” error when calling a CUDA kernel?
  27. Could not insert 'nvidia_352': No such device
  28. CUDA: Wrapping device memory allocation in C++
  29. Is there a fast in-memory queue I can use that swaps items as it reaches a certain size?
  30. Installing Theano on Windows 8 with GPU enabled
  31. Modular arithmetic on the GPU
  32. Installing CUDA 5 samples in Ubuntu 12.10
  33. Interpreting Output of --ptxas-options=-v
  34. Feasibility of GPU as a CPU?
  35. Techniques to Reduce CPU to GPU Data Transfer Latency
  36. Lambda expressions with CUDA
  37. CUDA version X complains about not supporting gcc version Y - what to do?
  38. Numpy, BLAS and CUBLAS
  39. Nsight Eclipse CUDA + OpenCV
  40. nvidia-smi Volatile GPU-Utilization explanation?
  41. Is there an equivalent to memcpy() that works inside a CUDA kernel?
  42. Getting started with parallel programming
  43. CUDA compiler (nvcc) macro
  44. Reduce matrix rows with CUDA
  45. Bottom up set generation and ordering
  46. Unified Memory profiling failed
  47. What does #pragma unroll do exactly? Does it affect the number of threads?
  48. Aligning GPU memory accesses of an image convolution (OpenCL/CUDA) kernel
  49. Hamming weight based indexing
  50. How are CUDA threads divided into warps?