CUDA:Documentation
CUDA C Programming Guide
- CUDA C Programming Guide v8.0
- http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#axzz4Otqd6zIG
- CUDA_C_Programming_Guide_-_v8.0.pdf
- CUDA C Programming Guide v9.1.85
- CUDA_C_Programming_Guide.pdf - Google translate (en -> ko): CUDA_C_Programming_Guide_-v9.1.85-01.pdf, CUDA_C_Programming_Guide-v9.1.85-_02.pdf
- CUDA C/C++ Programming Guide
- CUDA_C_and_CPP_Programming_Guide.zip - Reference: Korea Insurance Development Institute (보험개발원)
CUDA C Best Practices Guide
- CUDA C Best Practices Guide v7.5 (Korean translation)
- CUDA_C_Best_Practices_Guide_v7.5_-_ko.pdf
- CUDA C Best Practices Guide v8.0
- http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#axzz4Otqd6zIG
- CUDA_C_Best_Practices_Guide_-_v8.0.pdf
- CUDA C Best Practices Guide v9.1.85
- CUDA_C_Best_Practices_Guide.pdf - Google translate (en -> ko): CUDA_C_Best_Practices_Guide_-_google-trans-ko.pdf
CUDA Toolkit Documentation - v9.1.85
- Pascal Compatibility Guide: Pascal_Compatibility_Guide.pdf
- Volta Compatibility Guide: Volta_Compatibility_Guide.pdf
- Pascal Tuning Guide: Pascal_Tuning_Guide.pdf
- Volta Tuning Guide: Volta_Tuning_Guide.pdf
- PTX Interoperability: PTX_Writers_Guide_To_Interoperability.pdf
- Inline PTX Assembly: Inline_PTX_Assembly.pdf
- CUDA Runtime API: CUDA_Runtime_API.pdf
- CUDA Driver API: CUDA_Driver_API.pdf
- CUDA Math API: CUDA_Math_API.pdf
- NVCC: CUDA_Compiler_Driver_NVCC.pdf
- CUDA-GDB: cuda-gdb.pdf
- CUDA-MEMCHECK: CUDA_Memcheck.pdf
- Nsight Eclipse Edition: Nsight_Eclipse_Edition_Getting_Started.pdf
- Profiler: CUDA_Profiler_Users_Guide.pdf
- Google translate (en -> ko): CUDA_Profiler_Users_Guide_-v9.1.85-_ko.pdf
- CUDA Binary Utilities: CUDA_Binary_Utilities.pdf
NVIDIA Nsight
NVIDIA® Nsight™ Visual Studio Edition is a development environment for CUDA and graphics applications running on NVIDIA GPUs, which is integrated into Microsoft Visual Studio.
- CUDA Debugger: NVIDIA_Nsight_Visual_Studio_Edition_5.5_User_Guide_-_CUDA_Debugger.zip
- Analysis Tools: NVIDIA_Nsight_Visual_Studio_Edition_5.5_User_Guide_-_Analysis_Tools.zip
- Reference: NVIDIA_Nsight_Visual_Studio_Edition_5.5_User_Guide_-_Reference.zip
- Kernel-Level Experiments (ko): CUDA_Experiments_-Kernel-Level_Experiments-_ko.zip
Parallel Forall
- CUDA Dynamic Parallelism API and Principles 1
- CUDA Pro Tip: Minimize the Tail Effect 2
- Accelerate Machine Learning with the cuDNN Deep Neural Network Library
- [Recommended] CUDA Pro Tip: Optimize for Pointer Aliasing 3
- Google translate (en -> ko): CUDA_Pro_Tip_-Optimize_for_Pointer_Aliasing-_ko.pdf
- C++11 in CUDA: Variadic Templates 4
- The Power of C++11 in CUDA 7 5 (CUDA Lambda)
- [Recommended] GPU Pro Tip: Fast Dynamic Indexing of Private Arrays in CUDA 6
- Google translate (en -> ko): Fast_Dynamic_Indexing_of_Private_Arrays_in_CUDA_-_ko.pdf
- GPU Pro Tip: CUDA 7 Streams Simplify Concurrency 7
- [Recommended] CUDA Pro Tip: Always Set the Current Device to Avoid Multithreading Bugs 8
- Maximizing Unified Memory Performance in CUDA 9
- Register Cache: Caching for Warp-Centric CUDA Programs 10
- Cooperative Groups: Flexible CUDA Thread Programming 11
- Building Cross-Platform CUDA Applications with CMake
- Mixed-Precision Programming with CUDA 8
- GPU-Accelerated Black Hole Simulations
- Fast Multi-GPU collectives with NCCL
- How to Access Global Memory Efficiently in CUDA C/C++ Kernels 12
- [Recommended] CUDA Pro Tip: Occupancy API Simplifies Launch Configuration 13
- Google translate (en -> ko): CUDA_Pro_Tip_-Occupancy_API_Simplifies_Launch_Configuration.pdf-_ko.pdf
- [Recommended] How to Optimize Data Transfers in CUDA C/C++ 14
- [Recommended] How to Implement Performance Metrics in CUDA C/C++ 15
- Google translate (en -> ko): How_to_Implement_Performance_Metrics_in_CUDA_C_Cpp_-NVIDIA_Developer_Blog-_ko.pdf
- Unified Memory for CUDA Beginners
- CUDA Pro Tip: Optimized Filtering with Warp-Aggregated Atomics
- How to Query Device Properties and Handle Errors in CUDA C/C++
- How to Overlap Data Transfers in CUDA C/C++ (CUDA Streams)
- An Efficient Matrix Transpose in CUDA C/C++
- Finite Difference Methods in CUDA C/C++, Part 1
- Finite Difference Methods in CUDA C++, Part 2
Popular Posts
- [Recommended] Tesla V100 White Paper 16
- CUDA 9 Features Revealed: Volta, Cooperative Groups and More 17
- [Recommended] An Even Easier Introduction to CUDA
- Hybridizer: High-Performance C# on GPUs
- An Easy Introduction to CUDA C and C++
- NVIDIA Jetson TX2 Delivers Twice the Intelligence to the Edge
- [Recommended] Using Shared Memory in CUDA C/C++ 18
- Google translate (en -> ko): Using_Shared_Memory_in_CUDA_C_Cpp_-_ko.pdf
- Deep Learning in a Nutshell: Core Concepts
- Inside Volta: The World’s Most Advanced Data Center GPU 19
- Programming Tensor Cores in CUDA 9 20
- TensorRT 3: Faster TensorFlow Inference and Volta Support
- NVIDIA Docker: GPU Server Application Deployment Made Easy
Legacy
- [Recommended] horacio9573.no-ip.org - NVIDIA CUDA Library Documentation 4.0
- Horacio9573.no-ip.org-cuda.tar.gz
See also
Favorite sites
- CUDA Toolkit Documentation
- [Recommended] CUDA C/C++ Basics Supercomputing 2011 Tutorial 21
- CUDA Advance training (NVIDIA Korea): CUDA_Advance_-jahan-_korea_nvidia.pdf
References
1. CUDA_Dynamic_Parallelism_API_and_Principles_-_Parallel_Forall.pdf
2. CUDA_Pro_Tip_-Minimize_the_Tail_Effect-_Parallel_Forall.pdf
3. CUDA_Pro_Tip_-_Optimize_for_Pointer_Aliasing.pdf
4. Cpp11_in_CUDA_Variadic_Templates.pdf
5. The_Power_of_Cpp11_Programming_in_CUDA_7.pdf
6. Fast_Dynamic_Indexing_of_Private_Arrays_in_CUDA.pdf
7. GPU_Pro_Tip_-_CUDA_7_Streams_Simplify_Concurrency.pdf
8. CUDA_Pro_Tip_-_Always_Set_the_Current_Device_to_Avoid_Multithreading_Bugs.pdf
9. Maximizing_Unified_Memory_Performance_in_CUDA_-_NVIDIA_Developer_Blog.pdf
10. Register_Cache_-Caching_for_Warp-Centric_CUDA_Programs-_NVIDIA_Developer_Blog.pdf
11. Cooperative_Groups_-Flexible_CUDA_Thread_Programming-_NVIDIA_Developer_Blog.pdf
12. How_to_Access_Global_Memory_Efficiently_in_CUDA_C_Cpp_Kernels_-_NVIDIA_Developer_Blog.pdf
13. CUDA_Pro_Tip_-_Occupancy_API_Simplifies_Launch_Configuration.pdf
14. How_to_Optimize_Data_Transfers_in_CUDA_C_Cpp_-_NVIDIA_Developer_Blog.pdf
15. How_to_Implement_Performance_Metrics_in_CUDA_C_Cpp_-_NVIDIA_Developer_Blog.pdf
16. Volta-architecture-whitepaper.pdf
17. CUDA_9_Features_Revealed_-Volta_Cooperative_Groups_and_More-_NVIDIA_Developer_Blog.pdf
18. Using_Shared_Memory_in_CUDA_C_Cpp.pdf
19. Inside_Volta_-The_Worlds_Most_Advanced_Data_Center_GPU-_NVIDIA_Developer_Blog.pdf
20. Programming_Tensor_Cores_in_CUDA_9_-_NVIDIA_Developer_Blog.pdf
21. Sc11-cuda-c-basics.pdf