Tag: compute_52

  • Matching CUDA arch and CUDA gencode for various NVIDIA architectures

    Matching CUDA arch and CUDA gencode for various NVIDIA architectures

    Updated July 12th 2024 tl;dr I’ve seen some confusion regarding NVIDIA’s nvcc sm flags and what they’re used for:When compiling with NVCC, the arch flag (‘-arch‘) specifies the name of the NVIDIA GPU architecture that the CUDA files will be compiled for.Gencodes (‘-gencode‘) allows for more PTX generations and can be repeated many times for…