CUDA 运行时 API :: CUDA 工具包文档

7.9. cudaDeviceProp 结构体参考

[CUDA 运行时使用的数据类型]

CUDA 设备属性

公共变量

int ECCEnabled
int accessPolicyMaxWindowSize
int asyncEngineCount
int canMapHostMemory
int canUseHostPointerForRegisteredMem
int clockRate
int clusterLaunch
int computeMode
int computePreemptionSupported
int concurrentKernels
int concurrentManagedAccess
int cooperativeLaunch
int cooperativeMultiDeviceLaunch
int deferredMappingCudaArraySupported
int deviceOverlap
int directManagedMemAccessFromHost
int globalL1CacheSupported
unsigned int gpuDirectRDMAFlushWritesOptions
int gpuDirectRDMASupported
int gpuDirectRDMAWritesOrdering
int hostNativeAtomicSupported
int hostRegisterReadOnlySupported
int hostRegisterSupported
int integrated
int ipcEventSupported
int isMultiGpuBoard
int kernelExecTimeoutEnabled
int l2CacheSize
int localL1CacheSupported
char luid[8]
unsigned int luidDeviceNodeMask
int major
int managedMemory
int maxBlocksPerMultiProcessor
int maxGridSize[3]
int maxSurface1D
int maxSurface1DLayered[2]
int maxSurface2D[2]
int maxSurface2DLayered[3]
int maxSurface3D[3]
int maxSurfaceCubemap
int maxSurfaceCubemapLayered[2]
int maxTexture1D
int maxTexture1DLayered[2]
int maxTexture1DLinear
int maxTexture1DMipmap
int maxTexture2D[2]
int maxTexture2DGather[2]
int maxTexture2DLayered[3]
int maxTexture2DLinear[3]
int maxTexture2DMipmap[2]
int maxTexture3D[3]
int maxTexture3DAlt[3]
int maxTextureCubemap
int maxTextureCubemapLayered[2]
int maxThreadsDim[3]
int maxThreadsPerBlock
int maxThreadsPerMultiProcessor
size_t memPitch
int memoryBusWidth
int memoryClockRate
unsigned int memoryPoolSupportedHandleTypes
int memoryPoolsSupported
int minor
int multiGpuBoardGroupID
int multiProcessorCount
char name[256]
int pageableMemoryAccess
int pageableMemoryAccessUsesHostPageTables
int pciBusID
int pciDeviceID
int pciDomainID
int persistingL2CacheMaxSize
int regsPerBlock
int regsPerMultiprocessor
int reserved[63]
size_t reservedSharedMemPerBlock
size_t sharedMemPerBlock
size_t sharedMemPerBlockOptin
size_t sharedMemPerMultiprocessor
int singleToDoublePrecisionPerfRatio
int sparseCudaArraySupported
int streamPrioritiesSupported
size_t surfaceAlignment
int tccDriver
size_t textureAlignment
size_t texturePitchAlignment
int timelineSemaphoreInteropSupported
size_t totalConstMem
size_t totalGlobalMem
int unifiedAddressing
int unifiedFunctionPointers
cudaUUID_t uuid
int warpSize

变量

int cudaDeviceProp::ECCEnabled [继承]: 设备已启用 ECC 支持
int cudaDeviceProp::accessPolicyMaxWindowSize [继承]: cudaAccessPolicyWindow::num_bytes 的最大值。
int cudaDeviceProp::asyncEngineCount [继承]: 异步引擎的数量
int cudaDeviceProp::canMapHostMemory [继承]: 设备可以使用 cudaHostAlloc/cudaHostGetDevicePointer 映射主机内存
int cudaDeviceProp::canUseHostPointerForRegisteredMem [继承]: 设备可以像 CPU 一样在相同的虚拟地址访问主机注册内存
int cudaDeviceProp::clockRate [继承]: 已弃用，时钟频率，单位为千赫兹
int cudaDeviceProp::clusterLaunch [继承]: 指示设备支持集群启动
int cudaDeviceProp::computeMode [继承]: 已弃用，计算模式（参见 cudaComputeMode）
int cudaDeviceProp::computePreemptionSupported [继承]: 设备支持计算抢占
int cudaDeviceProp::concurrentKernels [继承]: 设备可能可以并发执行多个内核
int cudaDeviceProp::concurrentManagedAccess [继承]: 设备可以与 CPU 并发地连贯访问托管内存
int cudaDeviceProp::cooperativeLaunch [继承]: 设备支持通过 cudaLaunchCooperativeKernel 启动协同内核
int cudaDeviceProp::cooperativeMultiDeviceLaunch [继承]: 已弃用，cudaLaunchCooperativeKernelMultiDevice 已弃用。
int cudaDeviceProp::deferredMappingCudaArraySupported [继承]: 如果设备支持延迟映射 CUDA 数组和 CUDA mipmapped 数组，则为 1
int cudaDeviceProp::deviceOverlap [继承]: 设备可以并发地复制内存并执行内核。已弃用。请改用 asyncEngineCount。
int cudaDeviceProp::directManagedMemAccessFromHost [继承]: 主机可以直接访问设备上的托管内存，而无需迁移。
int cudaDeviceProp::globalL1CacheSupported [继承]: 设备支持在 L1 中缓存全局变量
unsigned int cudaDeviceProp::gpuDirectRDMAFlushWritesOptions [继承]: 要根据 cudaFlushGPUDirectRDMAWritesOptions 枚举解释的位掩码
int cudaDeviceProp::gpuDirectRDMASupported [继承]: 如果设备支持 GPUDirect RDMA API，则为 1，否则为 0
int cudaDeviceProp::gpuDirectRDMAWritesOrdering [继承]: 有关数值，请参见 cudaGPUDirectRDMAWritesOrdering 枚举
int cudaDeviceProp::hostNativeAtomicSupported [继承]: 设备和主机之间的链接支持本机原子操作
int cudaDeviceProp::hostRegisterReadOnlySupported [继承]: 设备支持使用 cudaHostRegister 标志 cudaHostRegisterReadOnly 来注册必须作为只读映射到 GPU 的内存
int cudaDeviceProp::hostRegisterSupported [继承]: 设备支持通过 cudaHostRegister 进行主机内存注册。
int cudaDeviceProp::integrated [继承]: 设备是集成的，而不是离散的
int cudaDeviceProp::ipcEventSupported [继承]: 设备支持 IPC 事件。
int cudaDeviceProp::isMultiGpuBoard [继承]: 设备位于多 GPU 板上
int cudaDeviceProp::kernelExecTimeoutEnabled [继承]: 已弃用，指定内核运行时是否有限制
int cudaDeviceProp::l2CacheSize [继承]: L2 缓存的大小，以字节为单位
int cudaDeviceProp::localL1CacheSupported [继承]: 设备支持在 L1 中缓存本地变量
char cudaDeviceProp::luid[8] [继承]: 8 字节本地唯一标识符。在 TCC 和非 Windows 平台上，该值未定义
unsigned int cudaDeviceProp::luidDeviceNodeMask [继承]: LUID 设备节点掩码。在 TCC 和非 Windows 平台上，该值未定义
int cudaDeviceProp::major [继承]: 主要计算能力
int cudaDeviceProp::managedMemory [继承]: 设备支持在此系统上分配托管内存
int cudaDeviceProp::maxBlocksPerMultiProcessor [继承]: 每个多处理器允许的最大常驻块数
int cudaDeviceProp::maxGridSize[3] [继承]: 网格每个维度的最大大小
int cudaDeviceProp::maxSurface1D [继承]: 最大 1D 表面尺寸
int cudaDeviceProp::maxSurface1DLayered[2] [继承]: 最大 1D 分层表面尺寸
int cudaDeviceProp::maxSurface2D[2] [继承]: 最大 2D 表面尺寸
int cudaDeviceProp::maxSurface2DLayered[3] [继承]: 最大 2D 分层表面尺寸
int cudaDeviceProp::maxSurface3D[3] [继承]: 最大 3D 表面尺寸
int cudaDeviceProp::maxSurfaceCubemap [继承]: 最大 Cubemap 表面尺寸
int cudaDeviceProp::maxSurfaceCubemapLayered[2] [继承]: 最大 Cubemap 分层表面尺寸
int cudaDeviceProp::maxTexture1D [继承]: 最大 1D 纹理尺寸
int cudaDeviceProp::maxTexture1DLayered[2] [继承]: 最大 1D 分层纹理尺寸
int cudaDeviceProp::maxTexture1DLinear [继承]: 已弃用，请勿使用。请改用 cudaDeviceGetTexture1DLinearMaxWidth() 或 cuDeviceGetTexture1DLinearMaxWidth()。
int cudaDeviceProp::maxTexture1DMipmap [继承]: 最大 1D mipmapped 纹理尺寸
int cudaDeviceProp::maxTexture2D[2] [inherited]: 最大 2D 纹理尺寸
int cudaDeviceProp::maxTexture2DGather[2] [inherited]: 如果需要执行纹理Gather操作，则为最大 2D 纹理尺寸
int cudaDeviceProp::maxTexture2DLayered[3] [inherited]: 最大 2D 分层纹理尺寸
int cudaDeviceProp::maxTexture2DLinear[3] [inherited]: 绑定到倾斜内存的 2D 纹理的最大尺寸（宽度、高度、间距）
int cudaDeviceProp::maxTexture2DMipmap[2] [inherited]: 最大 2D Mipmapped 纹理尺寸
int cudaDeviceProp::maxTexture3D[3] [inherited]: 最大 3D 纹理尺寸
int cudaDeviceProp::maxTexture3DAlt[3] [inherited]: 最大备选 3D 纹理尺寸
int cudaDeviceProp::maxTextureCubemap [inherited]: 最大 Cubemap 纹理尺寸
int cudaDeviceProp::maxTextureCubemapLayered[2] [inherited]: 最大 Cubemap 分层纹理尺寸
int cudaDeviceProp::maxThreadsDim[3] [inherited]: 一个块的每个维度的最大尺寸
int cudaDeviceProp::maxThreadsPerBlock [inherited]: 每个块的最大线程数
int cudaDeviceProp::maxThreadsPerMultiProcessor [inherited]: 每个多处理器最大常驻线程数
size_t cudaDeviceProp::memPitch [inherited]: 内存复制允许的最大间距（以字节为单位）
int cudaDeviceProp::memoryBusWidth [inherited]: 全局内存总线宽度（以位为单位）
int cudaDeviceProp::memoryClockRate [inherited]: 已弃用，峰值内存时钟频率（千赫兹）
unsigned int cudaDeviceProp::memoryPoolSupportedHandleTypes [inherited]: 基于 mempool 的 IPC 支持的句柄类型位掩码
int cudaDeviceProp::memoryPoolsSupported [inherited]: 如果设备支持使用 cudaMallocAsync 和 cudaMemPool 系列 API，则为 1，否则为 0
int cudaDeviceProp::minor [inherited]: 次要计算能力
int cudaDeviceProp::multiGpuBoardGroupID [inherited]: 同一多 GPU 板卡上的一组设备的唯一标识符
int cudaDeviceProp::multiProcessorCount [inherited]: 设备上的多处理器数量
char cudaDeviceProp::name[256] [inherited]: 标识设备的 ASCII 字符串
int cudaDeviceProp::pageableMemoryAccess [inherited]: 设备支持一致地访问可分页内存，而无需对其调用 cudaHostRegister
int cudaDeviceProp::pageableMemoryAccessUsesHostPageTables [inherited]: 设备通过主机的页表访问可分页内存
int cudaDeviceProp::pciBusID [inherited]: 设备的 PCI 总线 ID
int cudaDeviceProp::pciDeviceID [inherited]: 设备的 PCI 设备 ID
int cudaDeviceProp::pciDomainID [inherited]: 设备的 PCI 域 ID
int cudaDeviceProp::persistingL2CacheMaxSize [inherited]: 设备的最大 L2 持久化行容量设置（以字节为单位）
int cudaDeviceProp::regsPerBlock [inherited]: 每个块可用的 32 位寄存器
int cudaDeviceProp::regsPerMultiprocessor [inherited]: 每个多处理器可用的 32 位寄存器
int cudaDeviceProp::reserved[63] [inherited]: 保留供将来使用
size_t cudaDeviceProp::reservedSharedMemPerBlock [inherited]: CUDA 驱动程序为每个块保留的共享内存（以字节为单位）
size_t cudaDeviceProp::sharedMemPerBlock [inherited]: 每个块可用的共享内存（以字节为单位）
size_t cudaDeviceProp::sharedMemPerBlockOptin [inherited]: 每个设备每个块可通过特殊选择加入使用的最大共享内存
size_t cudaDeviceProp::sharedMemPerMultiprocessor [inherited]: 每个多处理器可用的共享内存（以字节为单位）
int cudaDeviceProp::singleToDoublePrecisionPerfRatio [inherited]: 已弃用，单精度性能（以每秒浮点运算次数计）与双精度性能之比
int cudaDeviceProp::sparseCudaArraySupported [inherited]: 如果设备支持稀疏 CUDA 数组和稀疏 CUDA Mipmapped 数组，则为 1，否则为 0
int cudaDeviceProp::streamPrioritiesSupported [inherited]: 设备支持流优先级
size_t cudaDeviceProp::surfaceAlignment [inherited]: 表面的对齐要求
int cudaDeviceProp::tccDriver [inherited]: 如果设备是使用 TCC 驱动程序的 Tesla 设备，则为 1，否则为 0
size_t cudaDeviceProp::textureAlignment [inherited]: 纹理的对齐要求
size_t cudaDeviceProp::texturePitchAlignment [inherited]: 绑定到倾斜内存的纹理引用的间距对齐要求
int cudaDeviceProp::timelineSemaphoreInteropSupported [inherited]: 设备上支持外部时间线信号量互操作
size_t cudaDeviceProp::totalConstMem [inherited]: 设备上可用的常量内存（以字节为单位）
size_t cudaDeviceProp::totalGlobalMem [inherited]: 设备上可用的全局内存（以字节为单位）
int cudaDeviceProp::unifiedAddressing [inherited]: 设备与主机共享统一地址空间
int cudaDeviceProp::unifiedFunctionPointers [inherited]: 指示设备支持统一指针
cudaUUID_t cudaDeviceProp::uuid [inherited]: 16 字节唯一标识符
int cudaDeviceProp::warpSize [inherited]: Warp 大小（以线程为单位）