ProductNVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling
ActionSelectSelect
Main Specifications
Product Series Nvidia A30Nvidia A40
Core Type NVIDIA TENSORNVIDIA TENSOR
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture AmpereAmpere
Detailed Specifications
Streaming Processor Cores 10752 CUDA Cores
NVIDIA Tensor Cores 336 Tensor Cores
NVIDIA RT Cores 84 RT Cores
Memory Interface 384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 24 GB HBM248 GB GDDR6 with error-correcting code (ECC)
Max Memory Bandwidth 933 GB/s696 GB/s
Peak FP64 5.2 teraFLOPS
Peak FP64 Tensor Core 10.3 teraFLOPS
INT8 Tensor Core 330 TOPS | 661 TOPS
TF32 Tensor Core 82 teraFLOPS | 165 teraFLOPS
FP32 10.3 teraFLOPS
Peak BFLOAT16 Tensor Core 165 teraFLOPS | 330 teraFLOPS
Peak FP16 Tensor Core 165 teraFLOPS | 330 teraFLOPS
Peak INT4 Tensor Core 661 TOPS | 1321 TOPS
Total NVLink Bandwidth Third-gen NVLINK: 200GB/sNVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/s
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
Cooling Passive
Dual Slot Dual-slot2-slot Low-profile
Dimensions 4.4" (H) x 10.5" (L)
Lithography Samsung 8nm
Supplementary Power Connectors 1x 8-pin CPU (EPS12V)1x 8-pin CPU (EPS12V)
Max Graphics Card Power (W) 165W300W
ActionSelectSelect