ProductNVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive CoolerNVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)NVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling
ActionSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A2Nvidia A10Nvidia A16Nvidia A40Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1440 MHz (1770 MHz Boost Clock)885 MHz (1695 MHz Boost Clock)
Host Interface PCI Express 4.0 x8PCI Express 4.0 x16 64GB/sPCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture AmpereAmpereAmpereAmpereAda Lovelace
Detailed Specifications
Streaming Processor Cores 1280 CUDA Cores10752 CUDA Cores18,176
NVIDIA Tensor Cores 40 | Gen 3336 Tensor Cores568 | Gen 4
NVIDIA RT Cores 10 | Gen 272 RT Cores84 RT Cores142 | Gen 3
Memory Clock Speed 6251 MHz1563 MHz
Memory Interface 128-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 16 GB GDDR6 ECC24 GB GDDR64x 16GB GDDR6 with error-correcting code (ECC)48 GB GDDR6 with error-correcting code (ECC)48GB GDDR6 with ECC
Max Memory Bandwidth 200 GB/s600 GB/s4x 232GB/s696 GB/s864 GB/s
INT8 Tensor Core 250 TOPS | 500 TOPS733 teraFLOPS
TF32 Tensor Core 9 TFLOPS | 18 TFLOPS Sparsity62.5 teraFLOPS | 125 teraFLOPS183 teraFLOPS
FP32 4.5 TFLOPS31.2 teraFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 125 teraFLOPS | 250 teraFLOPS362.05 teraFLOPS
Peak FP16 Tensor Core 18 TFLOPS | 36 TFLOPS Sparsity125 teraFLOPS | 250 teraFLOPS362.05 teraFLOPS
Peak FP8 Tensor Core 733 teraFLOPS
Peak INT4 Tensor Core 500 TOPS | 1,000 TOPS733 teraFLOPS
Total NVLink Bandwidth NVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/sNot supported
Multi-Instance GPUs No
NVIDIA CUDA™ Technology 11.1 or later
NVENC | NVDEC 3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust Yes
NEBS Ready Level 3
Peak INT4 Performance 72 TOPS | 144 TOPS Sparsity
Peak INT8 Performance 36 TOPS | 72 TOPS Sparsity
ECC Protection On by Default
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
4x DisplayPort 1.4a
Cooling PassivePassivePassivePassivePassive
Dual Slot Single-slotSingle-slotDual-slot2-slot Low-profile
Dimensions 6.61” L x 2.71” HFHFL4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)
Form Factor Low-Profile PCIe
Lithography 8 nmSamsung 8nm
Supplementary Power Connectors None8-pin CPU1x 8-pin CPU (EPS12V)1x 16-pin
Max Graphics Card Power (W) 40-60 W | Configurable150W250W300W350W
ActionSelectSelectSelectSelectSelect