ProductNVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler (w/o CEC)NVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)NVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling (w/o CEC)NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® RTX A2000 - 12GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4x mDP)NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A2Nvidia A10Nvidia A16Nvidia A30Nvidia A40Nvidia L40
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1440 MHz (1770 MHz Boost Clock)885 MHz (1695 MHz Boost Clock)
Host Interface PCI Express 4.0 x8PCI Express 4.0 x16 64GB/sPCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture AmpereAmpereAmpereAmpereAmpereAda Lovelace
Product Type WorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6 with ECCGDDR6
Memory Capacity 12 GB GDDR6 with ECC48 GB
Max Displays 4 Displays
Detailed Specifications
Streaming Processor Cores 1280 CUDA Cores10752 CUDA Cores3328 CUDA Cores10752 Shading Units
NVIDIA Tensor Cores 40 | Gen 3336 Tensor Cores104336
NVIDIA RT Cores 10 | Gen 272 RT Cores84 RT Cores2684
Memory Clock Speed 6251 MHz1563 MHz6001 MHz2000 MHz 16 Gbps effective
Memory Interface 128-bit384-bit192-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 16 GB GDDR6 ECC24 GB GDDR64x 16GB GDDR6 with error-correcting code (ECC)24 GB HBM248 GB GDDR6 with error-correcting code (ECC)48 GB GDDR6 with ECC
Max Memory Bandwidth 200 GB/s600 GB/s4x 232GB/s933 GB/s696 GB/s
Peak FP64 5.2 teraFLOPS
Peak FP64 Tensor Core 10.3 teraFLOPS
INT8 Tensor Core 250 TOPS | 500 TOPS330 TOPS | 661 TOPS
TF32 Tensor Core 9 TFLOPS | 18 TFLOPS Sparsity62.5 teraFLOPS | 125 teraFLOPS82 teraFLOPS | 165 teraFLOPS
FP32 4.5 TFLOPS31.2 teraFLOPS10.3 teraFLOPS
Peak BFLOAT16 Tensor Core 125 teraFLOPS | 250 teraFLOPS165 teraFLOPS | 330 teraFLOPS
Peak FP16 Tensor Core 18 TFLOPS | 36 TFLOPS Sparsity125 teraFLOPS | 250 teraFLOPS165 teraFLOPS | 330 teraFLOPS
Peak INT4 Tensor Core 500 TOPS | 1,000 TOPS661 TOPS | 1321 TOPS
Total NVLink Bandwidth Third-gen NVLINK: 200GB/sNVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/s
Tensor Performance 63.9 TFLOPS
NVIDIA CUDA™ Technology 11.1 or laterYes
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 3x | 3x (Includes AV1 Encode & Decode)
Secure Boot with Root of Trust Yes
NEBS Ready Yes / Level 3
Peak INT4 Performance 72 TOPS | 144 TOPS Sparsity
Peak INT8 Performance 36 TOPS | 72 TOPS Sparsity
ECC Protection On by Default
Transistor Count 13.25 Billion28.3 Billion
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
4x DP 1.4a
Cooling PassivePassivePassivePassivePassive
Dual Slot Single-slotSingle-slotDual-slotDual-slot2-slot Low-profileYes
Dimensions 6.61” L x 2.71” HFHFL4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)2.713" H x 6.6" L4.4" (H) x 10.5" (L)
Form Factor Low-Profile PCIePCIe
Lithography 8 nmSamsung 8nm8nmSamsung 8nm
Supplementary Power Connectors None8-pin CPU1x 8-pin CPU (EPS12V)1x 8-pin CPU (EPS12V)1x 16-pin PCIe CEM51x 8-pin EPS
Max Graphics Card Power (W) 40-60 W | Configurable150W250W165W300W300W70W300W
Processor AmpereAmpere (GA102)
Memory Bandwidth 288 GB/sec768 GB/s
Core Clock Speed 1455 MHz Base Clock
1860 MHz Boost Clock
L2 Cache Size 6 MB
API Support CUDA 8.5, OpenCL 2.0
Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate 625 GTexel/s
Graphics Resolution 2x 7680 x 4320 at 60 Hz7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance 1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance 8.0 TFLOPS38.7 TFLOPS
Peak Half Precision FP16 Performance 40.00 TFLOPS (1:1)
Multi-GPU Scalability NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect 112.5 GB/s (bidirectional)
RT Core Performance 15.6 TLOPS
VR Ready Yes
Vulkan API 1.2
DisplayPort Output 4x DisplayPort 1.4a
Mini DisplayPort Output 4x mDP Latching
Minimum Recommended Power, Single Card (W) 700W
Minimum Recommended Power, 2-Way (W) 850
Minimum Recommended Power, 3-Way (W) 1000
Minimum Recommended Power, 4-Way (W) 1200
Thermal Solution Active HeatsinkActive Heatsink
Slot Height 2-Slot2-Slot
ActionSelectSelectSelectSelectSelectSelectSelectSelect