ProductNVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive CoolerNVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 5000 Ada Generation - 32GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A2Nvidia A16Nvidia A40Nvidia L40Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1440 MHz (1770 MHz Boost Clock)
Host Interface PCI Express 4.0 x8PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture AmpereAmpereAmpereAda LovelaceAda Lovelace
Product Type WorkstationWorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6GDDR6GDDR6
Memory Capacity 48 GB32 GB GDDR6 ECC48 GB with ECC
Max Displays 4 Displays4 Displays
Detailed Specifications
Streaming Processor Cores 1280 CUDA Cores10752 CUDA Cores18,17610752 Shading Units12,800 CUDA Parallel Processing Cores18,176
NVIDIA Tensor Cores 40 | Gen 3336 Tensor Cores568 | Gen 4336400568
NVIDIA RT Cores 10 | Gen 284 RT Cores142 | Gen 384100142
Memory Clock Speed 6251 MHz2000 MHz 16 Gbps effective
Memory Interface 128-bit384-bit384-bit256-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 16 GB GDDR6 ECC4x 16GB GDDR6 with error-correcting code (ECC)48 GB GDDR6 with error-correcting code (ECC)48 GB GDDR6 with ECC48GB GDDR6 with ECC
Max Memory Bandwidth 200 GB/s4x 232GB/s696 GB/s864 GB/s
INT8 Tensor Core 733 teraFLOPS
TF32 Tensor Core 9 TFLOPS | 18 TFLOPS Sparsity183 teraFLOPS
FP32 4.5 TFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 362.05 teraFLOPS
Peak FP16 Tensor Core 18 TFLOPS | 36 TFLOPS Sparsity362.05 teraFLOPS
Peak FP8 Tensor Core 733 teraFLOPS
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth NVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/sNot supported
Multi-Instance GPUs No
Tensor Performance 1044.4 TFLOPS1457.0 TFLOPS
NVIDIA CUDA™ Technology 11.1 or laterYes
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 3x | 3x (Includes AV1 Encode & Decode)3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYes
NEBS Ready Yes / Level 3Level 3
Peak INT4 Performance 72 TOPS | 144 TOPS Sparsity
Peak INT8 Performance 36 TOPS | 72 TOPS Sparsity
ECC Protection On by Default
Transistor Count 28.3 Billion76.3 billion76.3 billion
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
4x DP 1.4a4x DisplayPort 1.4a
Cooling PassivePassivePassivePassivePassive
Dual Slot Single-slotDual-slot2-slot Low-profileYes
Dimensions 6.61” L x 2.71” H4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)4.4" H x 10.5" L4.4" H x 10.5" L
Form Factor Low-Profile PCIePCIe
Lithography Samsung 8nmSamsung 8nm4 nm NVIDIA Custom Process4 nm NVIDIA Custom Process
Supplementary Power Connectors 8-pin CPU1x 8-pin CPU (EPS12V)1x 16-pin PCIe CEM51x 16-pin1x 8-pin EPS1x 16-pin CEM5 PCIe1x PCIe CEM5 16-pin
Max Graphics Card Power (W) 40-60 W | Configurable250W300W300W350W300W250W300W
Processor Ampere (GA102)NVIDIA Ada LovelaceNVIDIA Ada Lovelace
Memory Bandwidth 768 GB/s576 GB/s960 GB/s
Core Clock Speed 1455 MHz Base Clock
1860 MHz Boost Clock
L2 Cache Size 6 MB
Peak Single-Precision Performance 65.3 TFLOPS
API Support CUDA 8.5, OpenCL 2.0
Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate 625 GTexel/s
Graphics Resolution 7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance 1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance 38.7 TFLOPS91.1 TFLOPS
Peak Half Precision FP16 Performance 40.00 TFLOPS (1:1)
Multi-GPU Scalability NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect 112.5 GB/s (bidirectional)Not Supported
RT Core Performance 151.0 TFLOPS210.6 TFLOPS
VR Ready Yes
Vulkan API 1.2
DisplayPort Output 4x DisplayPort 1.4a4x DP 1.4a
Mini DisplayPort Output 4x DP 1.4a
Minimum Recommended Power, Single Card (W) 700W600
Minimum Recommended Power, 2-Way (W) 850750
Minimum Recommended Power, 3-Way (W) 1000850
Minimum Recommended Power, 4-Way (W) 12001000
Thermal Solution Active HeatsinkBlower Active FanBlower Active Fan
Slot Height 2-Slot2-Slot2-Slot
ActionSelectSelectSelectSelectSelectSelectSelectSelect