ProductAMD Instinct™ MI100 Accelerator - 32GB HBM2 - PCIe 4.0 x16 - Passive CoolingAMD Instinct™ MI210 Accelerator - 64GB HBM2e - PCIe 4.0 x16 - Passive CoolingNVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive CoolerNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® RTX A2000 - 12GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4x mDP)NVIDIA® RTX A4000 - 16GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelectSelectSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series AMD InstinctAMD InstinctNvidia A10Nvidia A30Nvidia A40Nvidia L4Nvidia L40Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1502 MHz1700 MHz885 MHz (1695 MHz Boost Clock)795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16 64GB/sPCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture CDNACDNA2AmpereAmpereAmpereAda LovelaceAda LovelaceAda Lovelace
Product Type WorkstationWorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6 with ECCGDDR6 with ECCGDDR6
Memory Capacity 12 GB GDDR6 with ECC16 GB GDDR6 with ECC48 GB
Max Displays 4 Displays4 Displays
Detailed Specifications
Streaming Processor Cores 7,680665610752 CUDA Cores18,1763328 CUDA Cores6144 CUDA Cores10752 Shading Units
Compute Units 120104
NVIDIA Tensor Cores 336 Tensor Cores568 | Gen 4104192336
NVIDIA RT Cores 72 RT Cores84 RT Cores142 | Gen 3264884
Memory Clock Speed 1.2 GHz1.6 GHz1563 MHz6251 MHz6001 MHz2000 MHz 16 Gbps effective
Memory Interface 4096-bit4096-bit384-bit192-bit192-bit256-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 32 GB HBM264 GB HBM2e24 GB GDDR624 GB HBM248 GB GDDR6 with error-correcting code (ECC)24 GB48 GB GDDR6 with ECC48GB GDDR6 with ECC
Max Memory Bandwidth Up to 1228.8 GB/sUp to 1638.4 GB/s600 GB/s933 GB/s696 GB/s300 GB/s864 GB/s
Infinity Fabric™ Links 33
Peak Infinity Fabric™ Link Bandwidth 92 GB/s100 GB/s
Peak FP64 5.2 teraFLOPS
Peak FP64 Tensor Core 10.3 teraFLOPS
Peak FP32 22.6 TFLOPs31.2 teraFLOPS10.3 teraFLOPS30.3 TFLOPS91.6 teraFLOPS
Peak TF32 Tensor Core 62.5 teraFLOPS | 125 teraFLOPS82 teraFLOPS | 165 teraFLOPS120 TFLOPS | Sparsity183 teraFLOPS
Peak BFLOAT16 Tensor Core 125 teraFLOPS | 250 teraFLOPS165 teraFLOPS | 330 teraFLOPS242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP16 Tensor Core 125 teraFLOPS | 250 teraFLOPS165 teraFLOPS | 330 teraFLOPS242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP8 Tensor Core 485 TFLOPS | Sparsity733 teraFLOPS
Peak INT8 Tensor Core 250 TOPS | 500 TOPS330 TOPS | 661 TOPS485 TOPS | Sparsity733 teraFLOPS
Peak INT4 Tensor Core 500 TOPS | 1,000 TOPS661 TOPS | 1321 TOPS733 teraFLOPS
NVIDIA NVLink™ Interconnect Bandwidth Third-gen NVLINK: 200GB/sNVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/sNot supported
Multi-Instance GPUs No
Tensor Performance 63.9 TFLOPS
NVIDIA CUDA™ Technology YesYes
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode3x | 3x (Includes AV1 Encode & Decode)3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYesYes
NEBS Ready Yes | Level 3Yes / Level 3Level 3
Peak Half Precision (FP16) Performance 184.6 TFLOPs
Peak Single Precision Matrix (FP32) Performance 46.1 TFLOPs45.3 TFLOPs
Peak Double Precision Matrix (FP64) Performance 45.3 TFLOPs
Peak Single Precision (FP32) Performance 23.1 TFLOPs
Peak Double Precision (FP64) Performance 11.5 TFLOPs22.6 TFLOPs
Peak INT4 Performance 184.6 TOPs181 TOPs
Peak INT8 Performance 184.6 TOPs
Peak bfloat16 92.3 TFLOPs181 TFLOPs
ECC Protection Yes (Full-Chip)On by Default
Transistor Count 13.25 Billion17.4 Billion28.3 Billion
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
None | vGPU Only4x DP 1.4a4x DisplayPort 1.4a
OS Support Linux x86_64Linux x86_64
Cooling PassivePassivePassivePassivePassivePassivePassive
Dual Slot yesyesSingle-slotDual-slot2-slot Low-profileNoYes
Dimensions 10.5" (267 mm) Board Length10.5" (267 mm) Board LengthFHFL4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)2.713" H x 6.6" L4.4” H x 9.5” L4.4" (H) x 10.5" (L)
Form Factor Full Height6.61” L x 2.71” H (Low-profile)PCIe
Lithography TSMC 7nm FinFET8 nmSamsung 8nm8nm8nmSamsung 8nm
Supplementary Power Connectors 2x PCIe 8-pin connectors1x8 pin 12V EPSNone1x 8-pin CPU (EPS12V)1x 8-pin CPU (EPS12V)1x 16-pin PCIe CEM51x 16-pin1x 6-pin PCIe1x 8-pin EPS
Max Graphics Card Power (W) 300W300W Peak150W165W300W72W300W350W70W140W300W
Processor AmpereAmpere (GA104)Ampere (GA102)
Memory Bandwidth 288 GB/sec448 GB/sec768 GB/s
Core Clock Speed 1455 MHz Base Clock
1860 MHz Boost Clock
L2 Cache Size 6 MB
API Support CUDA 8.5, OpenCL 2.0
Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate 625 GTexel/s
Graphics Resolution 2x 7680 x 4320 at 60 HzMax Digital Resolution: 7680 x 4320 x36 bpp at 60 Hz7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance 1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance 8.0 TFLOPS38.7 TFLOPS
Peak Half Precision FP16 Performance 40.00 TFLOPS (1:1)
Deep Learning TFLOPS 153.4 TFLOPS
Multi-GPU Scalability NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect 112.5 GB/s (bidirectional)
RT Core Performance 15.6 TLOPS
VR Ready Yes
Vulkan API 1.2
DisplayPort Output 4x DisplayPort 1.4a4x DisplayPort 1.4a
Mini DisplayPort Output 4x mDP Latching
Minimum Recommended Power, Single Card (W) 300W700W
Minimum Recommended Power, 2-Way (W) 500850
Minimum Recommended Power, 3-Way (W) 8501000
Minimum Recommended Power, 4-Way (W) 10001200
Thermal Solution Active HeatsinkActive HeatsinkActive Heatsink
Slot Height 2-SlotSingle Slot2-Slot
ActionSelectSelectSelectSelectSelectSelectSelectSelectSelectSelectSelect