ProductAMD Instinct™ MI100 Accelerator - 32GB HBM2 - PCIe 4.0 x16 - Passive CoolingNVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)NVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling
ActionSelectSelectSelectSelectSelect
Main Specifications
Product Series AMD InstinctNvidia A10Nvidia A40Nvidia L40Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1502 MHz885 MHz (1695 MHz Boost Clock)
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16 64GB/sPCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture CDNAAmpereAmpereAda LovelaceAda Lovelace
Detailed Specifications
Streaming Processor Cores 7,68010752 CUDA Cores18,176
Compute Units 120
NVIDIA Tensor Cores 336 Tensor Cores568 | Gen 4
NVIDIA RT Cores 72 RT Cores84 RT Cores142 | Gen 3
Memory Clock Speed 1.2 GHz1563 MHz
Memory Interface 4096-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 32 GB HBM224 GB GDDR648 GB GDDR6 with error-correcting code (ECC)48 GB GDDR6 with ECC48GB GDDR6 with ECC
Max Memory Bandwidth Up to 1228.8 GB/s600 GB/s696 GB/s864 GB/s
Infinity Fabric™ Links 3
Peak Infinity Fabric™ Link Bandwidth 92 GB/s
INT8 Tensor Core 250 TOPS | 500 TOPS733 teraFLOPS
TF32 Tensor Core 62.5 teraFLOPS | 125 teraFLOPS183 teraFLOPS
FP32 31.2 teraFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 125 teraFLOPS | 250 teraFLOPS362.05 teraFLOPS
Peak FP16 Tensor Core 125 teraFLOPS | 250 teraFLOPS362.05 teraFLOPS
Peak FP8 Tensor Core 733 teraFLOPS
Peak INT4 Tensor Core 500 TOPS | 1,000 TOPS733 teraFLOPS
Total NVLink Bandwidth NVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/sNot supported
Multi-Instance GPUs No
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 3x | 3x (Includes AV1 Encode & Decode)3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYes
NEBS Ready Yes / Level 3Level 3
Peak Half Precision (FP16) Performance 184.6 TFLOPs
Peak Single Precision Matrix (FP32) Performance 46.1 TFLOPs
Peak Single Precision (FP32) Performance 23.1 TFLOPs
Peak Double Precision (FP64) Performance 11.5 TFLOPs
Peak INT4 Performance 184.6 TOPs
Peak INT8 Performance 184.6 TOPs
Peak bfloat16 92.3 TFLOPs
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
4x DP 1.4a4x DisplayPort 1.4a
OS Support Linux x86_64
Cooling PassivePassivePassivePassivePassive
Dual Slot yesSingle-slot2-slot Low-profileYes
Dimensions 10.5" (267 mm) Board LengthFHFL4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)
Form Factor PCIe
Lithography TSMC 7nm FinFET8 nmSamsung 8nm
Supplementary Power Connectors 2x PCIe 8-pin connectorsNone1x 8-pin CPU (EPS12V)1x 16-pin PCIe CEM51x 16-pin
Max Graphics Card Power (W) 300W150W300W300W350W
ActionSelectSelectSelectSelectSelect