| | | | |
| Product | NVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler | NVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC) | NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive Cooler | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling |
| Action | Select | Select | Select | Select |
| Main Specifications |
| Product Series |
Nvidia A2 | Nvidia A10 | Nvidia A30 | Nvidia L40 |
| Core Type |
NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR |
| Core Clock Speed |
1440 MHz (1770 MHz Boost Clock) | 885 MHz (1695 MHz Boost Clock) | | |
| Host Interface |
PCI Express 4.0 x8 | PCI Express 4.0 x16 64GB/s | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
| GPU Architecture |
Ampere | Ampere | Ampere | Ada Lovelace |
| Detailed Specifications |
| Streaming Processor Cores |
1280 CUDA Cores | | | |
| NVIDIA Tensor Cores |
40 | Gen 3 | | | |
| NVIDIA RT Cores |
10 | Gen 2 | 72 RT Cores | | |
| Memory Clock Speed |
6251 MHz | 1563 MHz | | |
| Memory Interface |
128-bit | | | |
| Max Memory Size |
16 GB GDDR6 ECC | 24 GB GDDR6 | 24 GB HBM2 | 48 GB GDDR6 with ECC |
| Max Memory Bandwidth |
200 GB/s | 600 GB/s | 933 GB/s | |
| ECC Protection |
On by Default | | | |
| Peak FP64 |
| | 5.2 teraFLOPS | |
| Peak FP64 Tensor Core |
| | 10.3 teraFLOPS | |
| INT8 Tensor Core |
| 250 TOPS | 500 TOPS | 330 TOPS | 661 TOPS | |
| TF32 Tensor Core |
9 TFLOPS | 18 TFLOPS Sparsity | 62.5 teraFLOPS | 125 teraFLOPS | 82 teraFLOPS | 165 teraFLOPS | |
| FP32 |
4.5 TFLOPS | 31.2 teraFLOPS | 10.3 teraFLOPS | |
| Peak BFLOAT16 Tensor Core |
| 125 teraFLOPS | 250 teraFLOPS | 165 teraFLOPS | 330 teraFLOPS | |
| Peak FP16 Tensor Core |
18 TFLOPS | 36 TFLOPS Sparsity | 125 teraFLOPS | 250 teraFLOPS | 165 teraFLOPS | 330 teraFLOPS | |
| Peak INT4 Tensor Core |
| 500 TOPS | 1,000 TOPS | 661 TOPS | 1321 TOPS | |
| Total NVLink Bandwidth |
| | Third-gen NVLINK: 200GB/s | |
| NVIDIA CUDA™ Technology |
11.1 or later | | | |
| vGPU Software Support |
| | | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) |
| NVENC | NVDEC |
| | | 3x | 3x (Includes AV1 Encode & Decode) |
| Secure Boot with Root of Trust |
| | | Yes |
| NEBS Ready |
| | | Yes / Level 3 |
| Peak INT4 Performance |
72 TOPS | 144 TOPS Sparsity | | | |
| Peak INT8 Performance |
36 TOPS | 72 TOPS Sparsity | | | |
| DisplayPort Connectors |
| | | 4x DP 1.4a |
| Cooling |
Passive | Passive | | Passive |
| Dual Slot |
Single-slot | Single-slot | Dual-slot | Yes |
| Dimensions |
6.61” L x 2.71” H | FHFL | | 4.4" (H) x 10.5" (L) |
| Form Factor |
Low-Profile PCIe | | | PCIe |
| Lithography |
| 8 nm | | |
| Supplementary Power Connectors |
| None | 1x 8-pin CPU (EPS12V) | 1x 16-pin PCIe CEM5 |
| Max Graphics Card Power (W) |
40-60 W | Configurable | 150W | 165W | 300W |
| Action | Select | Select | Select | Select |