| Product | NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive Cooler | NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® H100 GPU Computing Accelerator - 80GB HBM2e - PCIe 5.0 x16 - Passive Cooling (w/o CEC) | NVIDIA® RTX A1000 - 8GB GDDR6 - PCIe 4.0 x8 - Active Cooling (4x mDP) |
| --- | --- | --- | --- | --- | --- |
| Main Specifications | | | | | |
| Product Series | NVIDIA A30 | NVIDIA L4 | NVIDIA L40 | NVIDIA H100 | |
| Core Type | NVIDIA Tensor | NVIDIA Tensor | NVIDIA Tensor | NVIDIA Tensor | |
| Core Clock Speed | | 795 MHz Base / 2040 MHz Boost | | | |
| Host Interface | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 4.0 x8 |
| GPU Architecture | Ampere | Ada Lovelace | Ada Lovelace | Hopper | |
| Product Type | | | | | Workstation |
| Product Line | | | | | NVIDIA Professional Graphics |
| Memory Technology | | | | | GDDR6 |
| Memory Capacity | | | | | 8 GB |
| Max Displays | | | | | 4 Displays |
| Detailed Specifications | | | | | |
| Streaming Processor Cores | | | | | 2304 CUDA Cores |
| NVIDIA Tensor Cores | | | | | 72 |
| NVIDIA RT Cores | | | | | 18 |
| PCIe x16 Interconnect Bandwidth | | | | PCIe Gen5: 128 GB/s | |
| Memory Clock Speed | | 6251 MHz | | | |
| Memory Interface | | 192-bit | | | 128-bit |
| Max Memory Size | 24 GB HBM2 | 24 GB | 48 GB GDDR6 with ECC | 80 GB | |
| Max Memory Bandwidth | 933 GB/s | 300 GB/s | | 2.0 TB/s | |
| ECC Protection | | On by Default | | | |
| Peak FP64 | 5.2 teraFLOPS | | | 24 teraFLOPS | |
| Peak FP64 Tensor Core | 10.3 teraFLOPS | | | 48 teraFLOPS | |
| INT8 Tensor Core | 330 TOPS / 661 TOPS | 485 TOPS (sparsity) | | 3,200 TOPS | |
| TF32 Tensor Core | 82 teraFLOPS / 165 teraFLOPS | 120 TFLOPS (sparsity) | | 800 teraFLOPS | |
| FP32 | 10.3 teraFLOPS | 30.3 TFLOPS | | 48 teraFLOPS | |
| Peak BFLOAT16 Tensor Core | 165 teraFLOPS / 330 teraFLOPS | 242 TFLOPS (sparsity) | | 1,600 teraFLOPS | |
| Peak FP16 Tensor Core | 165 teraFLOPS / 330 teraFLOPS | 242 TFLOPS (sparsity) | | 1,600 teraFLOPS | |
| Peak FP8 Tensor Core | | 485 TFLOPS (sparsity) | | 3,200 teraFLOPS | |
| Peak INT4 Tensor Core | 661 TOPS / 1321 TOPS | | | | |
| Total NVLink Bandwidth | Third-gen NVLink: 200 GB/s | | | 600 GB/s | |
| Multi-Instance GPUs | | | | Up to 7 MIGs @ 10 GB each | |
| Tensor Performance | | | | | 13.2 TFLOPS |
| vGPU Software Support | | | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) | | |
| NVENC / NVDEC | | 2 / 4 / 4 JPEG Decoders; AV1 Encode and Decode | 3x / 3x (Includes AV1 Encode & Decode) | | |
| Secure Boot with Root of Trust | | Yes | Yes | | |
| NEBS Ready | | Yes / Level 3 | Yes / Level 3 | | |
| Transistor Count | | | | | 8.7 Billion |
| DisplayPort Connectors | | None (vGPU only) | 4x DP 1.4a | | |
| Cooling | | Passive | Passive | Passive | |
| Dual Slot | Dual-slot | No | Yes | Yes | |
| Dimensions | | | 4.4" (H) x 10.5" (L) | | 2.7" H x 6.4" L |
| Form Factor | | 6.61" L x 2.71" H (Low-profile) | PCIe | PCIe | |
| Lithography | | | | | 8N (NVIDIA Custom Process) |
| Supplementary Power Connectors | 1x 8-pin CPU (EPS12V) | | 1x 16-pin PCIe CEM5 | 1x 16-pin PCIe CEM5 | |
| Max Graphics Card Power (W) | 165 W | 72 W | 300 W | 350 W | 50 W |
| Processor | | | | | Ampere |
| Memory Bandwidth | | | | | 192 GB/s |
| Peak Single Precision FP32 Performance | | | | | 6.74 TFLOPS |
| NVLink Interconnect | | | | | Not Supported |
| Mini DisplayPort Output | | | | | 4x mDisplayPort 1.4a |
| Thermal Solution | | | | | Active Fan |
| Slot Height | | | | | Single Slot |