| Product | NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive Cooler (w/o CEC) | NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® H100 GPU Computing Accelerator - 80GB HBM2e - PCIe 5.0 x16 - Passive Cooling (w/o CEC) | AMD Radeon™ AI PRO R9700S - 32GB GDDR6 - PCIe 5.0 x16 - Passive Cooling (4xDP) - 300W | NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX PRO 6000 Blackwell Max-Q Workstation Edition - 96GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Main Specifications | ||||||||
| Product Series | NVIDIA A30 | NVIDIA L4 | NVIDIA L40 | NVIDIA L40S | NVIDIA H100 | |||
| Core Type | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | |||
| Core Clock Speed | 795 MHz Base / 2040 MHz Boost | |||||||
| Host Interface | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 5.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 |
| GPU Architecture | Ampere | Ada Lovelace | Ada Lovelace | Ada Lovelace | Hopper | |||
| Product Type | Workstation | Workstation | Workstation | |||||
| Product Line | Radeon AI PRO R9000 Series | NVIDIA Professional Graphics | NVIDIA Professional Graphics | |||||
| Memory Technology | GDDR6 with ECC (Linux Only) | GDDR6 | GDDR7 | |||||
| Memory Capacity | 32 GB | 48 GB with ECC | 96 GB with ECC | |||||
| Max Displays | 4 Displays | 4 Displays | 4 Displays | |||||
| Detailed Specifications | ||||||||
| Streaming Processor Cores | 18,176 | 4096 | 18,176 | 24,064 CUDA Parallel Processing Cores | ||||
| NVIDIA Tensor Cores | 568 (Gen 4) | 568 | 752 | |||||
| NVIDIA RT Cores | 142 (Gen 3) | 142 | 188 | |||||
| PCIe x16 Interconnect Bandwidth | PCIe Gen5: 128GB/s | |||||||
| Memory Clock Speed | 6251 MHz | |||||||
| Memory Interface | 192-bit | 256-bit | 384-bit | 512-bit | ||||
| Max Memory Size | 24 GB HBM2 | 24 GB | 48 GB GDDR6 with ECC | 48GB GDDR6 with ECC | 80 GB | |||
| Max Memory Bandwidth | 933 GB/s | 300 GB/s | 864 GB/s | 2.0TB/s | ||||
| ECC Protection | On by Default | |||||||
| Peak FP64 | 5.2 teraFLOPS | 24 teraFLOPS | ||||||
| Peak FP64 Tensor Core | 10.3 teraFLOPS | 48 teraFLOPS | ||||||
| INT8 Tensor Core | 330 TOPS (661 TOPS with sparsity) | 485 TOPS (with sparsity) | 733 TOPS | 3,200 TOPS | ||||
| TF32 Tensor Core | 82 teraFLOPS (165 teraFLOPS with sparsity) | 120 TFLOPS (with sparsity) | 183 teraFLOPS | 800 teraFLOPS | ||||
| FP32 | 10.3 teraFLOPS | 30.3 TFLOPS | 91.6 teraFLOPS | 48 teraFLOPS | ||||
| Peak BFLOAT16 Tensor Core | 165 teraFLOPS (330 teraFLOPS with sparsity) | 242 TFLOPS (with sparsity) | 362.05 teraFLOPS | 1,600 teraFLOPS | ||||
| Peak FP16 Tensor Core | 165 teraFLOPS (330 teraFLOPS with sparsity) | 242 TFLOPS (with sparsity) | 362.05 teraFLOPS | 1,600 teraFLOPS | ||||
| Peak FP8 Tensor Core | 485 TFLOPS (with sparsity) | 733 teraFLOPS | 3,200 teraFLOPS | |||||
| Peak INT4 Tensor Core | 661 TOPS (1321 TOPS with sparsity) | 733 TOPS | ||||||
| Total NVLink Bandwidth | Third-gen NVLINK: 200GB/s | Not supported | 600GB/s | |||||
| Multi-Instance GPUs | No | Up to 7 MIGS @ 10GB each | ||||||
| Tensor Performance | 1457.0 TFLOPS | |||||||
| vGPU Software Support | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) | |||||||
| NVENC / NVDEC | 2 / 4, plus 4 JPEG decoders; AV1 encode and decode | 3x / 3x (includes AV1 encode and decode) | 3x / 3x (includes AV1 encode and decode) | |||||
| Secure Boot with Root of Trust | Yes | Yes | Yes | |||||
| NEBS Ready | Yes / Level 3 | Yes / Level 3 | Level 3 | |||||
| Transistor Count | 53.9 Billion | 76.3 billion | 92.2 Billion | |||||
| DisplayPort Connectors | None (vGPU only) | 4x DP 1.4a | 4x DisplayPort 1.4a | |||||
| Cooling | Passive | Passive | Passive | Passive | ||||
| Dual Slot | Dual-slot | No | Yes | Yes | ||||
| Dimensions | 4.4" (H) x 10.5" (L) | 4.4" (H) x 10.5" (L) | 4.4" (H) x 10.5" (L) | 4.4" (H) x 10.5" (L), FHFL Dual Slot | ||||
| Form Factor | 6.61" (L) x 2.71" (H) (Low-profile) | PCIe | PCIe | |||||
| Lithography | 4 nm NVIDIA Custom Process | 4N NVIDIA Custom Process | ||||||
| Supplementary Power Connectors | 1x 8-pin CPU (EPS12V) | 1x 16-pin PCIe CEM5 | 1x 16-pin | 1x 16-pin PCIe CEM5 | 12V-2x6 | 1x PCIe CEM5 16-pin | 1x PCIe CEM5 16-pin | |
| Max Graphics Card Power (W) | 165W | 72W | 300W | 350W | 350W | 300W | 300W | 300W |
| Processor | AMD RDNA™ 4 | NVIDIA Ada Lovelace | NVIDIA Blackwell Architecture | |||||
| Memory Bandwidth | 640 GB/s | 960 GB/s | 1792 GB/s | |||||
| Core Clock Speed | 2350 MHz (Boost Up to 2920 MHz) | |||||||
| Graphics Resolution | Up to: 4x 4096 x 2160 (4K DCI) @ 120Hz with DSC; 2x 6144 x 3456 (6K) 12-bit HDR @ 60Hz uncompressed; 2x 7680 x 4320 (8K) 12-bit HDR @ 60Hz with DSC; 1x 12288 x 6912 (12K) @ 120Hz with DSC | |||||||
| Peak Single Precision FP32 Performance | 47.8 TFLOPS | 91.1 TFLOPS | ||||||
| Peak Half Precision FP16 Performance | 47.8 TFLOPS | |||||||
| RT Core Performance | 210.6 TFLOPS | |||||||
| DisplayPort Output | 4x DP | 4x DP 1.4a | 4x DP 2.1 | |||||
| Minimum Recommended Power, Single Card (W) | 750 | 600 | 600 | |||||
| Minimum Recommended Power, 2-Way (W) | 1200 | 750 | 900 | |||||
| Minimum Recommended Power, 3-Way (W) | 1600 | 850 | 1200 | |||||
| Minimum Recommended Power, 4-Way (W) | 2000 | 1000 | 1600 | |||||
| Thermal Solution | Passive Cooled | Blower Active Fan | Blower Active Fan | |||||
| Slot Height | 2-Slot | 2-Slot | 2-Slot | |||||
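
The "PCIe Gen5: 128GB/s" entry in the PCIe x16 Interconnect Bandwidth row appears consistent with the raw bidirectional rate of an x16 link. The sketch below reproduces that arithmetic. The per-lane transfer rates (32 GT/s for Gen5, 16 GT/s for Gen4) and the 128b/130b line encoding are assumptions taken from the PCIe specifications, not from the table, and the function name `pcie_bandwidth_gb_s` is hypothetical.

```python
# Rough check of the "PCIe Gen5: 128GB/s" figure for an x16 link.
# Assumptions (from the PCIe spec, not from the table above):
#   - Gen5 runs at 32 GT/s per lane, Gen4 at 16 GT/s per lane
#   - both generations use 128b/130b line encoding

def pcie_bandwidth_gb_s(gt_per_s: float, lanes: int,
                        encoding: float = 128 / 130) -> dict:
    """Return raw and encoding-adjusted link bandwidth in GB/s."""
    # One transfer carries one bit per lane; divide by 8 for bytes.
    raw_per_direction = gt_per_s * lanes / 8
    return {
        "raw_per_direction": raw_per_direction,
        "raw_bidirectional": 2 * raw_per_direction,
        # Encoding overhead lowers the usable rate slightly.
        "usable_per_direction": raw_per_direction * encoding,
    }

gen5 = pcie_bandwidth_gb_s(32.0, 16)
gen4 = pcie_bandwidth_gb_s(16.0, 16)

print("Gen5 x16:", gen5)
print("Gen4 x16:", gen4)
```

Under these assumptions, the quoted Gen5 figure counts both directions and ignores encoding and protocol overhead, so sustained one-way transfers to a card will be below half of it. The PCIe 4.0 x16 cards in the comparison get half the Gen5 rate by the same calculation.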