| | | | | | | | |
| Product | NVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler (w/o CEC) | NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX PRO 4500 Blackwell - 32GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX PRO 5000 Blackwell - 48GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX PRO 6000 Blackwell Workstation Edition - 96GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX PRO 6000 Blackwell Max-Q Workstation Edition - 96GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP) |
| Action | Select | Select | Select | Select | Select | Select | Select | Select |
| Main Specifications |
| Product Series |
Nvidia A2 | Nvidia L4 | Nvidia L40 | | | | | |
| Core Type |
NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | | | | | |
| Core Clock Speed |
1440 MHz (1770 MHz Boost Clock) | 795 MHz Base | 2040 MHz Boost | | | | | | |
| Host Interface |
PCI Express 4.0 x8 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 5.0 x16 | PCI Express 5.0 x16 | PCI Express 5.0 x16 |
| GPU Architecture |
Ampere | Ada Lovelace | Ada Lovelace | | | | | |
| Product Type |
| | | Workstation | Workstation | Workstation | Workstation | Workstation |
| Product Line |
| | | NVIDIA Professional Graphics | NVIDIA Professional Graphics | NVIDIA Professional Graphics | NVIDIA Professional Graphics | NVIDIA Professional Graphics |
| Memory Technology |
| | | GDDR6 | GDDR7 | GDDR7 | GDDR7 | GDDR7 |
| Memory Capacity |
| | | 48 GB with ECC | 32 GB with ECC | 48 GB with ECC | 96 GB with ECC | 96 GB with ECC |
| Max Displays |
| | | 4 Displays | 4 Displays | 4 Displays | 4 Displays | 4 Displays |
| Detailed Specifications |
| Streaming Processor Cores |
1280 CUDA Cores | | | 18,176 | 10,496 CUDA Parallel Processing Cores | 14,080 CUDA Parallel Processing Cores | 24,064 CUDA Parallel Processing Cores | 24,064 CUDA Parallel Processing Cores |
| NVIDIA Tensor Cores |
40 | Gen 3 | | | 568 | 328 | 440 | 752 | 752 |
| NVIDIA RT Cores |
10 | Gen 2 | | | 142 | 82 | 110 | 188 | 188 |
| Memory Clock Speed |
6251 MHz | 6251 MHz | | | | | | |
| Memory Interface |
128-bit | 192-bit | | 384-bit | 256-bit | 384-bit | 512-bit | 512-bit |
| Max Memory Size |
16 GB GDDR6 ECC | 24 GB | 48 GB GDDR6 with ECC | | | | | |
| Max Memory Bandwidth |
200 GB/s | 300 GB/s | | | | | | |
| ECC Protection |
On by Default | On by Default | | | | | | |
| INT8 Tensor Core |
| 485 TOPS | Sparsity | | | | | | |
| TF32 Tensor Core |
9 TFLOPS | 18 TFLOPS Sparsity | 120 TFLOPS | Sparsity | | | | | | |
| FP32 |
4.5 TFLOPS | 30.3 TFLOPS | | | | | | |
| Peak BFLOAT16 Tensor Core |
| 242 TFLOPS | Sparsity | | | | | | |
| Peak FP16 Tensor Core |
18 TFLOPS | 36 TFLOPS Sparsity | 242 TFLOPS | Sparsity | | | | | | |
| Peak FP8 Tensor Core |
| 485 TFLOPS | Sparsity | | | | | | |
| Tensor Performance |
| | | 1457.0 TFLOPS | | | | |
| NVIDIA CUDA™ Technology |
11.1 or later | | | | | | | |
| vGPU Software Support |
| | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) | | | | | |
| NVENC | NVDEC |
| 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode | 3x | 3x (Includes AV1 Encode & Decode) | | | | | |
| Secure Boot with Root of Trust |
| Yes | Yes | | | | | |
| NEBS Ready |
| Yes | Level 3 | Yes / Level 3 | | | | | |
| Peak INT4 Performance |
72 TOPS | 144 TOPS Sparsity | | | | | | | |
| Peak INT8 Performance |
36 TOPS | 72 TOPS Sparsity | | | | | | | |
| Transistor Count |
| | | 76.3 billion | 45.6 Billion | 92.2 Billion | 92.2 Billion | 92.2 Billion |
| DisplayPort Connectors |
| None | vGPU Only | 4x DP 1.4a | | | | | |
| Cooling |
Passive | Passive | Passive | | | | | |
| Dual Slot |
Single-slot | No | Yes | | | | | |
| Dimensions |
6.61” L x 2.71” H | | 4.4" (H) x 10.5" (L) | 4.4" H x 10.5" L | 4.4" H x 10.5" L | 4.4” H x 10.5” L, FHFL Dual Slot | 5.4” H x 12” L, XHFL Dual Slot | 4.4” H x 10.5” L, FHFL Dual Slot |
| Form Factor |
Low-Profile PCIe | 6.61” L x 2.71” H (Low-profile) | PCIe | | | | | |
| Lithography |
| | | 4 nm NVIDIA Custom Process | 4N NVIDIA Custom Process | 4N NVIDIA Custom Process | 4N NVIDIA Custom Process | 4N NVIDIA Custom Process |
| Supplementary Power Connectors |
| | 1x 16-pin PCIe CEM5 | 1x PCIe CEM5 16-pin | 1x PCIe CEM5 16-pin | 1x PCIe CEM5 16-pin | 1x PCIe CEM5 16-pin | 1x PCIe CEM5 16-pin |
| Max Graphics Card Power (W) |
40-60 W | Configurable | 72W | 300W | 300W | 200W | 300W | 600W | 300W |
| Processor |
| | | NVIDIA Ada Lovelace | NVIDIA Blackwell Architecture | NVIDIA Blackwell Architecture | NVIDIA Blackwell Architecture | NVIDIA Blackwell Architecture |
| Memory Bandwidth |
| | | 960 GB/s | 896 GB/s | 1344 GB/s | 1792 GB/s | 1792 GB/s |
| Peak Single-Precision Performance |
| | | | | | 125 TFLOPS | |
| AI Performance |
| | | | | | 4000 AI TOPS2 | |
| Peak Single Precision FP32 Performance |
| | | 91.1 TFLOPS | | | | |
| RT Core Performance |
| | | 210.6 TFLOPS | | | 380 TFLOPS | |
| DisplayPort Output |
| | | 4x DP 1.4a | 4x DP 2.1 | 4x DP 2.1 | 4x DP 2.1 | 4x DP 2.1 |
| Minimum Recommended Power, Single Card (W) |
| | | 600 | 550 | 600 | 800 | 600 |
| Minimum Recommended Power, 2-Way (W) |
| | | 750 | 750 | 750 | 1600 | 900 |
| Minimum Recommended Power, 3-Way (W) |
| | | 850 | 850 | 850 | 2400 | 1200 |
| Minimum Recommended Power, 4-Way (W) |
| | | 1000 | 1000 | 1000 | 3200 | 1600 |
| Thermal Solution |
| | | Blower Active Fan | Blower Active Fan | Blower Active Fan | Double Flow Through | Blower Active Fan |
| Slot Height |
| | | 2-Slot | 2-Slot | 2-Slot | 2-Slot | 2-Slot |
| Action | Select | Select | Select | Select | Select | Select | Select | Select |