ProductNVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® RTX PRO 6000 Blackwell Max-Q Workstation Edition - 96GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A16Nvidia L4Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 5.0 x16
GPU Architecture AmpereAda LovelaceAda Lovelace
Product Type Workstation
Product Line NVIDIA Professional Graphics
Memory Technology GDDR7
Memory Capacity 96 GB with ECC
Max Displays 4 Displays
Detailed Specifications
Streaming Processor Cores 18,17624,064 CUDA Parallel Processing Cores
NVIDIA Tensor Cores 568 | Gen 4752
NVIDIA RT Cores 142 | Gen 3188
Memory Clock Speed 6251 MHz
Memory Interface 192-bit512-bit
Max Memory Size 4x 16GB GDDR6 with error-correcting code (ECC)24 GB48GB GDDR6 with ECC
Max Memory Bandwidth 4x 232GB/s300 GB/s864 GB/s
ECC Protection On by Default
INT8 Tensor Core 485 TOPS | Sparsity733 teraFLOPS
TF32 Tensor Core 120 TFLOPS | Sparsity183 teraFLOPS
FP32 30.3 TFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP8 Tensor Core 485 TFLOPS | Sparsity733 teraFLOPS
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth Not supported
Multi-Instance GPUs No
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYes
NEBS Ready Yes | Level 3Level 3
Transistor Count 92.2 Billion
DisplayPort Connectors None | vGPU Only4x DisplayPort 1.4a
Cooling PassivePassivePassive
Dual Slot Dual-slotNo
Dimensions 4.4" (H) x 10.5" (L)4.4” H x 10.5” L, FHFL Dual Slot
Form Factor 6.61” L x 2.71” H (Low-profile)
Lithography 4N NVIDIA Custom Process
Supplementary Power Connectors 8-pin CPU1x 16-pin1x PCIe CEM5 16-pin
Max Graphics Card Power (W) 250W72W350W300W
Processor NVIDIA Blackwell Architecture
Memory Bandwidth 1792 GB/s
DisplayPort Output 4x DP 2.1
Minimum Recommended Power, Single Card (W) 600
Minimum Recommended Power, 2-Way (W) 900
Minimum Recommended Power, 3-Way (W) 1200
Minimum Recommended Power, 4-Way (W) 1600
Thermal Solution Blower Active Fan
Slot Height 2-Slot
ActionSelectSelectSelectSelect