ProductNVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler (w/o CEC)NVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive CoolerNVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® T1000 8GB GDDR6 - PCIe 3.0 x16 - Active Cooling (4x mDP)NVIDIA® RTX 4000 SFF Ada Generation - 20GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4x mDP)
ActionSelectSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A2Nvidia A16Nvidia L4Nvidia L40Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1440 MHz (1770 MHz Boost Clock)795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x8PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 3.0 x16PCI Express 4.0 x16
GPU Architecture AmpereAmpereAda LovelaceAda LovelaceAda Lovelace
Product Type WorkstationWorkstation
Product Line NVIDIA Professional Graphics
Memory Technology GDDR6GDDR6
Memory Capacity 8 GB20 GB GDDR6 ECC
Max Displays 4 Displays
Detailed Specifications
Streaming Processor Cores 1280 CUDA Cores18,176896 CUDA Parallel-Processing Cores6144 CUDA Cores
NVIDIA Tensor Cores 40 | Gen 3568 | Gen 4192 | Gen 4
NVIDIA RT Cores 10 | Gen 2142 | Gen 348 | Gen 3
Memory Clock Speed 6251 MHz6251 MHz
Memory Interface 128-bit192-bit128-bit160-bit
Max Memory Size 16 GB GDDR6 ECC4x 16GB GDDR6 with error-correcting code (ECC)24 GB48 GB GDDR6 with ECC48GB GDDR6 with ECC
Max Memory Bandwidth 200 GB/s4x 232GB/s300 GB/s864 GB/s
INT8 Tensor Core 485 TOPS | Sparsity733 teraFLOPS
TF32 Tensor Core 9 TFLOPS | 18 TFLOPS Sparsity120 TFLOPS | Sparsity183 teraFLOPS
FP32 4.5 TFLOPS30.3 TFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP16 Tensor Core 18 TFLOPS | 36 TFLOPS Sparsity242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP8 Tensor Core 485 TFLOPS | Sparsity733 teraFLOPS
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth Not supported
Multi-Instance GPUs No
Tensor Performance 306.8 TFLOPS
NVIDIA CUDA™ Technology 11.1 or laterYes
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode3x | 3x (Includes AV1 Encode & Decode)3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYesYes
NEBS Ready Yes | Level 3Yes / Level 3Level 3
Peak INT4 Performance 72 TOPS | 144 TOPS Sparsity
Peak INT8 Performance 36 TOPS | 72 TOPS Sparsity
ECC Protection On by DefaultOn by Default
Transistor Count 35.8 Billion
DisplayPort Connectors None | vGPU Only4x DP 1.4a4x DisplayPort 1.4a
Cooling PassivePassivePassivePassivePassive
Dual Slot Single-slotDual-slotNoYes
Dimensions 6.61” L x 2.71” H4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)2.713” H x 6.137” L2.7” H x 6.6”L
Form Factor Low-Profile PCIe6.61” L x 2.71” H (Low-profile)PCIe
Supplementary Power Connectors 8-pin CPU1x 16-pin PCIe CEM51x 16-pinNo Auxiliary Power Required
Max Graphics Card Power (W) 40-60 W | Configurable250W72W300W350W50W70W
Processor NVIDIA TuringNVIDIA Ada Lovelace
Memory Bandwidth 160 GB/s320 GB/s
API Support CUDA C, CUDA C++, DirectCompute 5.0, OpenCL, Java, Python, and Fortran
Shader Model 5.1 (OpenGL 4.5 and DirectX 12)
Graphics Resolution Max Digital Resolution: 7680 x 4320 at 60 Hz
Peak Single Precision FP32 Performance 2.50 TFLOPS19.2 TFLOPS
RT Core Performance 44.3 TFLOPS
VR Ready Yes
Mini DisplayPort Output x44x mDP 1.4a
Thermal Solution Ultra-quiet active fansinkActive Heatsink
Slot Height Low-Profile Single SlotLow Profile Dual Slot
ActionSelectSelectSelectSelectSelectSelectSelect