ProductNVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® RTX 4000 Ada Generation - 20GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 4500 Ada Generation - 24GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 5000 Ada Generation - 32GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX PRO 6000 Blackwell Max-Q Workstation Edition - 96GB GDDR7 ECC - PCIe 5.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia L4Nvidia L40S
Core Type NVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 5.0 x16
GPU Architecture Ada LovelaceAda Lovelace
Product Type WorkstationWorkstationWorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6GDDR6GDDR6GDDR6GDDR7
Memory Capacity 20 GB GDDR6 ECC24 GB GDDR6 ECC32 GB GDDR6 ECC48 GB with ECC96 GB with ECC
Max Displays 4 Displays4 Displays4 Displays
Detailed Specifications
Streaming Processor Cores 18,1766144 CUDA Cores7,680 CUDA Parallel Processing Cores12,800 CUDA Parallel Processing Cores18,17624,064 CUDA Parallel Processing Cores
NVIDIA Tensor Cores 568 | Gen 4192240400568752
NVIDIA RT Cores 142 | Gen 34860100142188
Memory Clock Speed 6251 MHz
Memory Interface 192-bit160-bit192-bit256-bit384-bit512-bit
Max Memory Size 24 GB48GB GDDR6 with ECC
Max Memory Bandwidth 300 GB/s864 GB/s
ECC Protection On by Default
INT8 Tensor Core 485 TOPS | Sparsity733 teraFLOPS
TF32 Tensor Core 120 TFLOPS | Sparsity183 teraFLOPS
FP32 30.3 TFLOPS91.6 teraFLOPS
Peak BFLOAT16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS
Peak FP8 Tensor Core 485 TFLOPS | Sparsity733 teraFLOPS
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth Not supported
Multi-Instance GPUs No
Tensor Performance 327.6 TFLOPS637.8 TFLOPS1044.4 TFLOPS1457.0 TFLOPS
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYes
NEBS Ready Yes | Level 3Level 3
Transistor Count 35.8 Billion35.8 billion76.3 billion76.3 billion92.2 Billion
DisplayPort Connectors None | vGPU Only4x DisplayPort 1.4a
Cooling PassivePassive
Dual Slot No
Dimensions 4.4" (H) x 10.5" (L)4.4" (H) x 9.5"(L)4.4" H x 10.5" L4.4" H x 10.5" L4.4" H x 10.5" L4.4” H x 10.5” L, FHFL Dual Slot
Form Factor 6.61” L x 2.71” H (Low-profile)
Lithography 4 nm NVIDIA Custom Process4 nm NVIDIA Custom Process4N NVIDIA Custom Process
Supplementary Power Connectors 1x 16-pin1x 16-pin CEM5 PCIe1x 16-pin CEM5 PCIe1x PCIe CEM5 16-pin1x PCIe CEM5 16-pin
Max Graphics Card Power (W) 72W350W130W210W250W300W300W
Processor NVIDIA Ada LovelaceNVIDIA Ada LovelaceNVIDIA Ada LovelaceNVIDIA Ada LovelaceNVIDIA Blackwell Architecture
Memory Bandwidth 360 GB/s432 GB/s576 GB/s960 GB/s1792 GB/s
Peak Single-Precision Performance 65.3 TFLOPS
Peak Single Precision FP32 Performance 26.7 TFLOPS39.9 TFLOPS91.1 TFLOPS
NVLink Interconnect Not SupportedNot SupportedNot Supported
RT Core Performance 61.8 TFLOPS92.2 TFLOPS151.0 TFLOPS210.6 TFLOPS
DisplayPort Output 4x DP 1.4a4x DP 2.1
Mini DisplayPort Output 4x mDP 1.4a4x DP 1.4a4x DP 1.4a
Minimum Recommended Power, Single Card (W) 600600
Minimum Recommended Power, 2-Way (W) 750900
Minimum Recommended Power, 3-Way (W) 8501200
Minimum Recommended Power, 4-Way (W) 10001600
Thermal Solution Blower Active FanBlower Active FanBlower Active FanBlower Active FanBlower Active Fan
Slot Height Single Slot2-Slot2-Slot2-Slot2-Slot
ActionSelectSelectSelectSelectSelectSelectSelect