ProductNVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler (w/o CEC)NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® H100 GPU Computing Accelerator - 80GB HBM2e - PCIe 5.0 x16 - Passive Cooling (w/o CEC)NVIDIA® RTX A1000 - 8GB GDDR6 - PCIe 4.0 x8 - Active Cooling (4x mDP)
ActionSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A2Nvidia L4Nvidia L40Nvidia L40SNvidia H100
Core Type NVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1440 MHz (1770 MHz Boost Clock)795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x8PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 5.0 x16PCI Express 4.0 x8
GPU Architecture AmpereAda LovelaceAda LovelaceAda LovelaceHopper
Product Type Workstation
Product Line NVIDIA Professional Graphics
Memory Technology GDDR6
Memory Capacity 8 GB
Max Displays 4 Displays
Detailed Specifications
Streaming Processor Cores 1280 CUDA Cores18,1762307 CUDA Cores
NVIDIA Tensor Cores 40 | Gen 3568 | Gen 472
NVIDIA RT Cores 10 | Gen 2142 | Gen 318
PCIe x16 Interconnect Bandwidth PCIe Gen5: 128GB/s
Memory Clock Speed 6251 MHz6251 MHz
Memory Interface 128-bit192-bit128-bit
Max Memory Size 16 GB GDDR6 ECC24 GB48 GB GDDR6 with ECC48GB GDDR6 with ECC80 GB
Max Memory Bandwidth 200 GB/s300 GB/s864 GB/s2.0TB/s
ECC Protection On by DefaultOn by Default
Peak FP64 24 teraFLOPS
Peak FP64 Tensor Core 48 teraFLOPS
INT8 Tensor Core 485 TOPS | Sparsity733 teraFLOPS3,200 TOPS
TF32 Tensor Core 9 TFLOPS | 18 TFLOPS Sparsity120 TFLOPS | Sparsity183 teraFLOPS800 teraFLOPS
FP32 4.5 TFLOPS30.3 TFLOPS91.6 teraFLOPS48 teraFLOPS
Peak BFLOAT16 Tensor Core 242 TFLOPS | Sparsity362.05 teraFLOPS1,600 teraFLOPS
Peak FP16 Tensor Core 18 TFLOPS | 36 TFLOPS Sparsity242 TFLOPS | Sparsity362.05 teraFLOPS1,600 teraFLOPS
Peak FP8 Tensor Core 485 TFLOPS | Sparsity733 teraFLOPS3,200 teraFLOPS
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth Not supported600GB/s
Multi-Instance GPUs NoUp to 7 MIGS @ 10GB each
Tensor Performance 13.2 TFLOPS
NVIDIA CUDA™ Technology 11.1 or later
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode3x | 3x (Includes AV1 Encode & Decode)3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust YesYesYes
NEBS Ready Yes | Level 3Yes / Level 3Level 3
Peak INT4 Performance 72 TOPS | 144 TOPS Sparsity
Peak INT8 Performance 36 TOPS | 72 TOPS Sparsity
Transistor Count 8.7 Billion
DisplayPort Connectors None | vGPU Only4x DP 1.4a4x DisplayPort 1.4a
Cooling PassivePassivePassivePassivePassive
Dual Slot Single-slotNoYesYes
Dimensions 6.61” L x 2.71” H4.4" (H) x 10.5" (L)4.4" (H) x 10.5" (L)2.7" H x 6.4" L
Form Factor Low-Profile PCIe6.61” L x 2.71” H (Low-profile)PCIePCIe
Lithography 8N | NVIDIA Custom Process
Supplementary Power Connectors 1x 16-pin PCIe CEM51x 16-pin1x 16-pin PCIe CEM5
Max Graphics Card Power (W) 40-60 W | Configurable72W300W350W350W50W
Processor Ampere
Memory Bandwidth 192 GB/sec
Peak Single Precision FP32 Performance 6.74 TFLOPS
NVLink Interconnect Not Supported
Mini DisplayPort Output 4x mDisplayPort 1.4a
Thermal Solution Active Fan
Slot Height Single Slot
ActionSelectSelectSelectSelectSelectSelect