ProductAMD Instinct™ MI100 Accelerator - 32GB HBM2 - PCIe 4.0 x16 - Passive CoolingAMD Instinct™ MI210 Accelerator - 64GB HBM2e - PCIe 4.0 x16 - Passive CoolingNVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive CoolingNVIDIA® H100 NVL GPU Computing Accelerator - 94GB HBM3 - PCIe 5.0 x16 - Passive CoolingNVIDIA® RTX 5000 Ada Generation - 32GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelectSelectSelect
Main Specifications
Product Series AMD InstinctAMD InstinctNvidia L40SNvidia H100 NVL
Core Type NVIDIA TENSORNVIDIA TENSOR
Core Clock Speed 1502 MHz1700 MHz
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 5.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture CDNACDNA2Ada LovelaceHopper
Product Type WorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6GDDR6
Memory Capacity 32 GB GDDR6 ECC48 GB with ECC
Max Displays 4 Displays4 Displays
Detailed Specifications
Streaming Processor Cores 7,680665618,17612,800 CUDA Parallel Processing Cores18,176
Compute Units 120104
NVIDIA Tensor Cores 568 | Gen 4400568
NVIDIA RT Cores 142 | Gen 3100142
PCIe x16 Interconnect Bandwidth PCIe Gen5: 128GB/s
Memory Clock Speed 1.2 GHz1.6 GHz
Memory Interface 4096-bit4096-bit256-bit384-bit
Max Memory Size 32 GB HBM264 GB HBM2e48GB GDDR6 with ECC94 GB
Max Memory Bandwidth Up to 1228.8 GB/sUp to 1638.4 GB/s864 GB/s7.8TB/s
Infinity Fabric™ Links 33
Peak Infinity Fabric™ Link Bandwidth 92 GB/s100 GB/s
Peak FP64 68 teraFLOPs
Peak FP64 Tensor Core 134 teraFLOPs
INT8 Tensor Core 733 teraFLOPS7,916 TOPS
TF32 Tensor Core 183 teraFLOPS1,979 teraFLOPs
FP32 22.6 TFLOPs91.6 teraFLOPS134 teraFLOPs
Peak BFLOAT16 Tensor Core 362.05 teraFLOPS3,958 teraFLOPs
Peak FP16 Tensor Core 362.05 teraFLOPS3,958 teraFLOPs
Peak FP8 Tensor Core 733 teraFLOPS7,916 teraFLOPs
Peak INT4 Tensor Core 733 teraFLOPS
Total NVLink Bandwidth Not supported600GB/s
Multi-Instance GPUs No
Tensor Performance 1044.4 TFLOPS1457.0 TFLOPS
NVENC | NVDEC 3x l 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust Yes
NEBS Ready Level 3
Peak Half Precision (FP16) Performance 184.6 TFLOPs
Peak Single Precision Matrix (FP32) Performance 46.1 TFLOPs45.3 TFLOPs
Peak Double Precision Matrix (FP64) Performance 45.3 TFLOPs
Peak Single Precision (FP32) Performance 23.1 TFLOPs
Peak Double Precision (FP64) Performance 11.5 TFLOPs22.6 TFLOPs
Peak INT4 Performance 184.6 TOPs181 TOPs
Peak INT8 Performance 184.6 TOPs
Peak bfloat16 92.3 TFLOPs181 TFLOPs
ECC Protection Yes (Full-Chip)
Transistor Count 76.3 billion76.3 billion
DisplayPort Connectors 4x DisplayPort 1.4a
OS Support Linux x86_64Linux x86_64
Cooling PassivePassivePassivePassive
Dual Slot yesyesYes
Dimensions 10.5" (267 mm) Board Length10.5" (267 mm) Board Length4.4" (H) x 10.5" (L)4.4" H x 10.5" L4.4" H x 10.5" L
Form Factor Full HeightPCIe
Lithography TSMC 7nm FinFET4 nm NVIDIA Custom Process4 nm NVIDIA Custom Process
Supplementary Power Connectors 2x PCIe 8-pin connectors1x8 pin 12V EPS1x 16-pin1x 16-pin CEM5 PCIe1x PCIe CEM5 16-pin
Max Graphics Card Power (W) 300W300W Peak350W400W250W300W
Processor NVIDIA Ada LovelaceNVIDIA Ada Lovelace
Memory Bandwidth 576 GB/s960 GB/s
Peak Single-Precision Performance 65.3 TFLOPS
Peak Single Precision FP32 Performance 91.1 TFLOPS
NVLink Interconnect Not Supported
RT Core Performance 151.0 TFLOPS210.6 TFLOPS
DisplayPort Output 4x DP 1.4a
Mini DisplayPort Output 4x DP 1.4a
Minimum Recommended Power, Single Card (W) 600
Minimum Recommended Power, 2-Way (W) 750
Minimum Recommended Power, 3-Way (W) 850
Minimum Recommended Power, 4-Way (W) 1000
Thermal Solution Blower Active FanBlower Active Fan
Slot Height 2-Slot2-Slot
ActionSelectSelectSelectSelectSelectSelect