Product AMD Instinct™ MI210 Accelerator - 64GB HBM2e - PCIe 4.0 x16 - Passive Cooling; NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive Cooler (w/o CEC); NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive Cooling; NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling; NVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling; NVIDIA® H100 NVL GPU Computing Accelerator - 94GB HBM3 - PCIe 5.0 x16 - Passive Cooling; NVIDIA® RTX A4000 - 16GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP); NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP); NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP)
Main Specifications
Product Series AMD Instinct; Nvidia A30; Nvidia L4; Nvidia L40; Nvidia L40S; Nvidia H100 NVL
Core Type NVIDIA TENSOR; NVIDIA TENSOR; NVIDIA TENSOR; NVIDIA TENSOR; NVIDIA TENSOR
Core Clock Speed 1700 MHz; 795 MHz Base | 2040 MHz Boost
Host Interface PCI Express 4.0 x16; PCI Express 4.0 x16; PCI Express 4.0 x16; PCI Express 4.0 x16; PCI Express 4.0 x16; PCI Express 5.0 x16; PCI Express 4.0 x16; PCI Express 4.0 x16; PCI Express 4.0 x16
GPU Architecture CDNA2; Ampere; Ada Lovelace; Ada Lovelace; Ada Lovelace; Hopper
Product Type Workstation; Workstation; Workstation
Product Line NVIDIA Professional Graphics; NVIDIA Professional Graphics; NVIDIA Professional Graphics
Memory Technology GDDR6 with ECC; GDDR6; GDDR6
Memory Capacity 16 GB GDDR6 with ECC; 48 GB; 48 GB with ECC
Max Displays 4 Displays; 4 Displays
Detailed Specifications
Streaming Processor Cores 6656; 18,176; 6144 CUDA Cores; 10752 Shading Units; 18,176
Compute Units 104
NVIDIA Tensor Cores 568 | Gen 4; 192; 336; 568
NVIDIA RT Cores 142 | Gen 3; 48; 84; 142
PCIe x16 Interconnect Bandwidth PCIe Gen5: 128GB/s
Memory Clock Speed 1.6 GHz; 6251 MHz; 2000 MHz 16 Gbps effective
Memory Interface 4096-bit; 192-bit; 256-bit; 384-bit; 384-bit
Max Memory Size 64 GB HBM2e; 24 GB HBM2; 24 GB; 48 GB GDDR6 with ECC; 48GB GDDR6 with ECC; 94 GB
Max Memory Bandwidth Up to 1638.4 GB/s; 933 GB/s; 300 GB/s; 864 GB/s; 7.8TB/s
Infinity Fabric™ Links 3
Peak Infinity Fabric™ Link Bandwidth 100 GB/s
Peak FP64 5.2 teraFLOPS; 68 teraFLOPs
Peak FP64 Tensor Core 10.3 teraFLOPS; 134 teraFLOPs
INT8 Tensor Core 330 TOPS | 661 TOPS; 485 TOPS | Sparsity; 733 teraFLOPS; 7,916 TOPS
TF32 Tensor Core 82 teraFLOPS | 165 teraFLOPS; 120 TFLOPS | Sparsity; 183 teraFLOPS; 1,979 teraFLOPs
FP32 22.6 TFLOPs; 10.3 teraFLOPS; 30.3 TFLOPS; 91.6 teraFLOPS; 134 teraFLOPs
Peak BFLOAT16 Tensor Core 165 teraFLOPS | 330 teraFLOPS; 242 TFLOPS | Sparsity; 362.05 teraFLOPS; 3,958 teraFLOPs
Peak FP16 Tensor Core 165 teraFLOPS | 330 teraFLOPS; 242 TFLOPS | Sparsity; 362.05 teraFLOPS; 3,958 teraFLOPs
Peak FP8 Tensor Core 485 TFLOPS | Sparsity; 733 teraFLOPS; 7,916 teraFLOPs
Peak INT4 Tensor Core 661 TOPS | 1321 TOPS; 733 teraFLOPS
Total NVLink Bandwidth Third-gen NVLINK: 200GB/s; Not supported; 600GB/s
Multi-Instance GPUs No
Tensor Performance 1457.0 TFLOPS
NVIDIA CUDA™ Technology Yes; Yes
vGPU Software Support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS)
NVENC | NVDEC 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode; 3x | 3x (Includes AV1 Encode & Decode); 3x | 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust Yes; Yes; Yes
NEBS Ready Yes | Level 3; Yes / Level 3; Level 3
Peak Single Precision Matrix (FP32) Performance 45.3 TFLOPs
Peak Double Precision Matrix (FP64) Performance 45.3 TFLOPs
Peak Double Precision (FP64) Performance 22.6 TFLOPs
Peak INT4 Performance 181 TOPs
Peak bfloat16 181 TFLOPs
ECC Protection Yes (Full-Chip); On by Default
Transistor Count 17.4 Billion; 28.3 Billion; 76.3 billion
DisplayPort Connectors None | vGPU Only; 4x DP 1.4a; 4x DisplayPort 1.4a
OS Support Linux x86_64
Cooling Passive; Passive; Passive; Passive; Passive
Dual Slot yes; Dual-slot; No; Yes; Yes
Dimensions 10.5" (267 mm) Board Length; 4.4" (H) x 10.5" (L); 4.4" (H) x 10.5" (L); 4.4" H x 9.5" L; 4.4" (H) x 10.5" (L); 4.4" H x 10.5" L
Form Factor Full Height; 6.61" L x 2.71" H (Low-profile); PCIe; PCIe
Lithography 8nm; Samsung 8nm; 4 nm NVIDIA Custom Process
Supplementary Power Connectors 1x8 pin 12V EPS; 1x 8-pin CPU (EPS12V); 1x 16-pin PCIe CEM5; 1x 16-pin; 1x 6-pin PCIe; 1x 8-pin EPS; 1x PCIe CEM5 16-pin
Max Graphics Card Power (W) 300W Peak; 165W; 72W; 300W; 350W; 400W; 140W; 300W; 300W
Processor Ampere (GA104); Ampere (GA102); NVIDIA Ada Lovelace
Memory Bandwidth 448 GB/sec; 768 GB/s; 960 GB/s
Core Clock Speed 1455 MHz Base Clock | 1860 MHz Boost Clock
L2 Cache Size 6 MB
API Support CUDA 8.5, OpenCL 2.0
Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate 625 GTexel/s
Graphics Resolution Max Digital Resolution: 7680 x 4320 x36 bpp at 60 Hz; 7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance 1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance 38.7 TFLOPS; 91.1 TFLOPS
Peak Half Precision FP16 Performance 40.00 TFLOPS (1:1)
Deep Learning TFLOPS 153.4 TFLOPS
Multi-GPU Scalability NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect 112.5 GB/s (bidirectional)
RT Core Performance 210.6 TFLOPS
VR Ready Yes
Vulkan API 1.2
DisplayPort Output 4x DisplayPort 1.4a; 4x DisplayPort 1.4a; 4x DP 1.4a
Minimum Recommended Power, Single Card (W) 300W; 700W; 600
Minimum Recommended Power, 2-Way (W) 500; 850; 750
Minimum Recommended Power, 3-Way (W) 850; 1000; 850
Minimum Recommended Power, 4-Way (W) 1000; 1200; 1000
Thermal Solution Active Heatsink; Active Heatsink; Blower Active Fan
Slot Height Single Slot; 2-Slot; 2-Slot