Product | NVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC) | NVIDIA® L4 ADA GPU Computing Accelerator - 24GB GDDR6X - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® H100 NVL GPU Computing Accelerator - 94GB HBM3 - PCIe 5.0 x16 - Passive Cooling | NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) |
Action | Select | Select | Select | Select | Select | Select | Select |
Main Specifications | |||||||
Product Series | Nvidia A10 | Nvidia L4 | Nvidia L40 | Nvidia L40S | Nvidia H100 NVL | ||
Core Type | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | ||
Core Clock Speed | 885 MHz (1695 MHz Boost Clock) | 795 MHz Base | 2040 MHz Boost | |||||
Host Interface | PCI Express 4.0 x16 64GB/s | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
GPU Architecture | Ampere | Ada Lovelace | Ada Lovelace | Ada Lovelace | Hopper | ||
Product Type | Workstation | Workstation | |||||
Product Line | NVIDIA Professional Graphics | NVIDIA Professional Graphics | |||||
Memory Technology | GDDR6 | GDDR6 | |||||
Memory Capacity | 48 GB | 48 GB with ECC | |||||
Max Displays | 4 Displays | ||||||
Detailed Specifications | |||||||
Streaming Processor Cores | 18,176 | 10752 Shading Units | 18,176 | ||||
NVIDIA Tensor Cores | 568 | Gen 4 | 336 | 568 | ||||
NVIDIA RT Cores | 72 RT Cores | 142 | Gen 3 | 84 | 142 | |||
PCIe x16 Interconnect Bandwidth | PCIe Gen5: 128GB/s | ||||||
Memory Clock Speed | 1563 MHz | 6251 MHz | 2000 MHz 16 Gbps effective | ||||
Memory Interface | 192-bit | 384-bit | 384-bit | ||||
Max Memory Size | 24 GB GDDR6 | 24 GB | 48 GB GDDR6 with ECC | 48GB GDDR6 with ECC | 94 GB | ||
Max Memory Bandwidth | 600 GB/s | 300 GB/s | 864 GB/s | 7.8TB/s | |||
Peak FP64 | 68 teraFLOPs | ||||||
Peak FP64 Tensor Core | 134 teraFLOPs | ||||||
INT8 Tensor Core | 250 TOPS | 500 TOPS | 485 TOPS | Sparsity | 733 teraFLOPS | 7,916 TOPS | |||
TF32 Tensor Core | 62.5 teraFLOPS | 125 teraFLOPS | 120 TFLOPS | Sparsity | 183 teraFLOPS | 1,979 teraFLOPs | |||
FP32 | 31.2 teraFLOPS | 30.3 TFLOPS | 91.6 teraFLOPS | 134 teraFLOPs | |||
Peak BFLOAT16 Tensor Core | 125 teraFLOPS | 250 teraFLOPS | 242 TFLOPS | Sparsity | 362.05 teraFLOPS | 3,958 teraFLOPs | |||
Peak FP16 Tensor Core | 125 teraFLOPS | 250 teraFLOPS | 242 TFLOPS | Sparsity | 362.05 teraFLOPS | 3,958 teraFLOPs | |||
Peak FP8 Tensor Core | 485 TFLOPS | Sparsity | 733 teraFLOPS | 7,916 teraFLOPs | ||||
Peak INT4 Tensor Core | 500 TOPS | 1,000 TOPS | 733 teraFLOPS | |||||
Total NVLink Bandwidth | Not supported | 600GB/s | |||||
Multi-Instance GPUs | No | ||||||
Tensor Performance | 1457.0 TFLOPS | ||||||
NVIDIA CUDA™ Technology | Yes | ||||||
vGPU Software Support | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) | ||||||
NVENC | NVDEC | 2 | 4 | 4 | JPEG Decoders | AV1 Encode and Decode | 3x | 3x (Includes AV1 Encode & Decode) | 3x l 3x (includes AV1 encode and decode) | ||||
Secure Boot with Root of Trust | Yes | Yes | Yes | ||||
NEBS Ready | Yes | Level 3 | Yes / Level 3 | Level 3 | ||||
ECC Protection | On by Default | ||||||
Transistor Count | 28.3 Billion | 76.3 billion | |||||
DisplayPort Connectors | None | vGPU Only | 4x DP 1.4a | 4x DisplayPort 1.4a | ||||
Cooling | Passive | Passive | Passive | Passive | Passive | ||
Dual Slot | Single-slot | No | Yes | Yes | |||
Dimensions | FHFL | 4.4" (H) x 10.5" (L) | 4.4" (H) x 10.5" (L) | 4.4" (H) x 10.5" (L) | 4.4" H x 10.5" L | ||
Form Factor | 6.61” L x 2.71” H (Low-profile) | PCIe | PCIe | ||||
Lithography | 8 nm | Samsung 8nm | 4 nm NVIDIA Custom Process | ||||
Supplementary Power Connectors | None | 1x 16-pin PCIe CEM5 | 1x 16-pin | 1x 8-pin EPS | 1x PCIe CEM5 16-pin | ||
Max Graphics Card Power (W) | 150W | 72W | 300W | 350W | 400W | 300W | 300W |
Processor | Ampere (GA102) | NVIDIA Ada Lovelace | |||||
Memory Bandwidth | 768 GB/s | 960 GB/s | |||||
Core Clock Speed | 1455 MHz Base Clock 1860 MHz Boost Clock | ||||||
L2 Cache Size | 6 MB | ||||||
API Support | CUDA 8.5, OpenCL 2.0 Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2 | ||||||
Texture Fill Rate | 625 GTexel/s | ||||||
Graphics Resolution | 7680 x 4320 x36 bpp at 60 Hz | ||||||
Peak Double Precision FP64 Performance | 1,250 GFLOPS (1:32) | ||||||
Peak Single Precision FP32 Performance | 38.7 TFLOPS | 91.1 TFLOPS | |||||
Peak Half Precision FP16 Performance | 40.00 TFLOPS (1:1) | ||||||
Multi-GPU Scalability | NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000 | ||||||
NVLink Interconnect | 112.5 GB/s (bidirectional) | ||||||
RT Core Performance | 210.6 TFLOPS | ||||||
VR Ready | Yes | ||||||
Vulkan API | 1.2 | ||||||
DisplayPort Output | 4x DisplayPort 1.4a | 4x DP 1.4a | |||||
Minimum Recommended Power, Single Card (W) | 700W | 600 | |||||
Minimum Recommended Power, 2-Way (W) | 850 | 750 | |||||
Minimum Recommended Power, 3-Way (W) | 1000 | 850 | |||||
Minimum Recommended Power, 4-Way (W) | 1200 | 1000 | |||||
Thermal Solution | Active Heatsink | Blower Active Fan | |||||
Slot Height | 2-Slot | 2-Slot | |||||
Action | Select | Select | Select | Select | Select | Select | Select |