| | | | | | |
Product | AMD Instinct™ MI100 Accelerator - 32GB HBM2 - PCIe 4.0 x16 - Passive Cooling | AMD Instinct™ MI210 Accelerator - 64GB HBM2e - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® H100 NVL GPU Computing Accelerator - 94GB HBM3 - PCIe 5.0 x16 - Passive Cooling | NVIDIA® RTX 5000 Ada Generation - 32GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) |
Action | Select | Select | Select | Select | Select | Select |
Main Specifications |
Product Series |
AMD Instinct | AMD Instinct | Nvidia L40S | Nvidia H100 NVL | | |
Core Type |
| | NVIDIA TENSOR | NVIDIA TENSOR | | |
Core Clock Speed |
1502 MHz | 1700 MHz | | | | |
Host Interface |
PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
GPU Architecture |
CDNA | CDNA2 | Ada Lovelace | Hopper | | |
Product Type |
| | | | Workstation | Workstation |
Product Line |
| | | | NVIDIA Professional Graphics | NVIDIA Professional Graphics |
Memory Technology |
| | | | GDDR6 | GDDR6 |
Memory Capacity |
| | | | 32 GB GDDR6 ECC | 48 GB with ECC |
Max Displays |
| | | | 4 Displays | 4 Displays |
Detailed Specifications |
Streaming Processor Cores |
7,680 | 6656 | 18,176 | | 12,800 CUDA Parallel Processing Cores | 18,176 |
Compute Units |
120 | 104 | | | | |
NVIDIA Tensor Cores |
| | 568 | Gen 4 | | 400 | 568 |
NVIDIA RT Cores |
| | 142 | Gen 3 | | 100 | 142 |
PCIe x16 Interconnect Bandwidth |
| | | PCIe Gen5: 128GB/s | | |
Memory Clock Speed |
1.2 GHz | 1.6 GHz | | | | |
Memory Interface |
4096-bit | 4096-bit | | | 256-bit | 384-bit |
Max Memory Size |
32 GB HBM2 | 64 GB HBM2e | 48GB GDDR6 with ECC | 94 GB | | |
Max Memory Bandwidth |
Up to 1228.8 GB/s | Up to 1638.4 GB/s | 864 GB/s | 7.8TB/s | | |
Infinity Fabric™ Links |
3 | 3 | | | | |
Peak Infinity Fabric™ Link Bandwidth |
92 GB/s | 100 GB/s | | | | |
Peak FP64 |
| | | 68 teraFLOPs | | |
Peak FP64 Tensor Core |
| | | 134 teraFLOPs | | |
INT8 Tensor Core |
| | 733 teraFLOPS | 7,916 TOPS | | |
TF32 Tensor Core |
| | 183 teraFLOPS | 1,979 teraFLOPs | | |
FP32 |
| 22.6 TFLOPs | 91.6 teraFLOPS | 134 teraFLOPs | | |
Peak BFLOAT16 Tensor Core |
| | 362.05 teraFLOPS | 3,958 teraFLOPs | | |
Peak FP16 Tensor Core |
| | 362.05 teraFLOPS | 3,958 teraFLOPs | | |
Peak FP8 Tensor Core |
| | 733 teraFLOPS | 7,916 teraFLOPs | | |
Peak INT4 Tensor Core |
| | 733 teraFLOPS | | | |
Total NVLink Bandwidth |
| | Not supported | 600GB/s | | |
Multi-Instance GPUs |
| | No | | | |
Tensor Performance |
| | | | 1044.4 TFLOPS | 1457.0 TFLOPS |
NVENC | NVDEC |
| | 3x l 3x (includes AV1 encode and decode) | | | |
Secure Boot with Root of Trust |
| | Yes | | | |
NEBS Ready |
| | Level 3 | | | |
Peak Half Precision (FP16) Performance |
184.6 TFLOPs | | | | | |
Peak Single Precision Matrix (FP32) Performance |
46.1 TFLOPs | 45.3 TFLOPs | | | | |
Peak Double Precision Matrix (FP64) Performance |
| 45.3 TFLOPs | | | | |
Peak Single Precision (FP32) Performance |
23.1 TFLOPs | | | | | |
Peak Double Precision (FP64) Performance |
11.5 TFLOPs | 22.6 TFLOPs | | | | |
Peak INT4 Performance |
184.6 TOPs | 181 TOPs | | | | |
Peak INT8 Performance |
184.6 TOPs | | | | | |
Peak bfloat16 |
92.3 TFLOPs | 181 TFLOPs | | | | |
ECC Protection |
| Yes (Full-Chip) | | | | |
Transistor Count |
| | | | 76.3 billion | 76.3 billion |
DisplayPort Connectors |
| | 4x DisplayPort 1.4a | | | |
OS Support |
Linux x86_64 | Linux x86_64 | | | | |
Cooling |
Passive | Passive | Passive | Passive | | |
Dual Slot |
yes | yes | | Yes | | |
Dimensions |
10.5" (267 mm) Board Length | 10.5" (267 mm) Board Length | 4.4" (H) x 10.5" (L) | | 4.4" H x 10.5" L | 4.4" H x 10.5" L |
Form Factor |
| Full Height | | PCIe | | |
Lithography |
TSMC 7nm FinFET | | | | 4 nm NVIDIA Custom Process | 4 nm NVIDIA Custom Process |
Supplementary Power Connectors |
2x PCIe 8-pin connectors | 1x8 pin 12V EPS | 1x 16-pin | | 1x 16-pin CEM5 PCIe | 1x PCIe CEM5 16-pin |
Max Graphics Card Power (W) |
300W | 300W Peak | 350W | 400W | 250W | 300W |
Processor |
| | | | NVIDIA Ada Lovelace | NVIDIA Ada Lovelace |
Memory Bandwidth |
| | | | 576 GB/s | 960 GB/s |
Peak Single-Precision Performance |
| | | | 65.3 TFLOPS | |
Peak Single Precision FP32 Performance |
| | | | | 91.1 TFLOPS |
NVLink Interconnect |
| | | | Not Supported | |
RT Core Performance |
| | | | 151.0 TFLOPS | 210.6 TFLOPS |
DisplayPort Output |
| | | | | 4x DP 1.4a |
Mini DisplayPort Output |
| | | | 4x DP 1.4a | |
Minimum Recommended Power, Single Card (W) |
| | | | | 600 |
Minimum Recommended Power, 2-Way (W) |
| | | | | 750 |
Minimum Recommended Power, 3-Way (W) |
| | | | | 850 |
Minimum Recommended Power, 4-Way (W) |
| | | | | 1000 |
Thermal Solution |
| | | | Blower Active Fan | Blower Active Fan |
Slot Height |
| | | | 2-Slot | 2-Slot |
Action | Select | Select | Select | Select | Select | Select |