Product | NVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC) | NVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive Cooler | NVIDIA® A30 GPU Computing Accelerator - 24GB HBM2 - PCIe 4.0 x16 - Passive Cooler | NVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling (w/o CEC) | NVIDIA® RTX A4000 - 16GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP) |
Main Specifications | |||||
Product Series | NVIDIA A10 | NVIDIA A16 | NVIDIA A30 | NVIDIA A40 | |
Core Type | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | |
Core Clock Speed | 885 MHz (1695 MHz Boost Clock) | ||||
Host Interface | PCI Express 4.0 x16 64GB/s | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
GPU Architecture | Ampere | Ampere | Ampere | Ampere | |
Product Type | Workstation | ||||
Product Line | NVIDIA Professional Graphics | ||||
Memory Technology | GDDR6 with ECC | ||||
Memory Capacity | 16 GB GDDR6 with ECC | ||||
Max Displays | 4 Displays | ||||
Detailed Specifications | |||||
Streaming Processor Cores | 10752 CUDA Cores | 6144 CUDA Cores | |||
NVIDIA Tensor Cores | 336 Tensor Cores | 192 Tensor Cores | |||
NVIDIA RT Cores | 72 RT Cores | 84 RT Cores | 48 RT Cores | ||
Memory Clock Speed | 1563 MHz | ||||
Memory Interface | 384-bit | 256-bit | |||
Memory Speed | 14.5 Gbps GDDR6 | ||||
Max Memory Size | 24 GB GDDR6 | 4x 16GB GDDR6 with error-correcting code (ECC) | 24 GB HBM2 | 48 GB GDDR6 with error-correcting code (ECC) | |
Max Memory Bandwidth | 600 GB/s | 4x 232GB/s | 933 GB/s | 696 GB/s | |
Peak FP64 | 5.2 teraFLOPS | ||||
Peak FP64 Tensor Core | 10.3 teraFLOPS | ||||
INT8 Tensor Core | 250 TOPS (500 TOPS with sparsity) | 330 TOPS (661 TOPS with sparsity) | |||
TF32 Tensor Core | 62.5 teraFLOPS (125 teraFLOPS with sparsity) | 82 teraFLOPS (165 teraFLOPS with sparsity) | |||
FP32 | 31.2 teraFLOPS | 10.3 teraFLOPS | |||
Peak BFLOAT16 Tensor Core | 125 teraFLOPS (250 teraFLOPS with sparsity) | 165 teraFLOPS (330 teraFLOPS with sparsity) | |||
Peak FP16 Tensor Core | 125 teraFLOPS (250 teraFLOPS with sparsity) | 165 teraFLOPS (330 teraFLOPS with sparsity) | |||
Peak INT4 Tensor Core | 500 TOPS (1,000 TOPS with sparsity) | 661 TOPS (1,321 TOPS with sparsity) | |||
Total NVLink Bandwidth | Third-gen NVLink: 200 GB/s | NVIDIA NVLink: 112.5 GB/s (bidirectional); PCIe Gen4: 16 GB/s | |||
NVIDIA CUDA™ Technology | Yes | ||||
Transistor Count | 17.4 Billion | ||||
DisplayPort Connectors | 3x DisplayPort 1.4. The A40 is configured for virtualization by default, with its physical display connectors disabled; the display outputs can be enabled via management software tools. | ||||
Cooling | Passive | Passive | Passive | ||
Slot Width | Single-slot | Dual-slot | Dual-slot | 2-slot Low-profile | |
Dimensions | FHFL | 4.4" (H) x 10.5" (L) | 4.4" (H) x 9.5" (L) | ||
Lithography | 8 nm | Samsung 8 nm | 8 nm | ||
Supplementary Power Connectors | None | 8-pin CPU | 1x 8-pin CPU (EPS12V) | 1x 8-pin CPU (EPS12V) | 1x 6-pin PCIe |
Max Graphics Card Power (W) | 150 | 250 | 165 | 300 | 140 |
Processor | Ampere (GA104) | ||||
Memory Bandwidth | 448 GB/s | ||||
Graphics Resolution | Max Digital Resolution: 7680 x 4320 x36 bpp at 60 Hz | ||||
Deep Learning TFLOPS | 153.4 TFLOPS | ||||
DisplayPort Output | 4x DisplayPort 1.4a | ||||
Minimum Recommended Power, Single Card (W) | 300 | ||||
Minimum Recommended Power, 2-Way (W) | 500 | ||||
Minimum Recommended Power, 3-Way (W) | 850 | ||||
Minimum Recommended Power, 4-Way (W) | 1000 | ||||
Thermal Solution | Active Heatsink | ||||
Slot Height | Single Slot | ||||