ProductNVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling (w/o CEC)NVIDIA® RTX A2000 - 12GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4x mDP)NVIDIA® RTX A4000 - 16GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)
ActionSelectSelectSelectSelect
Main Specifications
Product Series Nvidia A40
Core Type NVIDIA TENSOR
Host Interface PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16PCI Express 4.0 x16
GPU Architecture Ampere
Product Type WorkstationWorkstationWorkstation
Product Line NVIDIA Professional GraphicsNVIDIA Professional GraphicsNVIDIA Professional Graphics
Memory Technology GDDR6 with ECCGDDR6 with ECCGDDR6
Memory Capacity 12 GB GDDR6 with ECC16 GB GDDR6 with ECC48 GB
Max Displays 4 Displays4 Displays
Detailed Specifications
Streaming Processor Cores 10752 CUDA Cores3328 CUDA Cores6144 CUDA Cores10752 Shading Units
NVIDIA Tensor Cores 336 Tensor Cores104192336
NVIDIA RT Cores 84 RT Cores264884
Memory Clock Speed 6001 MHz2000 MHz 16 Gbps effective
Memory Interface 384-bit192-bit256-bit384-bit
Memory Speeds (GT/s) 14.5Gbps GDDR6
Max Memory Size 48 GB GDDR6 with error-correcting code (ECC)
Max Memory Bandwidth 696 GB/s
Total NVLink Bandwidth NVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/s
Tensor Performance 63.9 TFLOPS
NVIDIA CUDA™ Technology YesYes
Transistor Count 13.25 Billion17.4 Billion28.3 Billion
DisplayPort Connectors 3x DisplayPort 1.4
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
Cooling Passive
Dual Slot 2-slot Low-profile
Dimensions 4.4" (H) x 10.5" (L)2.713" H x 6.6" L4.4” H x 9.5” L4.4" (H) x 10.5" (L)
Lithography Samsung 8nm8nm8nmSamsung 8nm
Supplementary Power Connectors 1x 8-pin CPU (EPS12V)1x 6-pin PCIe1x 8-pin EPS
Max Graphics Card Power (W) 300W70W140W300W
Processor AmpereAmpere (GA104)Ampere (GA102)
Memory Bandwidth 288 GB/sec448 GB/sec768 GB/s
Core Clock Speed 1455 MHz Base Clock
1860 MHz Boost Clock
L2 Cache Size 6 MB
API Support CUDA 8.5, OpenCL 2.0
Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate 625 GTexel/s
Graphics Resolution 2x 7680 x 4320 at 60 HzMax Digital Resolution: 7680 x 4320 x36 bpp at 60 Hz7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance 1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance 8.0 TFLOPS38.7 TFLOPS
Peak Half Precision FP16 Performance 40.00 TFLOPS (1:1)
Deep Learning TFLOPS 153.4 TFLOPS
Multi-GPU Scalability NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect 112.5 GB/s (bidirectional)
RT Core Performance 15.6 TLOPS
VR Ready Yes
Vulkan API 1.2
DisplayPort Output 4x DisplayPort 1.4a4x DisplayPort 1.4a
Mini DisplayPort Output 4x mDP Latching
Minimum Recommended Power, Single Card (W) 300W700W
Minimum Recommended Power, 2-Way (W) 500850
Minimum Recommended Power, 3-Way (W) 8501000
Minimum Recommended Power, 4-Way (W) 10001200
Thermal Solution Active HeatsinkActive HeatsinkActive Heatsink
Slot Height 2-SlotSingle Slot2-Slot
ActionSelectSelectSelectSelect