Product	NVIDIA® A10 GPU Computing Accelerator - 24GB GDDR6 - PCIe 4.0 x16 - Passive Cooler (w/o CEC)	NVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling (w/o CEC)	NVIDIA® RTX A6000 - 48GB GDDR6 - PCIe 4.0 x16 - Active Cooling (4xDP)
Action	Select	Select	Select
Main Specifications
Product Series	Nvidia A10	Nvidia A40
Core Type	NVIDIA TENSOR	NVIDIA TENSOR
Core Clock Speed	885 MHz (1695 MHz Boost Clock)
Host Interface	PCI Express 4.0 x16 64GB/s	PCI Express 4.0 x16	PCI Express 4.0 x16
GPU Architecture	Ampere	Ampere
Product Type			Workstation
Product Line			NVIDIA Professional Graphics
Memory Technology			GDDR6
Memory Capacity			48 GB
Detailed Specifications
Streaming Processor Cores		10752 CUDA Cores	10752 Shading Units
NVIDIA Tensor Cores		336 Tensor Cores	336
NVIDIA RT Cores	72 RT Cores	84 RT Cores	84
Memory Clock Speed	1563 MHz		2000 MHz 16 Gbps effective
Memory Interface		384-bit	384-bit
Memory Speeds (GT/s)		14.5Gbps GDDR6
Max Memory Size	24 GB GDDR6	48 GB GDDR6 with error-correcting code (ECC)
Max Memory Bandwidth	600 GB/s	696 GB/s
INT8 Tensor Core	250 TOPS \| 500 TOPS
TF32 Tensor Core	62.5 teraFLOPS \| 125 teraFLOPS
FP32	31.2 teraFLOPS
Peak BFLOAT16 Tensor Core	125 teraFLOPS \| 250 teraFLOPS
Peak FP16 Tensor Core	125 teraFLOPS \| 250 teraFLOPS
Peak INT4 Tensor Core	500 TOPS \| 1,000 TOPS
Total NVLink Bandwidth		NVIDIA NVLink 112.5 GB/s (bidirectional) PCIe Gen4 16 GB/s
NVIDIA CUDA™ Technology			Yes
Transistor Count			28.3 Billion
DisplayPort Connectors		3x DisplayPort 1.4 A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.
Cooling	Passive	Passive
Dual Slot	Single-slot	2-slot Low-profile
Dimensions	FHFL	4.4" (H) x 10.5" (L)	4.4" (H) x 10.5" (L)
Lithography	8 nm	Samsung 8nm	Samsung 8nm
Supplementary Power Connectors	None	1x 8-pin CPU (EPS12V)	1x 8-pin EPS
Max Graphics Card Power (W)	150W	300W	300W
Processor			Ampere (GA102)
Memory Bandwidth			768 GB/s
Core Clock Speed			1455 MHz Base Clock 1860 MHz Boost Clock
L2 Cache Size			6 MB
API Support			CUDA 8.5, OpenCL 2.0 Shader Model 6.5, OpenGL 4.6, DirectX 12 Ultimate (12_2), Vulkan 1.2
Texture Fill Rate			625 GTexel/s
Graphics Resolution			7680 x 4320 x36 bpp at 60 Hz
Peak Double Precision FP64 Performance			1,250 GFLOPS (1:32)
Peak Single Precision FP32 Performance			38.7 TFLOPS
Peak Half Precision FP16 Performance			40.00 TFLOPS (1:1)
Multi-GPU Scalability			NVLINK 2-way low profile (2-slot and 3-slot bridges) connects 2x NVIDIA RTX A6000
NVLink Interconnect			112.5 GB/s (bidirectional)
VR Ready			Yes
Vulkan API			1.2
DisplayPort Output			4x DisplayPort 1.4a
Minimum Recommended Power, Single Card (W)			700W
Minimum Recommended Power, 2-Way (W)			850
Minimum Recommended Power, 3-Way (W)			1000
Minimum Recommended Power, 4-Way (W)			1200
Thermal Solution			Active Heatsink
Slot Height			2-Slot
Action	Select	Select	Select