Product | NVIDIA® A2 GPU Computing Accelerator - 16GB GDDR6 - PCIe 4.0 x8 - Passive Cooler | NVIDIA® A16 GPU Computing Accelerator - 64GB (4x 16GB) GDDR6 - PCIe 4.0 x16 - Passive Cooler | NVIDIA® A40 GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40 ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® L40S ADA GPU Computing Accelerator - 48GB GDDR6 - PCIe 4.0 x16 - Passive Cooling | NVIDIA® RTX 5000 Ada Generation - 32GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) | NVIDIA® RTX 6000 Ada Generation - 48GB GDDR6 ECC - PCIe 4.0 x16 - Active Cooling (4xDP) |
Manufacturer | NVIDIA | NVIDIA | NVIDIA | NVIDIA | NVIDIA | PNY | PNY |
Part Number | 699-2G179-0220-000 | 900-2G171-0020-100 | 900-2G133-0000-000 | 900-2G133-0010-000 | 900-2G133-0080-000 | VCNRTX5000ADA-PB | VCNRTX6000ADA-PB |
Main Specifications | |||||||
Product Series | NVIDIA A2 | NVIDIA A16 | NVIDIA A40 | NVIDIA L40 | NVIDIA L40S | ||
Core Type | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | NVIDIA TENSOR | ||
Core Clock Speed | 1440 MHz (1770 MHz Boost Clock) | ||||||
Host Interface | PCI Express 4.0 x8 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
GPU Architecture | Ampere | Ampere | Ampere | Ada Lovelace | Ada Lovelace | ||
Product Type | Workstation | Workstation | |||||
Product Line | NVIDIA Professional Graphics | NVIDIA Professional Graphics | |||||
Memory Technology | GDDR6 | GDDR6 | |||||
Memory Capacity | 32 GB GDDR6 ECC | 48 GB with ECC | |||||
Max Displays | 4 Displays | 4 Displays | |||||
Detailed Specifications | |||||||
Streaming Processor Cores | 1280 CUDA Cores | 10752 CUDA Cores | 18,176 | 12,800 CUDA Parallel Processing Cores | 18,176 | ||
NVIDIA Tensor Cores | 40 (Gen 3) | 336 Tensor Cores | 568 (Gen 4) | 400 | 568 | ||
NVIDIA RT Cores | 10 (Gen 2) | 84 RT Cores | 142 (Gen 3) | 100 | 142 | ||
Memory Clock Speed | 6251 MHz | ||||||
Memory Interface | 128-bit | 384-bit | 256-bit | 384-bit | |||
Memory Speeds (GT/s) | 14.5 Gbps GDDR6 | ||||||
Max Memory Size | 16 GB GDDR6 ECC | 4x 16 GB GDDR6 with error-correcting code (ECC) | 48 GB GDDR6 with error-correcting code (ECC) | 48 GB GDDR6 with ECC | 48 GB GDDR6 with ECC | ||
Max Memory Bandwidth | 200 GB/s | 4x 232 GB/s | 696 GB/s | 864 GB/s | |||
INT8 Tensor Core | 733 TOPS | ||||||
TF32 Tensor Core | 9 TFLOPS | 18 TFLOPS Sparsity | 183 teraFLOPS | |||||
FP32 | 4.5 TFLOPS | 91.6 teraFLOPS | |||||
Peak BFLOAT16 Tensor Core | 362.05 teraFLOPS | ||||||
Peak FP16 Tensor Core | 18 TFLOPS | 36 TFLOPS Sparsity | 362.05 teraFLOPS | |||||
Peak FP8 Tensor Core | 733 teraFLOPS | ||||||
Peak INT4 Tensor Core | 733 TOPS | ||||||
Total NVLink Bandwidth | NVLink: 112.5 GB/s (bidirectional); PCIe Gen4: 16 GB/s | Not supported | |||||
Multi-Instance GPUs | No | ||||||
Tensor Performance | 1044.4 TFLOPS | 1457.0 TFLOPS | |||||
NVIDIA CUDA™ Technology | 11.1 or later | ||||||
vGPU Software Support | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation (vWS) | ||||||
NVENC / NVDEC | 3x / 3x (includes AV1 encode and decode) | 3x / 3x (includes AV1 encode and decode) |||||
Secure Boot with Root of Trust | Yes | Yes | |||||
NEBS Ready | Yes / Level 3 | Level 3 | |||||
Peak INT4 Performance | 72 TOPS | 144 TOPS Sparsity | ||||||
Peak INT8 Performance | 36 TOPS | 72 TOPS Sparsity | ||||||
ECC Protection | On by Default | ||||||
Transistor Count | 76.3 billion | 76.3 billion | |||||
DisplayPort Connectors | 3x DisplayPort 1.4. The A40 is configured for virtualization by default, with physical display connectors disabled; the display outputs can be enabled via management software tools. | 4x DP 1.4a | 4x DisplayPort 1.4a | ||||
Cooling | Passive | Passive | Passive | Passive | Passive | ||
Dual Slot | Single-slot | Dual-slot | 2-slot Low-profile | Yes | |||
Dimensions | 6.61" L x 2.71" H | 4.4" H x 10.5" L | 4.4" H x 10.5" L | 4.4" H x 10.5" L | 4.4" H x 10.5" L | 4.4" H x 10.5" L | |
Form Factor | Low-Profile PCIe | PCIe | |||||
Lithography | Samsung 8nm | 4 nm NVIDIA Custom Process | 4 nm NVIDIA Custom Process | ||||
Supplementary Power Connectors | 8-pin CPU | 1x 8-pin CPU (EPS12V) | 1x 16-pin PCIe CEM5 | 1x 16-pin | 1x 16-pin CEM5 PCIe | 1x PCIe CEM5 16-pin | |
Max Graphics Card Power (W) | 40-60 W (configurable) | 250 W | 300 W | 300 W | 350 W | 250 W | 300 W |
Processor | NVIDIA Ada Lovelace | NVIDIA Ada Lovelace | |||||
Memory Bandwidth | 576 GB/s | 960 GB/s | |||||
Peak Single-Precision Performance | 65.3 TFLOPS | ||||||
Peak Single Precision FP32 Performance | 91.1 TFLOPS | ||||||
NVLink Interconnect | Not Supported | ||||||
RT Core Performance | 151.0 TFLOPS | 210.6 TFLOPS | |||||
DisplayPort Output | 4x DP 1.4a | ||||||
Mini DisplayPort Output | 4x DP 1.4a | ||||||
Minimum Recommended Power, Single Card (W) | 600 | ||||||
Minimum Recommended Power, 2-Way (W) | 750 | ||||||
Minimum Recommended Power, 3-Way (W) | 850 | ||||||
Minimum Recommended Power, 4-Way (W) | 1000 | ||||||
Thermal Solution | Blower Active Fan | Blower Active Fan | |||||
Slot Height | 2-Slot | 2-Slot | |||||