Significant price hikes on 5090, L40S and Enerperise Blackwell Series GPUs continues into Q1 2026. Please note Credit Card payments will only work if USD or AED currency is selected on top right corner of the website. For US customers; before placing an order for any crypto miners, inquire with a live chat sales rep or toll-free phone agent about any potential tariffs. HGX B200 lead times are now between 8-20 weeks for Golden Sku selections, with custom BOMs exceed 26 weeks. HGX H200 offerings in stock, as well as limited HGX B300. We are now certified partners of Supermicro in both NA and MENA regions.
Ships in 5 days from payment. All sales final. No returns or cancellations. For bulk inquiries, consult a live chat agent or call our toll-free number.
Do you require higher performance for artificial intelligence (AI) training and inference, high-performance computing (HPC) or graphics? NVIDIA®
Accelerators for HPE help solve the world’s most important scientific, industrial, and business challenges with AI and HPC. Visualize complex
content to create cutting-edge products, tell immersive stories, and reimagine cities of the future. Extract new insights from massive datasets.
Hewlett Packard Enterprise servers with NVIDIA accelerators are designed for the age of elastic computing, providing unmatched acceleration at every scale.
The NVIDIA A16 64GB Gen4 PCIe Passive GPU offers the following features:
| Feature | Specification |
|---|---|
| GPU Architecture | NVIDIA Ampere |
| NVIDIA Third-Generation Tensor Cores | 160 total Tensor Cores (40 cores per GPU, 4 GPUs) |
| NVIDIA CUDA Cores (shading units) | 5120 total FP32 CUDA Cores (1280 cores per GPU, 4 GPUs) |
| NVIDIA RT Cores | 40 total RT Cores (10 cores per GPU, 4 GPUs) |
| Double-Precision Performance (FP64) | Not applicable |
| Single-Precision Performance | FP32: 4x 4.5 TFLOPS<br>Tensor Float 32 (TF32): 4x 9 TFLOPS, 4x 18 TFLOPS* |
| Half-Precision Performance | FP16: 4x 17.9 TFLOPS, 4x 35.9 TFLOPS* |
| Bfloat16 | Not applicable |
| Integer Performance | INT8: 4x 35.9 TOPS, 4x 71.8 TOPS* |
| GPU Memory | 64GB GDDR6 (16 GB per GPU, 4 CPUs) |
| Memory Bandwidth | 4x 200 GB/s |
| ECC | Yes |
| Interconnect Bandwidth | Not applicable |
| System Interface | PCIe Gen 4, x16 lanes |
| Form Factor | PCIe full height/length, double width (dual slot) |
| Multi-Instance GPU (MIG) | No support |
| Max Power Consumption | 250 W |
| Thermal Solution | Passive |
| Graphics APIs | DirectX 12.07, Shader Model 5.17, OpenGL 4.68, Vulkan 1.18 |
| Compute APIs | CUDA, DirectCompute, OpenCL, OpenACC |
No reviews available.