NVIDIA GB200 NVL72

Supercharging Next-Generation AI and Accelerated Computing

The GB200 Grace Blackwell Superchip is a key component of the NVIDIA GB200 NVL72. It connects two high-performance NVIDIA Blackwell Tensor Core GPUs to an NVIDIA Grace CPU over the NVIDIA® NVLink®-C2C interconnect.

Real-Time LLM Inference: 30X vs. NVIDIA H100
Massive-Scale Training: 4X vs. NVIDIA H100
Energy-Efficient Infrastructure: 25X vs. NVIDIA H100
Data Processing: 18X vs. CPU

Key Highlights

Discover the revolutionary capabilities that make GB200 NVL72 the ultimate choice for next-generation AI workloads.

Real-Time LLM Inference

GB200 NVL72 introduces a second-generation Transformer Engine that enables FP4 AI. Coupled with fifth-generation NVIDIA NVLink, it delivers 30X faster real-time LLM inference for trillion-parameter language models compared to NVIDIA H100.
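To see why lower precision matters for trillion-parameter models, the sketch below estimates weight storage at FP16, FP8, and FP4. It is a back-of-the-envelope illustration, not an NVIDIA sizing tool: it counts weights only, assumes 2, 1, and 0.5 bytes per parameter, and ignores KV cache, activations, and any quantization scale factors. The function name `weight_footprint_tb` and the one-trillion-parameter model are hypothetical.

```python
# Bytes needed to store one parameter at each precision (weights only).
# Assumption: no overhead for scale factors or other quantization metadata.
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "FP4": 0.5}


def weight_footprint_tb(num_params: float, precision: str) -> float:
    """Return weight storage in terabytes, using 1 TB = 1e12 bytes."""
    return num_params * BYTES_PER_PARAM[precision] / 1e12


if __name__ == "__main__":
    params = 1e12  # hypothetical one-trillion-parameter model
    for precision in ("FP16", "FP8", "FP4"):
        tb = weight_footprint_tb(params, precision)
        print(f"{precision}: {tb:.1f} TB of weights")
```

Under these assumptions, halving the precision halves the weight footprint, which leaves more GPU memory and memory bandwidth for the rest of the inference workload.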

Massive-Scale Training

GB200 NVL72 includes a second-generation Transformer Engine featuring FP8 precision, enabling 4X faster training for large language models at scale compared to NVIDIA H100.

Energy-Efficient Infrastructure

Liquid-cooled GB200 NVL72 racks reduce a data center's carbon footprint and energy consumption. Liquid cooling increases compute density, reduces the amount of floor space used, and facilitates high-bandwidth, low-latency GPU communication.

Data Processing

GB200 takes advantage of the high-bandwidth memory performance, NVLink-C2C, and dedicated decompression engines in the NVIDIA Blackwell architecture to speed up key database queries by 18X compared to CPU.

Technical Specifications

Comprehensive technical details and performance metrics for the NVIDIA GB200 NVL72 system.

Technical Detail        GB200 NVL72                            GB200 Grace Blackwell Superchip
Configuration           36 Grace CPUs : 72 Blackwell GPUs      1 Grace CPU : 2 Blackwell GPUs
FP4 Tensor Core²        1,440 PFLOPS                           40 PFLOPS
FP8/FP6 Tensor Core²    720 PFLOPS                             20 PFLOPS
INT8 Tensor Core²       720 POPS                               20 POPS
FP16/BF16 Tensor Core²  360 PFLOPS                             10 PFLOPS
TF32 Tensor Core        180 PFLOPS                             5 PFLOPS
FP32                    6,480 TFLOPS                           180 TFLOPS
FP64                    3,240 TFLOPS                           90 TFLOPS
FP64 Tensor Core        3,240 TFLOPS                           90 TFLOPS
GPU Memory | Bandwidth  Up to 13.5 TB HBM3e | 576 TB/s         Up to 384 GB HBM3e | 16 TB/s
NVLink Bandwidth        130 TB/s                               3.6 TB/s
CPU Core Count          2,592 Arm® Neoverse V2 cores           72 Arm Neoverse V2 cores
CPU Memory | Bandwidth  Up to 17 TB LPDDR5X | Up to 18.4 TB/s  Up to 480 GB LPDDR5X | Up to 512 GB/s

1. Preliminary specifications. May be subject to change. 2. With sparsity.
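The two specification columns are related by a simple factor: 72 GPUs at 2 GPUs per superchip gives 36 superchips per rack. The sketch below multiplies each per-superchip figure from the table by 36 and compares it with the published rack figure. The names `SPECS`, `scaled`, and `mismatches` are illustrative, and the 3% tolerance is an assumption chosen to absorb rounding in the published values (for example, 384 GB x 36 is 13,824 GB, which matches 13.5 TB only if 1 TB is taken as 1,024 GB).

```python
SUPERCHIPS_PER_RACK = 72 // 2  # 72 GPUs, 2 GPUs per superchip -> 36

# name: (per-superchip value, published rack value), both in the same unit
SPECS = {
    "FP4 Tensor Core (PFLOPS)": (40, 1440),
    "FP8/FP6 Tensor Core (PFLOPS)": (20, 720),
    "INT8 Tensor Core (POPS)": (20, 720),
    "FP16/BF16 Tensor Core (PFLOPS)": (10, 360),
    "TF32 Tensor Core (PFLOPS)": (5, 180),
    "FP32 (TFLOPS)": (180, 6480),
    "FP64 (TFLOPS)": (90, 3240),
    "GPU memory (GB)": (384, 13500),
    "GPU memory bandwidth (TB/s)": (16, 576),
    "NVLink bandwidth (TB/s)": (3.6, 130),
    "CPU cores": (72, 2592),
    "CPU memory (GB)": (480, 17000),
    "CPU memory bandwidth (GB/s)": (512, 18400),
}


def scaled(per_superchip: float) -> float:
    """Scale a per-superchip figure up to a full 36-superchip rack."""
    return per_superchip * SUPERCHIPS_PER_RACK


def mismatches(tolerance: float = 0.03) -> list[str]:
    """Return spec names whose scaled value differs from the rack value
    by more than the given relative tolerance."""
    return [
        name
        for name, (chip, rack) in SPECS.items()
        if abs(scaled(chip) - rack) / rack > tolerance
    ]


if __name__ == "__main__":
    for name, (chip, rack) in SPECS.items():
        print(f"{name}: {chip} x {SUPERCHIPS_PER_RACK} = {scaled(chip):g} (published {rack:g})")
    print("Outside tolerance:", mismatches())
```

The compute, GPU bandwidth, and core-count rows scale exactly. The memory capacity and NVLink rows differ slightly from a straight multiple, which is consistent with rounding in the published figures.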

Ready to Transform Your AI Infrastructure?

Experience the power of next-generation AI computing with NVIDIA GB200 NVL72. Contact our experts to learn how this revolutionary technology can accelerate your AI initiatives.