Tuesday, July 22, 2025

New Amazon EC2 P6e-GB200 UltraServers accelerated by NVIDIA Grace Blackwell GPUs for the highest AI performance



Today, we’re announcing the general availability of Amazon Elastic Compute Cloud (Amazon EC2) P6e-GB200 UltraServers, accelerated by NVIDIA GB200 NVL72 to offer the highest GPU performance for AI training and inference. Amazon EC2 UltraServers connect multiple EC2 instances using a dedicated, high-bandwidth, low-latency accelerator interconnect across these instances.

Each NVIDIA Grace Blackwell Superchip connects two high-performance NVIDIA Blackwell tensor core GPUs and an Arm-based NVIDIA Grace CPU using the NVIDIA NVLink-C2C interconnect. Each Grace Blackwell Superchip delivers 10 petaflops of FP8 compute (without sparsity) and up to 372 GB of HBM3e memory. With the superchip architecture, GPU and CPU are colocated within one compute module, which significantly increases bandwidth between GPU and CPU compared to current generation EC2 P5en instances.

With EC2 P6e-GB200 UltraServers, you can access up to 72 NVIDIA Blackwell GPUs within one NVLink domain to use 360 petaflops of FP8 compute (without sparsity) and 13.4 TB of total high bandwidth memory (HBM3e). Powered by the AWS Nitro System, P6e-GB200 UltraServers are deployed in EC2 UltraClusters to securely and reliably scale to tens of thousands of GPUs.
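
The aggregate figures above follow from the per-superchip numbers. Here is a minimal sketch of that arithmetic in Python; the function and constant names are illustrative and not part of any AWS or NVIDIA API.

```python
# Per-superchip figures quoted in this post.
GPUS_PER_SUPERCHIP = 2
FP8_PFLOPS_PER_SUPERCHIP = 10   # dense FP8, without sparsity
HBM3E_GB_PER_SUPERCHIP = 372    # "up to" figure


def nvlink_domain_totals(gpus: int) -> dict:
    """Aggregate compute and memory for an NVLink domain with `gpus` GPUs."""
    if gpus <= 0 or gpus % GPUS_PER_SUPERCHIP:
        raise ValueError("GPU count must be a positive multiple of 2")
    superchips = gpus // GPUS_PER_SUPERCHIP
    return {
        "superchips": superchips,
        "fp8_pflops": superchips * FP8_PFLOPS_PER_SUPERCHIP,
        "hbm3e_tb": superchips * HBM3E_GB_PER_SUPERCHIP / 1000,  # decimal TB
    }


totals = nvlink_domain_totals(72)
print(totals)
```

For 72 GPUs this gives 36 superchips, 360 petaflops, and about 13.4 TB. Because 372 GB is an upper bound per superchip, the specifications table lists a slightly lower total (13,320 GB) for the 72-GPU size.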

EC2 P6e-GB200 UltraServers deliver up to 28.8 Tbps of total Elastic Fabric Adapter (EFAv4) networking. EFA is also coupled with NVIDIA GPUDirect RDMA to enable low-latency GPU-to-GPU communication between servers with operating system bypass.

EC2 P6e-GB200 UltraServers specifications
EC2 P6e-GB200 UltraServers are available in sizes ranging from 36 to 72 GPUs under NVLink. Here are the specifications for EC2 P6e-GB200 UltraServers:

UltraServer type  GPUs  GPU memory  vCPUs  Instance memory  Instance storage  Aggregate EFA network  EBS bandwidth
                        (GB)               (GiB)            (TB)              bandwidth (Gbps)       (Gbps)
u-p6e-gb200x36    36    6660        1296   8640             202.5             14400                  540
u-p6e-gb200x72    72    13320       2592   17280            405               28800                  1080
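
To compare the two sizes, it can help to normalize each column by GPU count. The following sketch encodes the table values in a small Python structure; the class and field names are made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UltraServerSpec:
    gpus: int
    gpu_memory_gb: int
    vcpus: int
    instance_memory_gib: int
    instance_storage_tb: float
    efa_gbps: int
    ebs_gbps: int


# Values copied from the specifications table.
SPECS = {
    "u-p6e-gb200x36": UltraServerSpec(36, 6660, 1296, 8640, 202.5, 14400, 540),
    "u-p6e-gb200x72": UltraServerSpec(72, 13320, 2592, 17280, 405.0, 28800, 1080),
}


def per_gpu(spec: UltraServerSpec) -> dict:
    """Divide the headline resources by the GPU count."""
    return {
        "gpu_memory_gb": spec.gpu_memory_gb / spec.gpus,
        "vcpus": spec.vcpus / spec.gpus,
        "efa_gbps": spec.efa_gbps / spec.gpus,
        "ebs_gbps": spec.ebs_gbps / spec.gpus,
    }


for name, spec in SPECS.items():
    print(name, per_gpu(spec))
```

Both sizes work out to the same per-GPU figures (185 GB of GPU memory, 36 vCPUs, 400 Gbps of EFA bandwidth, and 15 Gbps of EBS bandwidth per GPU), so the 72-GPU size is exactly double the 36-GPU size. The 28,800 Gbps aggregate matches the 28.8 Tbps EFA figure quoted earlier.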

P6e-GB200 UltraServers are ideal for the most compute- and memory-intensive AI workloads, such as training and inference of frontier models, including mixture-of-experts models and reasoning models, at the trillion-parameter scale.

You can build agentic and generative AI applications, including question answering, code generation, video and image generation, speech recognition, and more.

P6e-GB200 UltraServers in action
You can use EC2 P6e-GB200 UltraServers in the Dallas Local Zone through EC2 Capacity Blocks for ML. The Dallas Local Zone (us-east-1-dfw-2a) is an extension of the US East (N. Virginia) Region.

To reserve your EC2 Capacity Blocks, choose Capacity Reservations on the Amazon EC2 console. Select Purchase Capacity Blocks for ML, then choose your total capacity and specify how long you need the EC2 Capacity Block for u-p6e-gb200x36 or u-p6e-gb200x72 UltraServers.

Once a Capacity Block is successfully scheduled, it’s charged up front and its price doesn’t change after purchase. The payment will be billed to your account within 12 hours after you purchase the EC2 Capacity Blocks. To learn more, visit Capacity Blocks for ML in the Amazon EC2 User Guide.
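
The same search can be scripted instead of done in the console. The sketch below only builds a request dictionary and makes no AWS calls. The parameter names (`UltraserverType`, `UltraserverCount`, `CapacityDurationHours`, `StartDateRange`) reflect my understanding of the EC2 `DescribeCapacityBlockOfferings` API and should be treated as assumptions to verify against the current SDK documentation; the helper name is hypothetical.

```python
from datetime import datetime, timezone

# UltraServer types named in this post.
ULTRASERVER_TYPES = {"u-p6e-gb200x36", "u-p6e-gb200x72"}


def capacity_block_query(ultraserver_type: str, ultraserver_count: int,
                         duration_days: int, earliest_start: datetime) -> dict:
    """Build keyword arguments for a Capacity Block offering search.

    Parameter names are assumptions; check them against the SDK docs.
    """
    if ultraserver_type not in ULTRASERVER_TYPES:
        raise ValueError(f"unknown UltraServer type: {ultraserver_type}")
    if ultraserver_count < 1 or duration_days < 1:
        raise ValueError("count and duration must be at least 1")
    return {
        "UltraserverType": ultraserver_type,
        "UltraserverCount": ultraserver_count,
        "CapacityDurationHours": duration_days * 24,
        "StartDateRange": earliest_start,
    }


query = capacity_block_query(
    "u-p6e-gb200x72", 1, 7, datetime(2025, 8, 1, tzinfo=timezone.utc)
)
print(query)
# With boto3 installed and credentials configured, the dictionary could be
# passed on as: ec2_client.describe_capacity_block_offerings(**query)
```

Keeping the request construction separate from the network call makes the inputs easy to validate and review before anything is purchased.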

To run instances within your purchased Capacity Block, you can use the AWS Management Console, AWS Command Line Interface (AWS CLI), or AWS SDKs. On the software side, you can start with the AWS Deep Learning AMIs. These images are preconfigured with the frameworks and tools that you probably already know and use: PyTorch, JAX, and a lot more.
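
For the SDK route, here is a hedged sketch of the launch parameters in Python. It builds the keyword arguments for the EC2 `RunInstances` call (boto3 `run_instances`), targeting a Capacity Block through `InstanceMarketOptions` and `CapacityReservationSpecification`. The helper name and all IDs are placeholders, and the instance type is left as a parameter because this post does not name it.

```python
def capacity_block_launch_params(capacity_reservation_id: str, image_id: str,
                                 instance_type: str, count: int) -> dict:
    """Build RunInstances keyword arguments that target a Capacity Block."""
    if count < 1:
        raise ValueError("count must be at least 1")
    return {
        "ImageId": image_id,
        "InstanceType": instance_type,
        "MinCount": count,
        "MaxCount": count,
        # Tells EC2 to launch into a purchased Capacity Block.
        "InstanceMarketOptions": {"MarketType": "capacity-block"},
        "CapacityReservationSpecification": {
            "CapacityReservationTarget": {
                "CapacityReservationId": capacity_reservation_id
            }
        },
    }


params = capacity_block_launch_params(
    capacity_reservation_id="cr-EXAMPLE",      # placeholder reservation ID
    image_id="ami-EXAMPLE",                    # placeholder, e.g. a Deep Learning AMI
    instance_type="INSTANCE-TYPE-PLACEHOLDER", # take this from the EC2 P6e page
    count=1,
)
print(params)
# With boto3 installed and credentials configured:
#   ec2_client.run_instances(**params)
```

Setting `MinCount` equal to `MaxCount` makes the launch all-or-nothing, which is usually what you want for tightly coupled training jobs.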

You can also integrate EC2 P6e-GB200 UltraServers seamlessly with various AWS managed services. For example:

  • Amazon SageMaker HyperPod provides managed, resilient infrastructure that automatically handles the provisioning and management of P6e-GB200 UltraServers, replacing faulty instances with preconfigured spare capacity within the same NVLink domain to maintain performance.
  • Amazon Elastic Kubernetes Service (Amazon EKS) allows one managed node group to span multiple P6e-GB200 UltraServers as nodes, automating their provisioning and lifecycle management within Kubernetes clusters. You can use EKS topology-aware routing for P6e-GB200 UltraServers, enabling optimal placement of tightly coupled components of distributed workloads within a single UltraServer’s NVLink-connected instances.
  • Amazon FSx for Lustre file systems provide data access for P6e-GB200 UltraServers at the hundreds of GB/s of throughput and millions of input/output operations per second (IOPS) required for large-scale HPC and AI workloads. For fast access to large datasets, you can use up to 405 TB of local NVMe SSD storage or virtually unlimited cost-effective storage with Amazon Simple Storage Service (Amazon S3).
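
To illustrate the idea behind topology-aware placement mentioned in the EKS item, here is a toy Python sketch that keeps a tightly coupled group of workers on nodes from a single UltraServer. It is not how EKS or Kubernetes implements scheduling; the `ultraserver_id` field and function names are hypothetical.

```python
from collections import defaultdict


def group_by_ultraserver(nodes: list) -> dict:
    """Map each UltraServer ID to the names of the nodes it contains."""
    groups = defaultdict(list)
    for node in nodes:
        groups[node["ultraserver_id"]].append(node["name"])
    return dict(groups)


def place_tightly_coupled(nodes: list, needed: int):
    """Return `needed` node names from one UltraServer, or None if none fits."""
    groups = group_by_ultraserver(nodes)
    fitting = [(len(members), uid) for uid, members in groups.items()
               if len(members) >= needed]
    if not fitting:
        return None
    _, uid = min(fitting)  # best fit: smallest UltraServer that is big enough
    return sorted(groups[uid])[:needed]


nodes = [
    {"name": "n1", "ultraserver_id": "us-a"},
    {"name": "n2", "ultraserver_id": "us-a"},
    {"name": "n3", "ultraserver_id": "us-a"},
    {"name": "n4", "ultraserver_id": "us-b"},
    {"name": "n5", "ultraserver_id": "us-b"},
]
print(place_tightly_coupled(nodes, 2))
```

The best-fit choice leaves larger NVLink domains free for larger groups; a real scheduler would also account for free capacity, health, and spare nodes.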

Now available
Amazon EC2 P6e-GB200 UltraServers are available today in the Dallas Local Zone (us-east-1-dfw-2a) through EC2 Capacity Blocks for ML. For more information, visit the Amazon EC2 pricing page.

Give Amazon EC2 P6e-GB200 UltraServers a try in the Amazon EC2 console. To learn more, visit the Amazon EC2 P6e instances page and send feedback to AWS re:Post for EC2 or through your usual AWS Support contacts.

Channy


