Sony Pixel Power calrec Sony

NVIDIA and AWS Collaborate to Bring AI to Production at Scale

23/06/2026

Building AI systems at scale is demanding, requiring low-latency inference, fast vector search, strong GPU price-performance and infrastructure that can grow without multiplying operational complexity.

NVIDIA's latest work with Amazon Web Services (AWS) addresses each of those constraints. Across Amazon OpenSearch and Amazon EC2, NVIDIA AI infrastructure is giving enterprises more practical paths to deploy AI at production scale.

EC2 G7 instances powered by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs expand the compute layer for AI, graphics, video and data analytics workloads, while the NVIDIA cuVS library accelerates the retrieval layer by making GPU-powered vector indexing the default in OpenSearch Serverless. And with AWS achieving NVIDIA Exemplar Cloud status for NVIDIA GB300, customers can trust they're receiving peak optimized performance for their training workloads.

NVIDIA RTX PRO 4500 Blackwell Server Edition Multi-Workload GPUs Power New Amazon EC2 G7 Instances Amazon EC2 G7 instances bring NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs to AWS for AI inference, graphics, spatial computing and GPU-accelerated data analytics - delivering a new instance type engineered for production workloads that need performance without the operational overhead of a customer-managed GPU platform.

Compared with G6 instances, G7 delivers up to 4.6x AI inference performance, up to 2.1x graphics performance and significantly faster GPU-accelerated data analytics on Amazon EMR using the NVIDIA cuDF library for Apache Spark workloads.

With support for up to eight GPUs, 256GB of total GPU memory, 700 Gbps of EFA-enabled networking and up to 7.6TB of local NVMe SSD storage - across one-, two-, four- and eight- GPU configurations plus bare metal, coming soon - G7 instances let customers right-size infrastructure for their workloads instead of over-provisioning for them.

The platform's versatility means AI teams get lower-latency inference. Media and entertainment teams get high-resolution video workflows and rendering. Simulation, computer-aided design, virtual desktop infrastructure, gaming and spatial computing teams get the same instance type for graphics-intensive applications. And data teams can apply the GPU memory, local storage and networking improvements to analytics pipelines and vector database workloads.

G7 instances are accessible through AWS Deep Learning Amazon Machine Images (AMIs), Amazon Deep Learning Containers, Amazon EMR, Amazon EKS, Amazon ECS and graphics AMIs - and coming soon to Amazon SageMaker AI.

NVIDIA cuVS Makes GPU-Accelerated Vector Search the Default in Amazon OpenSearch The next generation of Amazon OpenSearch Serverless powers agentic AI and dynamic workloads with no infrastructure management required. It uses GPU-accelerated vector indexing, powered by NVIDIA cuVS, as the default compute choice for all vector collections.

For teams building retrieval-augmented generation, semantic search, recommendation systems and agentic AI applications, that shift matters. It turns GPU-powered vector search from a specialized optimization project into a standard AWS capability.

The customer impact is direct: vector indexing up to 10x faster at a quarter of the cost, compared with CPU-only builds - making billion-scale vector databases practical to build in under an hour.

By making NVIDIA cuVS the default in OpenSearch Serverless, AWS customers get a much faster path from raw data to production-ready AI retrieval infrastructure - with serverless scaling that reduces operational overhead when workloads are idle.

AWS Achieves NVIDIA Exemplar Cloud Status for GB300 Training Performance AWS has achieved NVIDIA Exemplar Cloud status on NVIDIA GB300 for training workloads. This means AWS meets the rigorous performance thresholds that NVIDIA uses to benchmark AI workloads against its reference architecture.

This achievement is the result of deep co-engineering efforts between AWS and NVIDIA teams. Through the NVIDIA Exemplar Clouds initiative, developers and AI leaders can be confident they're using consistent, high-performance cloud infrastructure for large-scale training, helping teams evaluate cloud providers with greater confidence, improve total cost of ownership and move AI projects from planning to production more efficiently.

Together, these advancements reinforce every layer of the AI infrastructure stack on AWS. The throughline is the same: production-grade AI infrastructure that performs at scale, without adding operational burden to the teams running it.

Learn more in this AWS blog.

Watch NVIDIA CEO Jensen Huangs GTC Taipei Keynote Replay Watch Here

Recent News

AI

How Businesses Are Building Specialized AI They Can Trust June 23, 2026

AI Infrastructure

NVIDIA Powers Over 400 of the World's 500 Fastest Supercomputers June 23, 2026

AI

NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations June 22, 2026

AI Infrastructure

At ISC, JUPITER Shows What Exascale Science Looks Like June 22, 2026

View All Recent News

Categories:

AI Infrastructure

Cloud

Tags:

Agentic AI

NVIDIA Blackwell
LINK: https://blogs.nvidia.com/blog/nvidia-aws-ai-production-scale/...
See more stories from nvidia

North America Stories

24/06/2026

Gray Media Launches Political 360 Digital Advertising Solution

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

Walmart to Pay $1.4 Billion to Acquire Ad Tech Firm Vibe.co

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

FCC Flooded with Nearly 28K Comments on 'The View'

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

First Rush Brings SDI Multicam ProRes Recording to Apple Silicon Macs

First Rush Brings SDI Multicam ProRes Recording to Apple Silicon Macs Brie Clayton June 23, 2026 0 Comments First Rush is a native macOS application d...

24/06/2026

Vertical Drama Beneath Crimson Sails Created with Blackmagic Design

Vertical Drama Beneath Crimson Sails Created with Blackmagic Design Brie Clayton June 23, 2026 0 Comments Thunder Child Productions relies on cameras&...

23/06/2026

Case Study: YES Networks IP Transition Expands Production Possibilities and Redefines Workflows

When we began planning our transition from an SDI-based infrastructure to a new ...

23/06/2026

Imagine Communications Appoints Greg Garmon as SVP, Americas Video Sales

Imagine Communications has announced the appointment of Greg Garmon as Senior Vice President, Americas Video Sales. Garmon will oversee account growth and busin...

23/06/2026

Snap Promotes Emma Wakely to Head of Sports and Media Partnerships, Americas

Snap has promoted Emma Wakely to Head of Sports and Media Partnerships, Americas, succeeding Anmol Malhotra, who has been elevated to Global Head of Content and...

23/06/2026

YES Network and Gotham Sports App to Air MI New York Major League Cricket Matches

YES Network and The Gotham Sports App will air MI New York's Major League Cr...

23/06/2026

HAND Issues Persistent Digital IDs to 2026 NBA Draft Class

The Universal Talent Identifier (HAND) has issued HAND IDs to 34 top projected prospects in the 2026 NBA Draft class, including AJ Dybantsa, Cameron Boozer, and...

23/06/2026

World Boxing Launches World Boxing TV Streaming Platform

World Boxing has announced the launch of World Boxing TV, a subscription-based streaming platform built on the Joymo platform, offering live events, on-demand c...

23/06/2026

FloRacing to Stream 32 Off-Road Motorcycle Racing Events Including AMA Amateur National Motocross Championship

FloSports will stream 32 off-road motorcycle racing events on FloRacing, includi...

23/06/2026

SES Adds 14 Regional Channels and New Set-Top Boxes to ASTRA TV in Spain

SES has announced the expansion of its ASTRA TV platform in Spain with the addition of 14 regional channels in HD and UHD quality and the launch of new hybrid s...

23/06/2026

Appear Supports Rede Legislativas Contribution Workflow for Brazils TV 3.0 Trials

Appear ASA has announced its role in Rede Legislativa de R dio e TV's contri...

23/06/2026

PBS Selects LTN for Nationwide IP Video Network Across 330 Member Stations

LTN has announced that PBS has selected it as its IP video partner to modernize content distribution and contribution across more than 330 public television sta...

23/06/2026

Ease Live Powers Interactive Experience on Rally.TV for WRC

Ease Live has announced that its graphics overlay platform is powering an interactive fan experience on Rally.TV, the official streaming platform of the FIA Wor...

23/06/2026

Chyron LIVE Adds Haivision StreamHub Integration, SCTE-35 Ad Insertion, and Switcher Updates

Chyron has announced updates to Chyron LIVE, its cloud-native live production pl...

23/06/2026

ESPN Announces ESPN Fan House, Fan Engagement Hub Powered by Flowcode

ESPN has announced ESPN Fan House, a fan engagement hub powered by Flowcode, launching in August ahead of the 2026 college football season. Publicis Sports will...

23/06/2026

Sennheiser Relocates Americas Regional Hub to Nashville

The city's solid position in broadcast, entertainment, and sports attracted the major microphone manufacturer Sennheiser Group is moving its Americas Regio...

23/06/2026

IAB Tech Lab Releases SupplyChain v1.1

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Besco to Represent PlayBox Neo in South Korea

PlayBox Neo appoints Besco as Channel Reseller to establish a firm foothold in Asia Pacific's thriving high-tech export-driven economic boom PlayBox Neo, t...

23/06/2026

PBS Selects LTN to Power Nationwide IP Video Network

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

PBS selects LTN for nationwide IP video network

LTN, a global leader in IP-based video transport and network services, today announced that PBS has selected LTN as its IP video partner to modernize and future...

23/06/2026

The LiveU Q Era Arrives in ANZ with the LU900Q at ABE2026

LiveU will introduce its Q Era to Australia and New Zealand for the first time at ABE2026 on Stand No. 25, (July 30 31). Leading the showcase is the LU900Q, a n...

23/06/2026

Miri Technologies Ships V410 Live 4K Video Encoder-Decode...

Miri Technologies Inc. has begun shipping its highly anticipated V410 live 4K video encoder/decoder for streaming, IP-based production workflows and AV-over-IP ...

23/06/2026

DHD SX2 and TX2 Consoles Go On-Air at Radio Tzafon

DHD audio reports the completion of an upgrade to the audio production facilities at the Galilee headquarters of Radio Tzafon. The station broadcasts two progra...

23/06/2026

Nagravision Launches Nagra Venturi Security Offering

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

ITN Expands Programmatic Local TV Platform

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Warner Bros. Discovery Taps AWS for New AI-Powered Ad Tech

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Study: Younger Viewers More Distracted But More Receptive to Ads

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Chilevisin, ClaroVTR Tap Pixop for 4K FIFA World Cup Feed

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Imagine Communications Names Greg Garmon as Senior Vice P...

Multifaceted Growth Executive Brings 20+ Years of Experience Leading Organizations Across Tech and M&E Imagine Communications today announced the appointment ...

23/06/2026

Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026

Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026 Brie Clayton June 22, 2026 0 Comments New Relay Solution Combines DMX, ...

23/06/2026

SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Global Media Technology Community

SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Gl...

23/06/2026

NVIDIA and AWS Collaborate to Bring AI to Production at Scale

Building AI systems at scale is demanding, requiring low-latency inference, fast vector search, strong GPU price-performance and infrastructure that can grow wi...

23/06/2026

NVIDIA Powers Over 400 of the World's 500 Fastest Supercomputers

News Highlights: NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list. 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up ei...

23/06/2026

How Businesses Are Building Specialized AI They Can Trust

Companies are asking how to build specialized AI that fits with the way their workflows actually run. The first wave of enterprise AI was about access. Compan...

23/06/2026

June 22, 2026

Newly identified molecule strengthens the eye's response to damage in retinal disease Scripps Research discovery finds that restoring the naturally occurrin...

22/06/2026

Behind the Mic: SportsCenters Lisa Cohn to Retire This June From ESPN as Longest-Tenured Anchor

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

22/06/2026

Cosm Appoints David Ho as Chief Legal Officer

Cosm has announced the appointment of David Ho as Chief Legal Officer, a newly created executive role reporting to President and CEO Jeb Terry. Ho will oversee ...

22/06/2026

Warner Bros. Discovery and AWS Announce AI-Powered Advertising Technology Platform

Warner Bros. Discovery and Amazon Web Services (AWS) have announced the developm...

22/06/2026

Daktronics Completes Audio Control System Upgrade at Petco Park for San Diego Padres

Daktronics has completed an audio control system upgrade at Petco Park in San Di...

22/06/2026

Accelerate Media Names John Willi President, Launches Accelerate Sports Network

Accelerate Media has named John Willi as President and announced the launch of the Accelerate Sports Network (ASN), a prep sports media and streaming platform c...

22/06/2026

AWSN to Air 3XBA Womens Basketball Tournament Live June 26-27

All Women's Sports Network (AWSN) and 3XBA (3 3 Basketball Association) have announced live television coverage of the annual 3XBA tournament on Friday, Jun...

22/06/2026

OWL AI Appoints Jay Prasad as Chief Executive Officer

OWL AI has announced the appointment of Jay Prasad as Chief Executive Officer and member of the Board of Directors. Prasad succeeds Josh Gwyther, who has served...

22/06/2026

CP Communications Provides RF Support for Inside the NBA at 2026 NBA Finals

CP Communications delivered RF video and audio support for TNT's Inside the NBA at the 2026 NBA Finals, providing main show coverage in San Antonio and ea...

22/06/2026

Polymarket and GRID Partner to Integrate Esports Data and Streaming into Trading Platform

Polymarket has announced a partnership with GRID, an official esports data platf...

22/06/2026

SVG New Sponsor Spotlight: Metinteractive's Rachel Mele, Ken Cyr on Building Technology Backbones for Sports Venues

As sports venues continue to evolve into more video-centric, fan-engagement-driv...

22/06/2026

SVG All-Stars: Corbin Perkins, Chief Engineer, Victory+

As the regional sports production scene shifts toward streaming, this Texan helps lead the engineering behind Victory+'s growing live platform...

22/06/2026

Meet the 2026 Sundance Institute Documentary Edit Intensive Fellows

By Kristin Feeley, Director, Documentary Film & Artist Programs the memories of your elders [are] a scaffolding for you to build your identity on - and t...