
Building AI systems at scale is demanding, requiring low-latency inference, fast vector search, strong GPU price-performance and infrastructure that can grow without multiplying operational complexity.
NVIDIA's latest work with Amazon Web Services (AWS) addresses each of those constraints. Across Amazon OpenSearch and Amazon EC2, NVIDIA AI infrastructure is giving enterprises more practical paths to deploy AI at production scale.
EC2 G7 instances powered by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs expand the compute layer for AI, graphics, video and data analytics workloads, while the NVIDIA cuVS library accelerates the retrieval layer by making GPU-powered vector indexing the default in OpenSearch Serverless. And with AWS achieving NVIDIA Exemplar Cloud status for NVIDIA GB300, customers can trust they're receiving peak optimized performance for their training workloads.
NVIDIA RTX PRO 4500 Blackwell Server Edition Multi-Workload GPUs Power New Amazon EC2 G7 Instances Amazon EC2 G7 instances bring NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs to AWS for AI inference, graphics, spatial computing and GPU-accelerated data analytics - delivering a new instance type engineered for production workloads that need performance without the operational overhead of a customer-managed GPU platform.
Compared with G6 instances, G7 delivers up to 4.6x AI inference performance, up to 2.1x graphics performance and significantly faster GPU-accelerated data analytics on Amazon EMR using the NVIDIA cuDF library for Apache Spark workloads.
With support for up to eight GPUs, 256GB of total GPU memory, 700 Gbps of EFA-enabled networking and up to 7.6TB of local NVMe SSD storage - across one-, two-, four- and eight- GPU configurations plus bare metal, coming soon - G7 instances let customers right-size infrastructure for their workloads instead of over-provisioning for them.
The platform's versatility means AI teams get lower-latency inference. Media and entertainment teams get high-resolution video workflows and rendering. Simulation, computer-aided design, virtual desktop infrastructure, gaming and spatial computing teams get the same instance type for graphics-intensive applications. And data teams can apply the GPU memory, local storage and networking improvements to analytics pipelines and vector database workloads.
G7 instances are accessible through AWS Deep Learning Amazon Machine Images (AMIs), Amazon Deep Learning Containers, Amazon EMR, Amazon EKS, Amazon ECS and graphics AMIs - and coming soon to Amazon SageMaker AI.
NVIDIA cuVS Makes GPU-Accelerated Vector Search the Default in Amazon OpenSearch The next generation of Amazon OpenSearch Serverless powers agentic AI and dynamic workloads with no infrastructure management required. It uses GPU-accelerated vector indexing, powered by NVIDIA cuVS, as the default compute choice for all vector collections.
For teams building retrieval-augmented generation, semantic search, recommendation systems and agentic AI applications, that shift matters. It turns GPU-powered vector search from a specialized optimization project into a standard AWS capability.
The customer impact is direct: vector indexing up to 10x faster at a quarter of the cost, compared with CPU-only builds - making billion-scale vector databases practical to build in under an hour.
By making NVIDIA cuVS the default in OpenSearch Serverless, AWS customers get a much faster path from raw data to production-ready AI retrieval infrastructure - with serverless scaling that reduces operational overhead when workloads are idle.
AWS Achieves NVIDIA Exemplar Cloud Status for GB300 Training Performance AWS has achieved NVIDIA Exemplar Cloud status on NVIDIA GB300 for training workloads. This means AWS meets the rigorous performance thresholds that NVIDIA uses to benchmark AI workloads against its reference architecture.
This achievement is the result of deep co-engineering efforts between AWS and NVIDIA teams. Through the NVIDIA Exemplar Clouds initiative, developers and AI leaders can be confident they're using consistent, high-performance cloud infrastructure for large-scale training, helping teams evaluate cloud providers with greater confidence, improve total cost of ownership and move AI projects from planning to production more efficiently.
Together, these advancements reinforce every layer of the AI infrastructure stack on AWS. The throughline is the same: production-grade AI infrastructure that performs at scale, without adding operational burden to the teams running it.
Learn more in this AWS blog.
Watch NVIDIA CEO Jensen Huangs GTC Taipei Keynote Replay Watch Here
Recent News
AI
How Businesses Are Building Specialized AI They Can Trust June 23, 2026
AI Infrastructure
NVIDIA Powers Over 400 of the World's 500 Fastest Supercomputers June 23, 2026
AI
NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations June 22, 2026
AI Infrastructure
At ISC, JUPITER Shows What Exascale Science Looks Like June 22, 2026
View All Recent News
Categories:
AI Infrastructure
Cloud
Tags:
Agentic AI
NVIDIA Blackwell
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
06/09/2026
June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
24/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
24/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
24/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
24/06/2026
First Rush Brings SDI Multicam ProRes Recording to Apple Silicon Macs
Brie Clayton June 23, 2026
0 Comments
First Rush is a native macOS application d...
24/06/2026
Vertical Drama Beneath Crimson Sails Created with Blackmagic Design
Brie Clayton June 23, 2026
0 Comments
Thunder Child Productions relies on cameras&...
23/06/2026
When we began planning our transition from an SDI-based infrastructure to a new ...
23/06/2026
Imagine Communications has announced the appointment of Greg Garmon as Senior Vice President, Americas Video Sales. Garmon will oversee account growth and busin...
23/06/2026
Snap has promoted Emma Wakely to Head of Sports and Media Partnerships, Americas, succeeding Anmol Malhotra, who has been elevated to Global Head of Content and...
23/06/2026
YES Network and The Gotham Sports App will air MI New York's Major League Cr...
23/06/2026
The Universal Talent Identifier (HAND) has issued HAND IDs to 34 top projected prospects in the 2026 NBA Draft class, including AJ Dybantsa, Cameron Boozer, and...
23/06/2026
World Boxing has announced the launch of World Boxing TV, a subscription-based streaming platform built on the Joymo platform, offering live events, on-demand c...
23/06/2026
FloSports will stream 32 off-road motorcycle racing events on FloRacing, includi...
23/06/2026
SES has announced the expansion of its ASTRA TV platform in Spain with the addition of 14 regional channels in HD and UHD quality and the launch of new hybrid s...
23/06/2026
Appear ASA has announced its role in Rede Legislativa de R dio e TV's contri...
23/06/2026
LTN has announced that PBS has selected it as its IP video partner to modernize content distribution and contribution across more than 330 public television sta...
23/06/2026
Ease Live has announced that its graphics overlay platform is powering an interactive fan experience on Rally.TV, the official streaming platform of the FIA Wor...
23/06/2026
Chyron has announced updates to Chyron LIVE, its cloud-native live production pl...
23/06/2026
ESPN has announced ESPN Fan House, a fan engagement hub powered by Flowcode, launching in August ahead of the 2026 college football season. Publicis Sports will...
23/06/2026
The city's solid position in broadcast, entertainment, and sports attracted the major microphone manufacturer
Sennheiser Group is moving its Americas Regio...
23/06/2026
128 channels of signal routing & DSP
Announced just before the NAMM Show 2026, Violet Audio's latest digital audio matrix offers 128 channels of signal ...
23/06/2026
Latest Current expansion created by EPROM
Minimal Audio have just launched the latest Current Expansion, Memory Rites. Designed in collaboration with renown...
23/06/2026
Popular hardware EQ gets official plug-in emulation
Undertone Audio have just launched a new plug-in that brings one of their most popular hardware designs ...
23/06/2026
December 7, 2022
Colorfront (colorfront.com) - the multi-award-winning develope...
23/06/2026
April 23, 2026
NAB 2026, Las Vegas - the Academy and Emmy Award-winning develop...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
PlayBox Neo appoints Besco as Channel Reseller to establish a firm foothold in Asia Pacific's thriving high-tech export-driven economic boom
PlayBox Neo, t...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
LTN, a global leader in IP-based video transport and network services, today announced that PBS has selected LTN as its IP video partner to modernize and future...
23/06/2026
LiveU will introduce its Q Era to Australia and New Zealand for the first time at ABE2026 on Stand No. 25, (July 30 31). Leading the showcase is the LU900Q, a n...
23/06/2026
Miri Technologies Inc. has begun shipping its highly anticipated V410 live 4K video encoder/decoder for streaming, IP-based production workflows and AV-over-IP ...
23/06/2026
DHD audio reports the completion of an upgrade to the audio production facilities at the Galilee headquarters of Radio Tzafon. The station broadcasts two progra...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Multifaceted Growth Executive Brings 20+ Years of Experience Leading Organizations Across Tech and M&E
Imagine Communications today announced the appointment ...
23/06/2026
Australians in Film and Screen Australia's talent development initiative UNT...
23/06/2026
Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026
Brie Clayton June 22, 2026
0 Comments
New Relay Solution Combines DMX, ...
23/06/2026
SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Gl...
23/06/2026
Building AI systems at scale is demanding, requiring low-latency inference, fast vector search, strong GPU price-performance and infrastructure that can grow wi...
23/06/2026
23rd June 2026, London: UKTV and BBC Entertainment have unveiled a joint co-comm...
23/06/2026
Also starring Jonny Lee Miller, Sheldon Shepherd and Bel Powley, the ambitious f...
23/06/2026
The priority now is a clear and credible plan
June 23, 2026, Winchester, UK - Arqiva, the UK's leading communications infrastructure provider, welcomes tod...
23/06/2026
The RT Toy Show Appeal has raised over 31 million since its inception in 2020 ...
23/06/2026
News Highlights:
NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list.
26 systems on the TOP500 adopted the NVIDIA Grace CPU, up ei...