
With generative AI and large language models (LLMs) driving groundbreaking innovations, the computational demands for training and inference are skyrocketing.
These modern-day generative AI applications demand full-stack accelerated compute, starting with state-of-the-art infrastructure that can handle massive workloads with speed and accuracy. To help meet this need, Oracle Cloud Infrastructure today announced general availability of NVIDIA H100 Tensor Core GPUs on OCI Compute, with NVIDIA L40S GPUs coming soon.
NVIDIA H100 Tensor Core GPU Instance on OCI
The OCI Compute bare-metal instances with NVIDIA H100 GPUs, powered by the NVIDIA Hopper architecture, enable an order-of-magnitude leap for large-scale AI and high-performance computing, with unprecedented performance, scalability and versatility for every workload.
Organizations using NVIDIA H100 GPUs obtain up to a 30x increase in AI inference performance and up to a 4x boost in AI training performance compared with the NVIDIA A100 Tensor Core GPU. The H100 GPU is designed for resource-intensive computing tasks, including training LLMs and running inference on them.
The BM.GPU.H100.8 OCI Compute shape includes eight NVIDIA H100 GPUs, each with 80GB of HBM3 GPU memory. Between the eight GPUs, 3.2TB/s of bisection bandwidth enables each GPU to communicate directly with all seven other GPUs via NVIDIA NVSwitch and NVLink 4.0 technology. The shape also includes 16 local NVMe drives with a capacity of 3.84TB each, 4th Gen Intel Xeon processors with 112 cores and 2TB of system memory.
In a nutshell, this shape is optimized for organizations' most challenging workloads.
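The per-node figures quoted above for BM.GPU.H100.8 can be combined into aggregate totals. The following is a minimal sketch, not an OCI API: the `ShapeSpec` class and its `weights_fit` helper are hypothetical names introduced for illustration, and the fit check assumes only that model weights occupy `bytes_per_param` bytes each (2 for 16-bit weights), ignoring activations, optimizer state and inference caches.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ShapeSpec:
    """Hypothetical container for the per-node specs quoted in the article."""
    name: str
    gpus: int
    gpu_memory_gb: int      # memory per GPU, in GB
    nvme_drives: int
    nvme_drive_tb: float    # capacity per local NVMe drive, in TB

    @property
    def total_gpu_memory_gb(self) -> int:
        # Aggregate GPU memory across all GPUs in one node.
        return self.gpus * self.gpu_memory_gb

    @property
    def total_nvme_tb(self) -> float:
        # Aggregate local NVMe capacity in one node.
        return round(self.nvme_drives * self.nvme_drive_tb, 2)

    def weights_fit(self, params_billions: float, bytes_per_param: int = 2) -> bool:
        # Assumption: 1 billion parameters at N bytes each is about N GB.
        # Only raw weights are counted; real workloads need extra headroom.
        return params_billions * bytes_per_param <= self.total_gpu_memory_gb


# Values taken from the BM.GPU.H100.8 description above.
H100_SHAPE = ShapeSpec(
    name="BM.GPU.H100.8",
    gpus=8,
    gpu_memory_gb=80,
    nvme_drives=16,
    nvme_drive_tb=3.84,
)

if __name__ == "__main__":
    print(H100_SHAPE.total_gpu_memory_gb)   # 640 GB of GPU memory per node
    print(H100_SHAPE.total_nvme_tb)         # 61.44 TB of local NVMe per node
    print(H100_SHAPE.weights_fit(70))       # 140 GB of 16-bit weights fits
    print(H100_SHAPE.weights_fit(400))      # 800 GB of 16-bit weights does not
```

Under these assumptions a single node offers 640GB of GPU memory and 61.44TB of local NVMe storage; workloads whose weights exceed the single-node total are the cases where scaling across multiple nodes becomes relevant.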
Depending on workload size and timelines, OCI Supercluster allows organizations to scale their NVIDIA H100 GPU usage from a single node to tens of thousands of H100 GPUs over a high-performance, ultra-low-latency network.
NVIDIA L40S GPU Instance on OCI
The NVIDIA L40S GPU, based on the NVIDIA Ada Lovelace architecture, is a universal GPU for the data center, delivering breakthrough multi-workload acceleration for LLM inference and training, visual computing and video applications. The OCI Compute bare-metal instances with NVIDIA L40S GPUs will be available for early access later this year, with general availability coming early in 2024.
These instances will offer an alternative to the NVIDIA H100 and A100 GPU instances for tackling small- to medium-sized AI workloads, as well as graphics and video compute tasks. The NVIDIA L40S GPU achieves up to a 20% performance boost for generative AI workloads and as much as a 70% improvement in fine-tuning AI models compared with the NVIDIA A100.
The BM.GPU.L40S.4 OCI Compute shape includes four NVIDIA L40S GPUs, along with the latest-generation Intel Xeon CPU with up to 112 cores, 1TB of system memory, 15.36TB of low-latency NVMe local storage for caching data and 400GB/s of cluster network bandwidth. This instance is designed for a wide range of use cases, from LLM training, fine-tuning and inference to NVIDIA Omniverse workloads and industrial digitalization, 3D graphics and rendering, video transcoding and FP32 HPC.
NVIDIA and OCI: Enterprise AI
This collaboration between OCI and NVIDIA will enable organizations of all sizes to join the generative AI revolution by providing them with state-of-the-art NVIDIA H100 and L40S GPU-accelerated infrastructure.
Access to NVIDIA GPU-accelerated instances may not be enough, however. Unlocking the maximum potential of NVIDIA GPUs on OCI Compute means having an optimal software layer. NVIDIA AI Enterprise streamlines the development and deployment of enterprise-grade accelerated AI software with open-source containers and frameworks optimized for the underlying NVIDIA GPU infrastructure, all with the help of support services.
To learn more, join NVIDIA at Oracle CloudWorld in the AI Pavilion, attend this session on the new OCI instances on Wednesday, Sept. 20, and visit these web pages on Oracle Cloud Infrastructure, OCI Compute, how Oracle approaches AI and the NVIDIA AI Platform.