
An avid cyclist, Thomas Park knows the value of having lots of gears to maintain a smooth, fast ride.
So, when the software architect designed an AI inference platform to serve predictions for Oracle Cloud Infrastructure's (OCI) Vision AI service, he picked NVIDIA Triton Inference Server. That's because it can shift up, down or sideways to handle virtually any AI model, framework and hardware and operating mode - quickly and efficiently.
The NVIDIA AI inference platform gives our worldwide cloud services customers tremendous flexibility in how they build and run their AI applications, said Park, a Zurich-based computer engineer and competitive cycler who's worked for four of the world's largest cloud services providers.
Specifically, Triton reduced OCI's total cost of ownership by 10%, increased prediction throughput up to 76% and reduced inference latency up to 51% for OCI Vision and Document Understanding Service models that were migrated to Triton. The services run globally across more than 45 regional data centers, according to an Oracle blog Park and a colleague posted earlier this year.
Computer Vision Accelerates Insights Customers rely on OCI Vision AI for a wide variety of object detection and image classification jobs. For instance, a U.S.-based transit agency uses it to automatically detect the number of vehicle axles passing by to calculate and bill bridge tolls, sparing busy truckers wait time at toll booths.
OCI AI is also available in Oracle NetSuite, a set of business applications used by more than 37,000 organizations worldwide. It's used, for example, to automate invoice recognition.
Thanks to Park's work, Triton is now being adopted across other OCI services, too.
A Triton-Aware Data Service Our AI platform is Triton-aware for the benefit of our customers , said Tzvi Keisar, a director of product management for OCI's Data Science service, which handles machine learning for Oracle's internal and external users.
If customers want to use Triton, they don't have to worry about the configuration because it will be done automatically by the service, launching a Triton-powered inference endpoint for them, said Keisar.
Triton is included in NVIDIA AI Enterprise, a platform that provides full security and support businesses need - and it's available on OCI Marketplace.
A Massive SaaS Platform OCI's Data Science service is the machine learning platform for both Oracle NetSuite and Oracle Fusion Applications.
These business application suites are massive, with tens of thousands of customers who are also building their frameworks on top of our service, he said.
It's a wide swath of mainly enterprise users in manufacturing, retail, transportation and other industries. They're building and using AI models of nearly every shape and size.
Inference was one of the group's first services, and Triton came on the team's radar not long after its launch.
A Best-in-Class Inference Framework We saw Triton pick up in popularity as a best-in-class serving framework, so we started experimenting with it, Keisar said. We saw really good performance, and it closed a gap in our existing offerings, especially on multi-model inference - it's the most versatile and advanced inferencing framework out there.
Launched on OCI in March, Triton has already attracted the attention of many internal teams at Oracle hoping to use it for inference jobs that require serving predictions from multiple AI models running concurrently.
Triton has a very good track record and performance on multiple models deployed on a single endpoint, he said.
Accelerating the Future Looking ahead, Keisar's team is evaluating NVIDIA TensorRT-LLM software to supercharge inference on the complex large language models (LLMs) that have captured the imagination of many users.
An active blogger, Keisar's latest article detailed quantization techniques for running a Llama 2 LLM with a whopping 70 billion parameters on NVIDIA A10 Tensor Core GPUs.
Even down to four-bit parameters, the quality of model outputs is still quite good, he said. Deploying on NVIDIA GPUs gives us the flexibility to find a good balance in latency, throughput and cost.
After announcements this fall that Oracle is deploying the latest NVIDIA H100 Tensor Core GPUs, H200 GPUs, L40S GPUs and Grace Hopper Superchips, it's just the start of many accelerated efforts to come.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Colombia's Icesi University and WSDG are proud to announce Ronald David Reyes as the recipient of the 2025 WSDG Excellence Scholarship, awarded to an outsta...
20/03/2026
Avid today celebrated the filmmakers, editors and sound teams that worked with Avid Media Composer and Pro Tools to create the vast majority of this year'...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/03/2026
Real-time 9:16 AI-Generated Autocropping; Software-Defined Station in a Box; and Software Switcher with Unlimited Layering Are Among Show Highlights
For the fi...
20/03/2026
Signiant Showcases New Content Innovations Driving Visibility, Access, and Actio...
20/03/2026
Caffeine Relies on DaVinci Resolve Studio for End to End Post Workflow
Brie Clayton March 19, 2026
0 Comments
Blackmagic Cloud helps Mexican post faci...
20/03/2026
It's time to play The Money List! Baz Ashmawy is back at the helm as the qui...
19/03/2026
Live sports production increases complexity, with dynamic audio levels and an overall philosophy that encourages transient volume spikes
Fourteen years ago, Am...
19/03/2026
Advanced Systems Group, a technology and services provider for media creatives and content owners, announced the appointment of Peter Thordarson to the newly cr...
19/03/2026
For this senior from the Bay Area, the speed and pressure of live sports production play right into her strengths
In the live-sports-video industry, the future...
19/03/2026
Grass Valley has expanded its long-term partnership with University of Pittsburg...
19/03/2026
Audio-Technica has released the ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones, designed for use with DSLR, mirrorless SLR, and other cameras.
The ATV-SG1...
19/03/2026
Harmonic (booth W2831) announces updates to its XOS Advanced Media Processor aim...
19/03/2026
DAZN and Top Rank have announced a multi-year partnership that will bring Top Ra...
19/03/2026
IHSE, a provider of KVM systems, has announced a partnership with Cyviz AS, a provider of technology solutions for collaboration and mission-critical operations...
19/03/2026
Net Insight has appointed Larissa G rner-Meeus as Chief Product Officer. She joins the company's executive management team.
G rner-Meeus holds a Dipl-Ing. ...
19/03/2026
Leader Electronics of Europe has appointed Rob Stanley as Regional Sales Manager for the UK and Northern Europe. In the role, he will manage key accounts and ha...
19/03/2026
FIFA has announced that YouTube will be a Preferred Platform for the FIFA World Cup 2026.
Under the agreement, FIFA's Media Partners will be able to publis...
19/03/2026
New features across mobile, connected devices, and automotive platforms undersco...
19/03/2026
PSSI Global Services has appointed Ben Bradshaw as Director of Product and Netwo...
19/03/2026
Cobalt Digital has announced its NAB 2026 product lineup, which includes additio...
19/03/2026
Sportradar has released a new report, Innovation in Sports Media: The Next Era of Sports Viewing, examining how the sports viewing experience in the U.S. is evo...
19/03/2026
Matrox Video has been awarded a three-year framework agreement to supply its Con...
19/03/2026
CBS Sports' Jason Cohen and TNT Sports' Chris Brown lead the charge on n...
19/03/2026
A1 Dave Grundtvig and his team deploy plenty of mics to capture the sounds and energy from the stands as well the court
March Madness is a tournament in which ...
19/03/2026
In 2021, we launched EQUAL, a program designed to address an industry reality that persists: Women artists, songwriters, and producers too often face fewer oppo...
19/03/2026
Latest EZKeys 2 expansion arrives
Toontrack's staggering collection of EZKeys 2 expansions has grown once again, and the latest instalment delivers a on...
19/03/2026
New generative AI plug-in due in May 2026
Roland have announced the upcoming launch of a new generative AI tool created in collaboration with Sony Computer ...
19/03/2026
Nick Williams updates users on insolvency process
Nick Williams, the CEO of Native Instruments, has released the following official statement regarding thei...
19/03/2026
Iconic Swedish mic manufacturer back in action
Legendary Swedish microphone manufacturer Milab have announced that production is now fully underway, and mic...
19/03/2026
Acclaimed saturation unit goes virtual
Freqport's Freqtube FT1 (reviewed here in SOS February 2023) offers a convenient way to integrate real valve-base...
19/03/2026
The discontinuation of loss-making business activities as part of the restructur...
19/03/2026
Silicon Valley satire The Audacity premieres 15 April on SBS and SBS On Demand
19 March, 2026
Media releases
From one of the writer/producers of Succession...
19/03/2026
SBS brings communities together at Bondi Pavilion for Harmony Week multilingual ...
19/03/2026
Clarification from SBS regarding Western Sydney expansion
19 March, 2026
Media releases
From an SBS spokesperson:
SBS wishes to clarify some media coverag...
19/03/2026
Test & measurement innovator, Leader Electronics of Europe, is pleased to announce the appointment of Rob Stanley as Regional Sales Manager - UK & Northern Euro...
19/03/2026
The recently announced joint venture between Accedo One and Magine Pro has been officially launched as Leyra. The new company will combine the two complementary...
19/03/2026
Budapest, Hungary, March 2026 - Demand for traditional matrix switching remains strong across live events, rental and staging markets. With a reputation for rel...
19/03/2026
DPA Microphones adds to its CORE microphone selection with the 4097 CORE Micro Shotgun, which delivers a new level of clarity, headroom and sonic transparency...
19/03/2026
Starfish Technologies will present the latest releases of its TS Splicer (Win) and TS Splicer (K8) at NAB Show 2026, together with a new Monitoring Dashboard de...