
An avid cyclist, Thomas Park knows the value of having lots of gears to maintain a smooth, fast ride.
So, when the software architect designed an AI inference platform to serve predictions for Oracle Cloud Infrastructure's (OCI) Vision AI service, he picked NVIDIA Triton Inference Server. That's because it can shift up, down or sideways to handle virtually any AI model, framework and hardware and operating mode - quickly and efficiently.
The NVIDIA AI inference platform gives our worldwide cloud services customers tremendous flexibility in how they build and run their AI applications, said Park, a Zurich-based computer engineer and competitive cycler who's worked for four of the world's largest cloud services providers.
Specifically, Triton reduced OCI's total cost of ownership by 10%, increased prediction throughput up to 76% and reduced inference latency up to 51% for OCI Vision and Document Understanding Service models that were migrated to Triton. The services run globally across more than 45 regional data centers, according to an Oracle blog Park and a colleague posted earlier this year.
Computer Vision Accelerates Insights Customers rely on OCI Vision AI for a wide variety of object detection and image classification jobs. For instance, a U.S.-based transit agency uses it to automatically detect the number of vehicle axles passing by to calculate and bill bridge tolls, sparing busy truckers wait time at toll booths.
OCI AI is also available in Oracle NetSuite, a set of business applications used by more than 37,000 organizations worldwide. It's used, for example, to automate invoice recognition.
Thanks to Park's work, Triton is now being adopted across other OCI services, too.
A Triton-Aware Data Service Our AI platform is Triton-aware for the benefit of our customers , said Tzvi Keisar, a director of product management for OCI's Data Science service, which handles machine learning for Oracle's internal and external users.
If customers want to use Triton, they don't have to worry about the configuration because it will be done automatically by the service, launching a Triton-powered inference endpoint for them, said Keisar.
Triton is included in NVIDIA AI Enterprise, a platform that provides full security and support businesses need - and it's available on OCI Marketplace.
A Massive SaaS Platform OCI's Data Science service is the machine learning platform for both Oracle NetSuite and Oracle Fusion Applications.
These business application suites are massive, with tens of thousands of customers who are also building their frameworks on top of our service, he said.
It's a wide swath of mainly enterprise users in manufacturing, retail, transportation and other industries. They're building and using AI models of nearly every shape and size.
Inference was one of the group's first services, and Triton came on the team's radar not long after its launch.
A Best-in-Class Inference Framework We saw Triton pick up in popularity as a best-in-class serving framework, so we started experimenting with it, Keisar said. We saw really good performance, and it closed a gap in our existing offerings, especially on multi-model inference - it's the most versatile and advanced inferencing framework out there.
Launched on OCI in March, Triton has already attracted the attention of many internal teams at Oracle hoping to use it for inference jobs that require serving predictions from multiple AI models running concurrently.
Triton has a very good track record and performance on multiple models deployed on a single endpoint, he said.
Accelerating the Future Looking ahead, Keisar's team is evaluating NVIDIA TensorRT-LLM software to supercharge inference on the complex large language models (LLMs) that have captured the imagination of many users.
An active blogger, Keisar's latest article detailed quantization techniques for running a Llama 2 LLM with a whopping 70 billion parameters on NVIDIA A10 Tensor Core GPUs.
Even down to four-bit parameters, the quality of model outputs is still quite good, he said. Deploying on NVIDIA GPUs gives us the flexibility to find a good balance in latency, throughput and cost.
After announcements this fall that Oracle is deploying the latest NVIDIA H100 Tensor Core GPUs, H200 GPUs, L40S GPUs and Grace Hopper Superchips, it's just the start of many accelerated efforts to come.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
18/10/2025
New England Sports Network (NESN) has chosen Harmonic, working with Astound Business Solutions, as its enterprise technology partner to transform primary distri...
18/10/2025
NEW ORLEANS, La. In the run-up to the start of the NBA season, WVUE-TV and Gray Local Media have announced a deal with DirecTV that will greatly expand access t...
18/10/2025
Berklee Celebrates 40 Years of the Fall Together Concert Faculty composers Bob Pilkington and Greg Hopkins are among the featured artists for this year's ...
17/10/2025
NEP Group Receives New Equity Investment From 26North Partners LP, Co-InvestorsCarlyle remains the largest shareholder as the company prepares for the futureBy ...
17/10/2025
Apple Lands Five-Year Deal for F1 Distribution in the U.S.Besides airing on Apple TV, the sport will be amplified on other Apple servicesBy Ken Kerschbaumer, Ed...
17/10/2025
SVG Sit-Down: Marshall Electronics' Bernie Keach on the Future of PTZ Camera...
17/10/2025
L2 Productions' REMI Facility in Austin Can Produce Content From AnywhereMusic festivals, sports events are produced via flypacks and remote control roomsBy...
17/10/2025
By Lucy Spicer
One of the most exciting things about the Sundance Film Festival...
17/10/2025
(L-R) Christopher Meyer, Addison Timlin, Cooper Raiff, Lili Reinhart, Alyah Chan...
17/10/2025
M sica e arte se uniram em uma noite especial na semana passada na ZIV Gallery, ...
17/10/2025
Music and art came together for one special night last week at ZIV Gallery, an i...
17/10/2025
Spotify and FC Barcelona are extending our partnership through 2030, continuing a collaboration that's redefining how fans, players, and artists connect. Th...
17/10/2025
MURRIETA, Calif. The Sports Fishing Championship (SFC) has deployed DigitalGlue's creative.space storage platform to streamline video production by centrali...
17/10/2025
BELLEVUE, Wash. Football continued to cement its reputation as a bulwark of TV advertising in Q3 2025 with new data from iSpot that showed both the NFL and coll...
17/10/2025
The Sports Fishing Championship (SFC), the premier competitive saltwater fishing series, has transformed its production workflow by adopting creative.space, the...
17/10/2025
QuickLink, a leading provider of award-winning multi-camera video productions and remote contribution solutions, announces the release of StudioPro Version 4, ...
17/10/2025
Although the annual Grammy Awards celebration is best known for recognizing achievements in the recording industry, the show often proves a visual spectacle as ...
17/10/2025
OpenDrives, Inc., a leading provider of software-defined data storage and data services, has promoted Alex Dunfey to Chief Technology Officer (CTO) from his for...
17/10/2025
The University of Arizona (UofA) has significantly upgraded its broadcast communication infrastructure with the integration of Riedel Communications' advanc...
17/10/2025
Harmonic (NASDAQ: HLIT) today announced that New England Sports Network (NESN), owned by Fenway Sports Group and Delaware North, has selected Harmonic as its en...
17/10/2025
Austin PBS has recently upgraded its facility-wide communications infrastructure, deploying Clear-Com 's Eclipse HX, FreeSpeak II beltpacks, and V-Series ...
17/10/2025
ZEISS announces an open call for the closed BETA testing phase of CinCraft Virtual Lens Technology, the innovative digital tool that brings authentic lens chara...
17/10/2025
Situated in the town of Kokkola, Centria University of Applied Sciences offers higher education across five core fields: engineering, business, social and healt...
17/10/2025
Public information channel in Georgia, USA, to implement a powerful, simple, and cost-effective playout automation platform.
Pebble, the leading automation, co...
17/10/2025
HBO Max is reporting that it has launched in 15 new markets, including Bangladesh, Cambodia, Macau, Pakistan, Sri Lanka and Ukraine, boosting the streaming serv...
17/10/2025
Netflix said it will make a major push into video podcasts, inking a wide-ranging deal with Spotify through which it will offer 16 podcasts in the U.S. starting...
17/10/2025
Lexington, Ky. As part of a push to highlight its advanced advertising capabilities, Viamedia has launched a new AI-powered ad tech platform and officially rebr...
17/10/2025
NEW YORK QuickLink has announced the release of StudioPro Version 4, which the company is calling the most significant upgrade yet to its flagship video product...
17/10/2025
NEW YORK and CUPERTINO, Calif. Apple and NBCUniversal said they will sell Apple TV and Peacock streaming bundles to U.S. subscribers starting Oct. 20....
17/10/2025
Q&A with Boston Conservatory Choral Conductor Stephen Spinelli How his research into the lost manuscripts of composer Florence Price led to a Grammy-winning c...
17/10/2025
Back to All News
Netflix ISP Speed Index for September 2025
Product
17 October 2025
Global
Link copied to clipboard
This month, 1% of Internet Service Pro...
17/10/2025
NVIDIA's on the ground at Open Source AI Week. Stay tuned for a celebration ...
17/10/2025
AI has ignited a new industrial revolution.
NVIDIA and TSMC are working togethe...
17/10/2025
Gexcon is a trusted safety and risk management partner for complex, high hazard environments. ICG has been a dedicated marketing partner to Gexcon since 2018, b...
17/10/2025
Here is your host, Patrick Kielty!
After an incredible breakthrough year, Kingf...
16/10/2025
SVG Sit-Down: FUJIFILM Execs on GFX ETERNA 55 Camera, Importance of Shallow-Dept...
16/10/2025
Squash's Most Ambitious Broadcast Production To Be Deployed at Comcast Busin...
16/10/2025
Main Street Sports Group Inks Deal With Omaha Productions, Launches Original-Con...
16/10/2025
A Historic Precursor? FIFA, HBS, DAZN Offer an Inside Look at Production of FIFA...
16/10/2025
Prime Video Offers Sneak Peak at New NBA on Prime StudioThe massive 13,000-sq-ft, two-story studio features a LED regulation half court and hoopBy Jason Dachman...
16/10/2025
SVG Remote Production Forum Draws Record Crowd for Visit to PGA TOUR Studios, De...
16/10/2025
BitFire's Ben Grafchik on How Growing Cloud Workflows Are Impacting the Live...
16/10/2025
AI technology is advancing quickly, bringing both new creative possibilities and...
16/10/2025
In 2017, Imani Ellis launched CultureCon, a conference that's become a must-attend event for more than 10,000 diverse creatives and Black professionals to c...
16/10/2025
It might still be a little early to break out the tinsel and mistletoe, but Spotify's already queuing up some holiday magic. This year's Spotify Singles...
16/10/2025
Earlier this year, our in-house publishing imprint, Spotify Audiobooks, put out ...
16/10/2025
VAMPIRE has been integrated onto GM Defenses Infantry Squad Vehicle (ISV), providing a mobile solution to effectively and affordably counter small drone threat...
16/10/2025
The AgilePod mounted on the host aircraft....
16/10/2025
60% say infotainment systems are a critical purchasing or leasing consideration,...