
Deploying generative AI in the enterprise is about to get easier than ever.
NVIDIA NIM, a set of generative AI inference microservices, works with KServe, open-source software that automates putting AI models to work at the scale of a cloud computing application.
The combination ensures generative AI can be deployed like any other large enterprise application. It also makes NIM widely available through platforms from dozens of companies, such as Canonical, Nutanix and Red Hat.
The integration of NIM on KServe extends NVIDIA's technologies to the open-source community, ecosystem partners and customers. Through NIM, they can all access the performance, support and security of the NVIDIA AI Enterprise software platform with an API call - the push-button of modern programming.
Serving AI on Kubernetes KServe got its start as part of Kubeflow, a machine learning toolkit based on Kubernetes, the open-source system for deploying and managing software containers that hold all the components of large distributed applications.
As Kubeflow expanded its work on AI inference, what became KServe was born and ultimately evolved into its own open-source project.
Many companies have contributed to and adopted the KServe software that runs today at companies including AWS, Bloomberg, Canonical, Cisco, Hewlett Packard Enterprise, IBM, Red Hat, Zillow and NVIDIA.
Under the Hood With KServe KServe is essentially an extension of Kubernetes that runs AI inference like a powerful cloud application. It uses a standard protocol, runs with optimized performance and supports PyTorch, Scikit-learn, TensorFlow and XGBoost without users needing to know the details of those AI frameworks.
The software is especially useful these days, when new large language models (LLMs) are emerging rapidly.
KServe lets users easily go back and forth from one model to another, testing which one best suits their needs. And when an updated version of a model gets released, a KServe feature called canary rollouts automates the job of carefully validating and gradually deploying it into production.
Another feature, GPU autoscaling, efficiently manages how models are deployed as demand for a service ebbs and flows, so customers and service providers have the best possible experience.
An API Call to Generative AI The goodness of KServe is now available with the ease of NVIDIA NIM.
With NIM, a simple API call takes care of all the complexities. Enterprise IT admins get the metrics they need to ensure their application is running with optimal performance and efficiency, whether it's in their data center or on a remote cloud service - even if they change the AI models they're using.
NIM lets IT professionals become generative AI pros, transforming their company's operations. That's why a host of enterprises such as Foxconn and ServiceNow are deploying NIM microservices.
NIM Rides Dozens of Kubernetes Platforms Thanks to its integration with KServe, users will be able access NIM on dozens of enterprise platforms such as Canonical's Charmed KubeFlow and Charmed Kubernetes, Nutanix GPT-in-a-Box 2.0, Red Hat's OpenShift AI and many others.
Red Hat has been working with NVIDIA to make it easier than ever for enterprises to deploy AI using open source technologies, said KServe contributor Yuan Tang, a principal software engineer at Red Hat. By enhancing KServe and adding support for NIM in Red Hat OpenShift AI, we're able to provide streamlined access to NVIDIA's generative AI platform for Red Hat customers.
Through the integration of NVIDIA NIM inference microservices with Nutanix GPT-in-a-Box 2.0, customers will be able to build scalable, secure, high-performance generative AI applications in a consistent way, from the cloud to the edge, said the vice president of engineering at Nutanix, Debojyoti Dutta, whose team contributes to KServe and Kubeflow.
As a company that also contributes significantly to KServe, we're pleased to offer NIM through Charmed Kubernetes and Charmed Kubeflow, said Andreea Munteanu, MLOps product manager at Canonical. Users will be able to access the full power of generative AI, with the highest performance, efficiency and ease thanks to the combination of our efforts.
Dozens of other software providers can feel the benefits of NIM simply because they include KServe in their offerings.
Serving the Open-Source Community NVIDIA has a long track record on the KServe project. As noted in a recent technical blog, KServe's Open Inference Protocol is used in NVIDIA Triton Inference Server, which helps users run many AI models simultaneously across many GPUs, frameworks and operating modes.
With KServe, NVIDIA focuses on use cases that involve running one AI model at a time across many GPUs.
As part of the NIM integration, NVIDIA plans to be an active contributor to KServe, building on its portfolio of contributions to open-source software that includes Triton and TensorRT-LLM. NVIDIA is also an active member of the Cloud Native Computing Foundation, which supports open-source code for generative AI and other projects.
Try the NIM API on the NVIDIA API Catalog using the Llama 3 8B or Llama 3 70B LLM models today. Hundreds of NVIDIA partners worldwide are using NIM to deploy generative AI.
Watch NVIDIA founder and CEO Jensen Huang's COMPUTEX keynote to get the latest on AI and more.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
22/10/2025
Prime Video Inks Deal To Present NFL Black Friday Game Worldwide By SVG Staff
Wednesday, October 22, 2025 - 10:06 am
Print This Story | Subscribe
Story ...
22/10/2025
NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REM...
22/10/2025
FloSports Empowers Division II, III Athletic Departments With Turnkey Production...
22/10/2025
Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Vide...
22/10/2025
Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ba...
22/10/2025
In 1995, a young Colombian artist released an album that would change Latin pop ...
22/10/2025
Over the past few months, a photovoltaic system has been installed on a three-he...
22/10/2025
The Orion spacecraft for NASA's Artemis II mission is stacked on the Space Launch System (SLS) rocket in High Bay 3 of the Vehicle Assembly Building at Kenn...
22/10/2025
L3Harris' Hybrid SATCOM is resilient by design, offering path diversity that eliminates vulnerabilities by routing data across the best available networks i...
22/10/2025
WASHINGTON, D.C. Organizers of NAB Show New York said they are expecting more than 12,000 registered attendees from about 100 countries along with 260 exhibitor...
22/10/2025
WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...
22/10/2025
Masque Sound, a leading theatrical sound reinforcement, installation and design company, supplied an extensive gear package of professional-grade equipment for ...
22/10/2025
Lightware, a global leader in signal management and AV connectivity solutions, is seeing strong market momentum for the UCX-3x3-TPX-RX20, a compact transmitter-...
22/10/2025
MELVILLE, N.Y. Chyron has released PAINT 10.2, the latest update for its telestration platform, adding support for SMPTE ST 2110 IP workflows, expanding brandin...
22/10/2025
WASHINGTON Run3TV today said NBCUniversal is joining as an investor in the ATSC 3.0 Framework Authority, which develops the Run3TV NextGen TV application platfo...
22/10/2025
ATLANTA swXtch.io will feature two new networking solutions extending the company's reach across more cloud and on-prem workflows at NAB Show New York, set ...
22/10/2025
The Warner Bros. Discoverys HBO Max streaming services has increased prices for all its streaming tiers effectively immediately for new customers. Existing cust...
22/10/2025
LOS ANGELES OpenDrives has signed a new distribution partnership deal with Versatile Distribution Services (VDS) to strengthen its channel and streamline how it...
22/10/2025
WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...
22/10/2025
Samora Pinderhughes Brings Immersive Sound to Berklee's Signature Series The artist and composer, who's worked with Herbie Hancock, Robert Glasper, Co...
22/10/2025
BMI Day at Berklee Celebrates Composer Fil Eisler and Awards Scholarship to Stud...
22/10/2025
October 22nd, 2025 TRIBECA ANNOUNCES STAR-STUDDED LINEUP OF MEMBERSHIP EVENTS F...
22/10/2025
Rohde & Schwarz and TRUMPF cooperate in drone defense Rohde & Schwarz and TRUMPF partner to deliver a comprehensive drone defense solution combining Rohde & S...
22/10/2025
European Broadcaster Upgrades To Grass Valley's NativeIP LDX 135 Cameras And...
22/10/2025
Everyone Gets a Better Deal on Verizon with New FOX One Perk The $15 FOX One streaming service perk is yet another way Verizon continues to add savings for cu...
22/10/2025
First Look Hidden Assets Series 3
Premieres on 9th November 9:30pm on RT One & RT Player
WATCH HERE Promo Link: Hidden Assets Series 3 | RT
The Crimin...
21/10/2025
NAB New York 2025: Although AES Show Is on Its Own, Audio Will Be a Major Part o...
21/10/2025
NAB New York 2025: Business of Broadcast and Media,' Future of Content'...
21/10/2025
SVG All-Stars: Ethan Folz, Senior Director, Digital Operations and Quality of Ex...
21/10/2025
NBA on NBC/Peacock: Livestream Offers Graphic Overlays, Predictive Gaming, Ancil...
21/10/2025
NBA on NBC/Peacock: At the Front Bench With Producer Frank DiGraci and Director ...
21/10/2025
NBA on NBC/Peacock: NBC Sports, NEP Build Ultra-Flexible Production Plan That Se...
21/10/2025
Indigenous storytelling has been at the heart of the work of the Sundance Instit...
21/10/2025
Top L-R: Mysterious Skin, American Dream Second Row L-R: Little Miss Sunshine, D...
21/10/2025
Last week, Spotify and Columbia Records transformed Pier 4 at the Brooklyn Army ...
21/10/2025
SBS Learn's Dharug Ngurra resource empowers classrooms to meaningfully celeb...
21/10/2025
As global operators simplify and evolve their digital platforms, NPS improvement...
21/10/2025
Critical Design Review completion is a key milestone on the path toward Wideband Global Satellite Communications certification for the network in 2026, opening ...
21/10/2025
New Australian undersea training range to implement and improve warfighting tactics, proficiency and safety; enable joint/allied training that contributes to pr...
21/10/2025
eds3_5_jq(document).ready(function($) { $(#eds_sliderM519).chameleonSlider_2_1({...
21/10/2025
September, the beginning of autumn, brought an expected revival to the TV market, largely due to the new fall TV schedules. The time spent in front of the TV sc...
21/10/2025
Broadcast Booms with 20% Uptick vs. August, Achieving Largest Monthly
Increase ...
21/10/2025
During September, streaming's share of TV viewing in Mexico settled at 24.5%, a marginal shift of -0.5 share points from the previous month.
Disclaimer: YU...
21/10/2025
RENNES, France BBright and GlobalM have conducted a technical trial validating Ultra HD interoperability across the entire contribution chain in the cloud, achi...
21/10/2025
Kokusai Denki Electric America will mark the U.S. debut of a new 4K camera at the 2025 NAB New York, Oct. 22-23. Now available, the Z-HD6500-S1 UHD/HD productio...
21/10/2025
Triveni Digital, a trusted leader in ATSC 1.0 and 3.0 service delivery, data broadcasting, and quality assurance solutions, will showcase its entire NEXTGEN TV ...
21/10/2025
Radio Azzurra FM, longest-running radio station in the province of Novara, northwest Italy, has invested in an DHD SX2 audio routing and mixing console for inte...
21/10/2025
Globecast, the leading provider of broadcast, media and entertainment managed services, has announced the appointment of G Morgan as Executive Vice President of...