Sony Pixel Power calrec Sony

KServe Providers Dish Up NIMble Inference in Clouds and Data Centers

02/06/2024

Deploying generative AI in the enterprise is about to get easier than ever.

NVIDIA NIM, a set of generative AI inference microservices, works with KServe, open-source software that automates putting AI models to work at the scale of a cloud computing application.

The combination ensures generative AI can be deployed like any other large enterprise application. It also makes NIM widely available through platforms from dozens of companies, such as Canonical, Nutanix and Red Hat.

The integration of NIM on KServe extends NVIDIA's technologies to the open-source community, ecosystem partners and customers. Through NIM, they can all access the performance, support and security of the NVIDIA AI Enterprise software platform with an API call - the push-button of modern programming.

Serving AI on Kubernetes KServe got its start as part of Kubeflow, a machine learning toolkit based on Kubernetes, the open-source system for deploying and managing software containers that hold all the components of large distributed applications.

As Kubeflow expanded its work on AI inference, what became KServe was born and ultimately evolved into its own open-source project.

Many companies have contributed to and adopted the KServe software that runs today at companies including AWS, Bloomberg, Canonical, Cisco, Hewlett Packard Enterprise, IBM, Red Hat, Zillow and NVIDIA.

Under the Hood With KServe KServe is essentially an extension of Kubernetes that runs AI inference like a powerful cloud application. It uses a standard protocol, runs with optimized performance and supports PyTorch, Scikit-learn, TensorFlow and XGBoost without users needing to know the details of those AI frameworks.

The software is especially useful these days, when new large language models (LLMs) are emerging rapidly.

KServe lets users easily go back and forth from one model to another, testing which one best suits their needs. And when an updated version of a model gets released, a KServe feature called canary rollouts automates the job of carefully validating and gradually deploying it into production.

Another feature, GPU autoscaling, efficiently manages how models are deployed as demand for a service ebbs and flows, so customers and service providers have the best possible experience.

An API Call to Generative AI The goodness of KServe is now available with the ease of NVIDIA NIM.

With NIM, a simple API call takes care of all the complexities. Enterprise IT admins get the metrics they need to ensure their application is running with optimal performance and efficiency, whether it's in their data center or on a remote cloud service - even if they change the AI models they're using.

NIM lets IT professionals become generative AI pros, transforming their company's operations. That's why a host of enterprises such as Foxconn and ServiceNow are deploying NIM microservices.

NIM Rides Dozens of Kubernetes Platforms Thanks to its integration with KServe, users will be able access NIM on dozens of enterprise platforms such as Canonical's Charmed KubeFlow and Charmed Kubernetes, Nutanix GPT-in-a-Box 2.0, Red Hat's OpenShift AI and many others.

Red Hat has been working with NVIDIA to make it easier than ever for enterprises to deploy AI using open source technologies, said KServe contributor Yuan Tang, a principal software engineer at Red Hat. By enhancing KServe and adding support for NIM in Red Hat OpenShift AI, we're able to provide streamlined access to NVIDIA's generative AI platform for Red Hat customers.

Through the integration of NVIDIA NIM inference microservices with Nutanix GPT-in-a-Box 2.0, customers will be able to build scalable, secure, high-performance generative AI applications in a consistent way, from the cloud to the edge, said the vice president of engineering at Nutanix, Debojyoti Dutta, whose team contributes to KServe and Kubeflow.

As a company that also contributes significantly to KServe, we're pleased to offer NIM through Charmed Kubernetes and Charmed Kubeflow, said Andreea Munteanu, MLOps product manager at Canonical. Users will be able to access the full power of generative AI, with the highest performance, efficiency and ease thanks to the combination of our efforts.

Dozens of other software providers can feel the benefits of NIM simply because they include KServe in their offerings.

Serving the Open-Source Community NVIDIA has a long track record on the KServe project. As noted in a recent technical blog, KServe's Open Inference Protocol is used in NVIDIA Triton Inference Server, which helps users run many AI models simultaneously across many GPUs, frameworks and operating modes.

With KServe, NVIDIA focuses on use cases that involve running one AI model at a time across many GPUs.

As part of the NIM integration, NVIDIA plans to be an active contributor to KServe, building on its portfolio of contributions to open-source software that includes Triton and TensorRT-LLM. NVIDIA is also an active member of the Cloud Native Computing Foundation, which supports open-source code for generative AI and other projects.

Try the NIM API on the NVIDIA API Catalog using the Llama 3 8B or Llama 3 70B LLM models today. Hundreds of NVIDIA partners worldwide are using NIM to deploy generative AI.

Watch NVIDIA founder and CEO Jensen Huang's COMPUTEX keynote to get the latest on AI and more.
LINK: https://blogs.nvidia.com/blog/kserve-nim-inference/...
See more stories from nvidia

Most recent headlines

06/12/2025

L3Harris Chair and CEO Appears on CNBC at Reagan National Defense Forum

In a live broadcast from the Reagan National Defense Forum, L3Harris Chair and CEO Christopher Kubasik joined Morgan Brennan on CNBCs Closing Bell: Overtime. Ku...

06/12/2025

Survey: M&E Embraces Horizontally Integrated Media Archiving Approach

FORT LAUDERDALE, Fla. A new survey from Pixitmedia by Datacore revealed a major shift in the Media & Entertainment industry in media archiving, with 85% of resp...

06/12/2025

Czech TV Deploys LiveU Solutions in 10 OB Vans

HACKENSACK, N.J. LiveU has announced that the national public broadcaster Czech Television has completed one of the largest LiveU live production deployments fo...

06/12/2025

NATAS Celebrates 76th Technology & Engineering Emmy Award Honorees

NEW YORK The National Academy of Television Arts and Sciences (NATAS) presented the Excellence in Production Technology Emmy Award to NASA+ and Dr. Tom Leight...

05/12/2025

2025 Sports Broadcasting Hall of Fame: Curt Gowdy Jr. - Master Storyteller, Nationally and Regionally

2025 Sports Broadcasting Hall of Fame: Curt Gowdy Jr. - Master Storyteller, Nati...

05/12/2025

SVG Sit-Down: Veritone's Sean King on the Power of Mining Video, Audio Data

SVG Sit-Down: Veritone's Sean King on the Power of Mining Video, Audio DataThe company's Data Refinery offers users total control and governance over da...

05/12/2025

Platinum White Paper: Inside the Nashville Predators' Unified, Flexible, Scalable Production System with Ross Video

Platinum White Paper: Inside the Nashville Predators' Unified, Flexible, Sca...

05/12/2025

Netflix Reaches Agreement To Acquire Warner Bros. Following Planned WBD Split

Netflix Reaches Agreement To Acquire Warner Bros. Following Planned WBD SplitThe deal does not include WBDs sports assets like TNT Sports (US, UK, LatAm), Euros...

05/12/2025

FOX Sports Returns to Indianapolis for Primetime Broadcast of Big Ten Championship

FOX Sports Returns to Indianapolis for Primetime Broadcast of Big Ten Championsh...

05/12/2025

SVG Summit 2025 Preview: Digital Engagement & Monetization Workshop Tackles the Future of the Viewer Experience

SVG Summit 2025 Preview: Digital Engagement & Monetization Workshop Tackles the ...

05/12/2025

Atlanta United Lights Up New Emory Healthcare Studio With First Live Broadcast for World Cup Draw

Atlanta United Lights Up New Emory Healthcare Studio With First Live Broadcast f...

05/12/2025

As Messi Takes the Pitch, MLS, Apple, NEP Roll Out Largest MLS Cup Production Ever

As Messi Takes the Pitch, MLS, Apple, NEP Roll Out Largest MLS Cup Production Ev...

05/12/2025

ESPN Enters College Football's Most Intense Month With Elevated Workflows for Championship Week

ESPN Enters College Football's Most Intense Month With Elevated Workflows fo...

05/12/2025

Sorry, Baby, Peter Hujar's Day, Among Sundance Institute-Supported Projects Nominated for 2026 Film Independent Spirit Awards

It's about that time! Awards season is in full swing, and the Film Independe...

05/12/2025

Surprised by Your 2025 Wrapped? Here's a Look at How the Data Comes to Life

Every year, Spotify Wrapped offers a personalized look back at the audio that defined your year. It's a snapshot of your listening habits, designed to tell ...

05/12/2025

Celebrating Spotify's GLOW, RADAR, and EQUAL Artists of 2025

In 2025, Spotify's EQUAL, GLOW, and RADAR programs celebrated women, LGBTQIA , and emerging artists who turned moments into milestones. From breaking record...

05/12/2025

Wi-Fi 7 - Go Beyond Speed to Deliver Security, Trust and Value

In our latest blog, we explain how Wi-Fi 7 rollouts can drive consumer loyalty with value-add services such as consumer cybersecurity. We also explore how this ...

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7 Billon

LOS ANGELES Netflix announced it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Gracenote Launches New CTV Ad Platform

NEW YORK Nielsens Gracenote has launched Gracenote Content Connect, a new ad platform that provides agencies, brands, supply-side platforms (SSPs) and demand-si...

05/12/2025

IAB Tech Lab Releases Deals API

NEW YORK In an most important update to the workings of deal-based programmatic advertising, IAB Tech Lab has released version 1.0 of its Deals API for public c...

05/12/2025

Nielsen: NFL Thanksgiving Games Score Big Audiences

NEW YORK Pass the turkey. Pass the stuffing. Pass the cranberry sauce. All are common requests of Americans celebrating Thanksgiving Day with family and f...

05/12/2025

Iris Cloud-Connected Camera Control Platform Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7B

HOLLYWOOD, Calif. Netflix announced today that it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Iris Cloud-Connected Camera Control Platform Is Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

FCC Approves AT&T's $1 Billion Acquisition of UScellular Spectrum

WASHINGTON The Federal Communications Commission has approved AT&T's $1.02 billion acquisition of spectrum from UScellular in a decision that was issued sho...

05/12/2025

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars From Yellow to Viva La Vida, Fix You to Paradise, this playlist goes back to the start. December ...

05/12/2025

Zafris Lecture Series Brings Nabil Ayers to Berklee

Zafris Lecture Series Brings Nabil Ayers to Berklee The 32nd annual James G. Zafris Distinguished Lecture series was held on Thursday, November 13, with guest...

05/12/2025

Introducing New Perks to Help You Get Even More from...

Introducing New Perks to Help You Get Even More from LinkedIn Premium Published on Dec 5, 2025 Categories: Company News, Product News LinkedIn Corporate Co...

05/12/2025

A new Game of Thrones Tale: Official trailer for Sky Exclusive series A Knight of the Seven Kingdoms lands today

Friday 5 December 2025 A new Game of Thrones Tale: Official trailer for Sky Exc...

05/12/2025

Don Lee, Lee Jin-uk, and Lalisa Manobal to Star in Netflix Action Thriller 'TYGO'

Back to All News Don Lee, Lee Jin-uk, and Lalisa Manobal to Star in Netflix Act...

05/12/2025

Give Back Fridays with a twist is back for 2025!

Tis the season of giving once again and this year we've taken our Give Back Fridays' concept and turned it on its head. In the autumn we were approach...

05/12/2025

2025-11-06

Brayden Gogis doesn't remember a time when he wasn't completely fixated on games in all forms. In preschool, when they asked us to dress up as what we ...

05/12/2025

The Grinch steals the spotlight as the theme for The Late Late Toy Show 2025

The Grinch steals the spotlight as the theme for The Late Late Toy Show 2025 Tune in tonight at 9:35pm on RT One and worldwide on RT Player #LateLateToyShow...

05/12/2025

RT Announces New Presenters of Flagship News Programmes

RT Announces New Presenters of Flagship News Programmes New RT Six One News co-presenter Tommy Meskill Sarah McInerney & Justin McCarthy join Morning Ir...

04/12/2025

ToolsOnAir Blackmagic Design HyperDeck Event Presets for just:in mac pro 2025 & just:in linux

ToolsOnAir Blackmagic Design HyperDeck Event Presets for just:in mac pro 2025 & ...

04/12/2025

ToolsOnAir AJA Ki Pro Event Presets for just:in mac pro 2025 & just:in linux

ToolsOnAir AJA Ki Pro Event Presets for just:in mac pro 2025 & just:in linux More Details:Starting with version 5.5, both just:in mac pro and just:in linux sol...

04/12/2025

Young Journalist finalists looking to the future

Wangu Kanuri from Kenya and Godwin Asediba from Ghana are two of this years finalists for Thomsons Young Journalist of the Year Award. The pair are runners-up i...

04/12/2025

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be a Game-Changer for Live Sports Production

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be...

04/12/2025

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Content & Strategic Communications, Harvard University

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Co...

04/12/2025

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Sports Storytelling

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Spor...

04/12/2025

London to Riyadh: DAZN Brings the Boxing Glamour to New Production Levels for Benavidez v Yarde in Saudi Arabia

London to Riyadh: DAZN brings the boxing glamour to new production levels for Be...

04/12/2025

Analysis: Paramount Bets on the Battering Ram' with Champions League Play

Analysis: Paramount bets on the battering ram' with Champions League play By Callum McCarthy, Editor-at-Large Tuesday, December 2, 2025 - 10:12 Print ...

04/12/2025

Space City Home Network Launches SCHN+ DTC App for Astros and Rockets

Space City Home Network Launches SCHN DTC App for Astros and RocketsThe Rockets and Astros were previously the lone NBA and MLB teams without a DTC appBy Jason...

04/12/2025

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Sports Media Supply Chain

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Spor...

04/12/2025

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolution Taking Place in Sports Broadcasting

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolu...

04/12/2025

Curt Gowdy Jr. - Master Storyteller, Nationally and Regionally

Curt Gowdy Jr. - Master Storyteller, Nationally and RegionallyBy Jason Dachman, Editorial Director, U.S. Thursday, December 4, 2025 - 1:52 pm Print This Sto...

04/12/2025

Cutting Through Rocks ( ) Shows the Difference That One Person Can Make for Change

(L-R) Rebecca Lichtenfeld, Mohammadreza Eyni, Sara Khaki, and Judith Helfand att...

04/12/2025

SBS launches Future Frames initiative to support emerging First Nations video editing talent

SBS launches Future Frames initiative to support emerging First Nations video ed...