Sony Pixel Power calrec Sony

KServe Providers Dish Up NIMble Inference in Clouds and Data Centers

02/06/2024

Deploying generative AI in the enterprise is about to get easier than ever.

NVIDIA NIM, a set of generative AI inference microservices, works with KServe, open-source software that automates putting AI models to work at the scale of a cloud computing application.

The combination ensures generative AI can be deployed like any other large enterprise application. It also makes NIM widely available through platforms from dozens of companies, such as Canonical, Nutanix and Red Hat.

The integration of NIM on KServe extends NVIDIA's technologies to the open-source community, ecosystem partners and customers. Through NIM, they can all access the performance, support and security of the NVIDIA AI Enterprise software platform with an API call - the push-button of modern programming.

Serving AI on Kubernetes KServe got its start as part of Kubeflow, a machine learning toolkit based on Kubernetes, the open-source system for deploying and managing software containers that hold all the components of large distributed applications.

As Kubeflow expanded its work on AI inference, what became KServe was born and ultimately evolved into its own open-source project.

Many companies have contributed to and adopted the KServe software that runs today at companies including AWS, Bloomberg, Canonical, Cisco, Hewlett Packard Enterprise, IBM, Red Hat, Zillow and NVIDIA.

Under the Hood With KServe KServe is essentially an extension of Kubernetes that runs AI inference like a powerful cloud application. It uses a standard protocol, runs with optimized performance and supports PyTorch, Scikit-learn, TensorFlow and XGBoost without users needing to know the details of those AI frameworks.

The software is especially useful these days, when new large language models (LLMs) are emerging rapidly.

KServe lets users easily go back and forth from one model to another, testing which one best suits their needs. And when an updated version of a model gets released, a KServe feature called canary rollouts automates the job of carefully validating and gradually deploying it into production.

Another feature, GPU autoscaling, efficiently manages how models are deployed as demand for a service ebbs and flows, so customers and service providers have the best possible experience.

An API Call to Generative AI The goodness of KServe is now available with the ease of NVIDIA NIM.

With NIM, a simple API call takes care of all the complexities. Enterprise IT admins get the metrics they need to ensure their application is running with optimal performance and efficiency, whether it's in their data center or on a remote cloud service - even if they change the AI models they're using.

NIM lets IT professionals become generative AI pros, transforming their company's operations. That's why a host of enterprises such as Foxconn and ServiceNow are deploying NIM microservices.

NIM Rides Dozens of Kubernetes Platforms Thanks to its integration with KServe, users will be able access NIM on dozens of enterprise platforms such as Canonical's Charmed KubeFlow and Charmed Kubernetes, Nutanix GPT-in-a-Box 2.0, Red Hat's OpenShift AI and many others.

Red Hat has been working with NVIDIA to make it easier than ever for enterprises to deploy AI using open source technologies, said KServe contributor Yuan Tang, a principal software engineer at Red Hat. By enhancing KServe and adding support for NIM in Red Hat OpenShift AI, we're able to provide streamlined access to NVIDIA's generative AI platform for Red Hat customers.

Through the integration of NVIDIA NIM inference microservices with Nutanix GPT-in-a-Box 2.0, customers will be able to build scalable, secure, high-performance generative AI applications in a consistent way, from the cloud to the edge, said the vice president of engineering at Nutanix, Debojyoti Dutta, whose team contributes to KServe and Kubeflow.

As a company that also contributes significantly to KServe, we're pleased to offer NIM through Charmed Kubernetes and Charmed Kubeflow, said Andreea Munteanu, MLOps product manager at Canonical. Users will be able to access the full power of generative AI, with the highest performance, efficiency and ease thanks to the combination of our efforts.

Dozens of other software providers can feel the benefits of NIM simply because they include KServe in their offerings.

Serving the Open-Source Community NVIDIA has a long track record on the KServe project. As noted in a recent technical blog, KServe's Open Inference Protocol is used in NVIDIA Triton Inference Server, which helps users run many AI models simultaneously across many GPUs, frameworks and operating modes.

With KServe, NVIDIA focuses on use cases that involve running one AI model at a time across many GPUs.

As part of the NIM integration, NVIDIA plans to be an active contributor to KServe, building on its portfolio of contributions to open-source software that includes Triton and TensorRT-LLM. NVIDIA is also an active member of the Cloud Native Computing Foundation, which supports open-source code for generative AI and other projects.

Try the NIM API on the NVIDIA API Catalog using the Llama 3 8B or Llama 3 70B LLM models today. Hundreds of NVIDIA partners worldwide are using NIM to deploy generative AI.

Watch NVIDIA founder and CEO Jensen Huang's COMPUTEX keynote to get the latest on AI and more.
LINK: https://blogs.nvidia.com/blog/kserve-nim-inference/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

06/02/2026

Chris Myers Joins Net Insight as SVP of Sales, Americas

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Sen. Cruz Announces Hearing on Broadcast Media Ownership Rules

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

NAB Show Relocates TV and Radio HQ To LVCC Central Hall

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Sony Solutions Widely Deployed for Super Bowl LX in San Francisco

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Telemundo Puerto Rico Launches In Mainland U.S.

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

February 05, 2026

How invisible vaccine scaffolding boosts HIV immune response Scripps Research scientists designed a DNA scaffold that carries HIV vaccine proteins into the bo...

05/02/2026

Tech Focus: Wireless Audio, Part 2 - RF Mics Have a Key Role in Sports Broadcasting

Three examples of how wireless microphones are deployed to bring fans in deep an...

05/02/2026

Samsung's Galaxy S25 Ultra Camera To Capture the Opening Ceremony

Broadcast coverage will include 25 cameras distributed around the venues, including to some athletes; Galaxy AI Interpreter will also be deployed The Opening C...

05/02/2026

Kiswe to Power Mountain West Conference's New Direct-to-Consumer Streaming Platform

Kiswe has partnered with the Mountain West Conference to power the next iteratio...

05/02/2026

NBCUniversal, Roku Launch the NBC Winter Olympics Experience

NBCUniversal and Roku announce the launch of the 2026 NBC Winter Olympics Experience, a destination delivering NBCUniversal's comprehensive CTV coverage of ...

05/02/2026

Vizrt Transforms Corporate Communications with AI-Powered Augmented Reality in Zoom

Vizrt, which specializes in live production technology as well as transforming v...

05/02/2026

Canon Intros RF7-14mm Fisheye Zoom, RF14mm Prime Lens

Canon USA has launched the RF7-14mm F2.8-3.5 L fisheye STM zoom lens and the RF14mm F1.4 L VCM prime lens. Building on Canon's legacy of innovative optics, ...

05/02/2026

UMass Lowell's Tsongas Center Upgrades with Ikegami UHK-X600 Cameras

The Paul E. Tsongas Center at UMass Lowell in Massachusetts has chosen Ikegami cameras for incorporation into its broadcast-quality television production facili...

05/02/2026

Exchange, NBCUniversal to Provide Service Members with Free Streaming of Winter Olympics

Once again, service members and Veterans worldwide will enjoy free access to NBC...

05/02/2026

Advanced Systems Group Appoints Industry Veteran Derek Pezzotti to Lead Sports and Venue Market Growth

Advanced Systems Group, LLC (ASG), a technology and services provider for media ...

05/02/2026

Broadcast Management Group Expands Management Team to Support Managed Services and Live Production Growth

Broadcast Management Group (BMG) is strengthening its leadership team to support...

05/02/2026

NBC Sports Selects Comcast Technology Solutions for Production of Winter Olympics

NBC Sports selects Comcast Technology Solutions (CTS) to provide multiscreen vid...

05/02/2026

AIM Sports Group Enhances AIM Sportsplex With Spiideo's Advanced Automated Video Technology

AIM Sports Group, a sports enterprise dedicated to elevating youth athletics thr...

05/02/2026

Inside the 2026 Milano Cortina IBC: How Tech Makes a Difference for Rightsholders, Fans, the Environment

Designed for efficient use of shared services and resources, the home of OBS pro...

05/02/2026

SVG Students To Watch: Brandon Malin, University of Michigan

The Yankees fan from Connecticut is executive producer of BTN StudentU for the Wolverines In the live-sports-video industry, the future is bright. Our series S...

05/02/2026

OBS Is Ready To Deliver for Milano Cortina Opening Ceremony

In an Olympic first, the ceremony will be held in four locations simultaneously...

05/02/2026

Remembering Charlie Jablonski, an Olympic Broadcasting Legend

Members of the broadcast and tech communities share four decades of memories of the technology leader The 2026 Milano Cortina Olympics are upon us, and every O...

05/02/2026

NBC Sports Has an Army of Technology Providers Supporting Winter Olympics Production

Key vendors include Appear, Audio-Technica, Canon, Chyron, Cisco, Comcast Techno...

05/02/2026

Spotify Partners With Bookshop.org and Debuts Page Match Feature to Bridge Physical, E-book, and Audio Formats

Since bringing audiobooks to Spotify in 2022, we've helped listeners discove...

05/02/2026

How to Use Page Match to Seamlessly Switch Between a Book and Its Audiobook on Spotify

Today, Spotify announced two new updates to give book lovers a more personalized...

05/02/2026

HC-130J Aircraft Enhances Coast Guard Readiness

A U.S. Coast Guard HC-130J aircraft during a test flight at L3Harris' facility in Waco, Texas....

05/02/2026

Al Seer Marine and L3Harris Deepen Strategic Agreement to Advance Maritime Unmanned Systems in the Middle East

Al Seer Marine and L3Harris have announced a strategic partnership combining UAE...

05/02/2026

Well-Thought-Out UX: The Quiet Power Behind Our Latest Improvements

AI should balance automation (replacing tasks) and augmentation (empowering humans). Automate the mundane and augment the creative by applying the right AI type...

05/02/2026

Football And Younger Viewers Drive Ad Supported TV Viewing To 2025 High, Nielsen's Q4 2025 Ad Supported Gauge Finds

During this period, streaming comprised the majority of ad supported TV (45.6%),...

05/02/2026

New Orlando TV Station Focuses On Puerto Rican Viewers

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Teads, Google TV Partner To Grow CTV HomeScreen Ad Availability

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Advanced Systems Group Appoints Industry Veteran Derek Pe...

Advanced Systems Group, LLC (ASG), a technology and services provider for media creatives and content owners, announced the appointment of Derek Pezzotti as Sen...

05/02/2026

Taurus Technologies Elevates Podcast Production with Brig...

Taurus Technologies, a Dallas-area professional AV systems integrator, has upgraded its in-house podcast studio with Brightline Lighting's AV/720 low-voltag...

05/02/2026

NBC Sports Selects Production Infrastructure and Signal P...

NBC Universal to Present XXV Olympic Winter Games Feb. 6-22 and Milan Cortina Paralympics March 6-15 NBC Sports to Utilize Grass Valley's Frame Rate Conver...

05/02/2026

Atomos Unveils All New Shogun AV-19

Atomos today announced Shogun AV-19, a rack-mountable, 19-inch 4K HDR monitor-recorder-switcher designed for professional live production, broadcast, and video ...

05/02/2026

Vizrt revolutionizes corporate communications with AI-pow...

Vizrt, the leader in live production technology, revolutionizing viewer experience and engagement, today introduces two brand new solutions in partnership with ...

05/02/2026

Appear Appoints Simon Frost as Chief Marketing Officer to...

Appear, a global leader in live production technology, today announced the appointment of Simon Frost in a newly created role as Chief Marketing Officer (CMO). ...

05/02/2026

Noah Chamis ICLS Illuminates Only Murders in the Building...

New York gaffer Noah Chamis, ICLS ( You Deserve Each Other , The Half of It , Project Runway ) practices a mix of technical precision and creative play in his...

05/02/2026

NBC Sports Deploys Audio-Technica Microphones for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Hemisphere Media Group, Entravision Launch WAPA Orlando

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

SMT Providing Timing And Production Data Services for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

BBC Studios and UKTV appoint Karin Marelle as Global Head of Acquisitions

BBC Studios and UKTV have appointed Karin Marelle to lead their Global Acquisitions team, overseeing the sourcing of content across BBC Studios' global chan...

05/02/2026

The Miniature Wife, A Sky Exclusive comedy drama starring Elizabeth Banks and Matthew Macfadyen, to land on 9 April

Thursday 5 February 2026 The Miniature Wife, A Sky Exclusive comedy drama starr...

05/02/2026

Trailer Revealed for Sky Original Film FUZE starring Aaron Taylor-Johnson, Theo James, Gugu Mbatha-Raw and Sam Worthington

Thursday 5 February 2026 Trailer Revealed for Sky Original Film FUZE starring A...

05/02/2026

Rohde & Schwarz at MWC Barcelona 2026: Enabling connections, empowering innovations

Rohde & Schwarz at MWC Barcelona 2026: Enabling connections, empowering innovati...