Sony Pixel Power calrec Sony

KServe Providers Dish Up NIMble Inference in Clouds and Data Centers

02/06/2024

Deploying generative AI in the enterprise is about to get easier than ever.

NVIDIA NIM, a set of generative AI inference microservices, works with KServe, open-source software that automates putting AI models to work at the scale of a cloud computing application.

The combination ensures generative AI can be deployed like any other large enterprise application. It also makes NIM widely available through platforms from dozens of companies, such as Canonical, Nutanix and Red Hat.

The integration of NIM on KServe extends NVIDIA's technologies to the open-source community, ecosystem partners and customers. Through NIM, they can all access the performance, support and security of the NVIDIA AI Enterprise software platform with an API call - the push-button of modern programming.

Serving AI on Kubernetes KServe got its start as part of Kubeflow, a machine learning toolkit based on Kubernetes, the open-source system for deploying and managing software containers that hold all the components of large distributed applications.

As Kubeflow expanded its work on AI inference, what became KServe was born and ultimately evolved into its own open-source project.

Many companies have contributed to and adopted the KServe software that runs today at companies including AWS, Bloomberg, Canonical, Cisco, Hewlett Packard Enterprise, IBM, Red Hat, Zillow and NVIDIA.

Under the Hood With KServe KServe is essentially an extension of Kubernetes that runs AI inference like a powerful cloud application. It uses a standard protocol, runs with optimized performance and supports PyTorch, Scikit-learn, TensorFlow and XGBoost without users needing to know the details of those AI frameworks.

The software is especially useful these days, when new large language models (LLMs) are emerging rapidly.

KServe lets users easily go back and forth from one model to another, testing which one best suits their needs. And when an updated version of a model gets released, a KServe feature called canary rollouts automates the job of carefully validating and gradually deploying it into production.

Another feature, GPU autoscaling, efficiently manages how models are deployed as demand for a service ebbs and flows, so customers and service providers have the best possible experience.

An API Call to Generative AI The goodness of KServe is now available with the ease of NVIDIA NIM.

With NIM, a simple API call takes care of all the complexities. Enterprise IT admins get the metrics they need to ensure their application is running with optimal performance and efficiency, whether it's in their data center or on a remote cloud service - even if they change the AI models they're using.

NIM lets IT professionals become generative AI pros, transforming their company's operations. That's why a host of enterprises such as Foxconn and ServiceNow are deploying NIM microservices.

NIM Rides Dozens of Kubernetes Platforms Thanks to its integration with KServe, users will be able access NIM on dozens of enterprise platforms such as Canonical's Charmed KubeFlow and Charmed Kubernetes, Nutanix GPT-in-a-Box 2.0, Red Hat's OpenShift AI and many others.

Red Hat has been working with NVIDIA to make it easier than ever for enterprises to deploy AI using open source technologies, said KServe contributor Yuan Tang, a principal software engineer at Red Hat. By enhancing KServe and adding support for NIM in Red Hat OpenShift AI, we're able to provide streamlined access to NVIDIA's generative AI platform for Red Hat customers.

Through the integration of NVIDIA NIM inference microservices with Nutanix GPT-in-a-Box 2.0, customers will be able to build scalable, secure, high-performance generative AI applications in a consistent way, from the cloud to the edge, said the vice president of engineering at Nutanix, Debojyoti Dutta, whose team contributes to KServe and Kubeflow.

As a company that also contributes significantly to KServe, we're pleased to offer NIM through Charmed Kubernetes and Charmed Kubeflow, said Andreea Munteanu, MLOps product manager at Canonical. Users will be able to access the full power of generative AI, with the highest performance, efficiency and ease thanks to the combination of our efforts.

Dozens of other software providers can feel the benefits of NIM simply because they include KServe in their offerings.

Serving the Open-Source Community NVIDIA has a long track record on the KServe project. As noted in a recent technical blog, KServe's Open Inference Protocol is used in NVIDIA Triton Inference Server, which helps users run many AI models simultaneously across many GPUs, frameworks and operating modes.

With KServe, NVIDIA focuses on use cases that involve running one AI model at a time across many GPUs.

As part of the NIM integration, NVIDIA plans to be an active contributor to KServe, building on its portfolio of contributions to open-source software that includes Triton and TensorRT-LLM. NVIDIA is also an active member of the Cloud Native Computing Foundation, which supports open-source code for generative AI and other projects.

Try the NIM API on the NVIDIA API Catalog using the Llama 3 8B or Llama 3 70B LLM models today. Hundreds of NVIDIA partners worldwide are using NIM to deploy generative AI.

Watch NVIDIA founder and CEO Jensen Huang's COMPUTEX keynote to get the latest on AI and more.
LINK: https://blogs.nvidia.com/blog/kserve-nim-inference/...
See more stories from nvidia

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

22/10/2025

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide By SVG Staff Wednesday, October 22, 2025 - 10:06 am Print This Story | Subscribe Story ...

22/10/2025

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REMCO Shows

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REM...

22/10/2025

FloSports Empowers Division II, III Athletic Departments With Turnkey Production Suite for Livestreaming Production

FloSports Empowers Division II, III Athletic Departments With Turnkey Production...

22/10/2025

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Video Executives in NYC

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Vide...

22/10/2025

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ballgame - and Has Amazing Audio

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ba...

22/10/2025

Celebrate the Anniversaries of Shakira's Landmark Albums With Spotify-Exclusive EP and Video Special

In 1995, a young Colombian artist released an album that would change Latin pop ...

22/10/2025

SGL Carbon expands sustainable energy supply and invests in photovoltaic system at Meitingen site

Over the past few months, a photovoltaic system has been installed on a three-he...

22/10/2025

Orion Meets SLS: L3Harris Technology Ready to go to the Moon

The Orion spacecraft for NASA's Artemis II mission is stacked on the Space Launch System (SLS) rocket in High Bay 3 of the Vehicle Assembly Building at Kenn...

22/10/2025

Hybrid SATCOM: Delivering Resilient and Agile Connectivity Today

L3Harris' Hybrid SATCOM is resilient by design, offering path diversity that eliminates vulnerabilities by routing data across the best available networks i...

22/10/2025

The 2025 NAB Show New York Opens With More Than 12,000 Attendees Expected

WASHINGTON, D.C. Organizers of NAB Show New York said they are expecting more than 12,000 registered attendees from about 100 countries along with 260 exhibitor...

22/10/2025

The 2025 NAB Show New York Set to Open with More Than 12,000 Attendees Expected

WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...

22/10/2025

Masque Sound and Jaffe Holden Create Transformative Perfo...

Masque Sound, a leading theatrical sound reinforcement, installation and design company, supplied an extensive gear package of professional-grade equipment for ...

22/10/2025

Lightware UCX-3x3-TPX-RX20 sets new standard for connecte...

Lightware, a global leader in signal management and AV connectivity solutions, is seeing strong market momentum for the UCX-3x3-TPX-RX20, a compact transmitter-...

22/10/2025

Chyron Releases PAINT 10.2 Telestration Platform

MELVILLE, N.Y. Chyron has released PAINT 10.2, the latest update for its telestration platform, adding support for SMPTE ST 2110 IP workflows, expanding brandin...

22/10/2025

NBCUniversal Invests in ATSC 3.0 Authority Behind Run3TV

WASHINGTON Run3TV today said NBCUniversal is joining as an investor in the ATSC 3.0 Framework Authority, which develops the Run3TV NextGen TV application platfo...

22/10/2025

swXtch.io to Feature SRT-X Gateway, groundSwXtch at NAB Show New York

ATLANTA swXtch.io will feature two new networking solutions extending the company's reach across more cloud and on-prem workflows at NAB Show New York, set ...

22/10/2025

HBO Max Increases Prices for All Tiers

The Warner Bros. Discoverys HBO Max streaming services has increased prices for all its streaming tiers effectively immediately for new customers. Existing cust...

22/10/2025

OpenDrives Inks Agreement With Versatile Distribution Services

LOS ANGELES OpenDrives has signed a new distribution partnership deal with Versatile Distribution Services (VDS) to strengthen its channel and streamline how it...

22/10/2025

The 2025 NAB Show New York Set to Open with More Than 12,000 Attendees

WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...

22/10/2025

Samora Pinderhughes Brings Immersive Sound to Berklee's Signature Series

Samora Pinderhughes Brings Immersive Sound to Berklee's Signature Series The artist and composer, who's worked with Herbie Hancock, Robert Glasper, Co...

22/10/2025

BMI Day at Berklee Celebrates Composer Fil Eisler and Awards Scholarship to Student Jack Ryan

BMI Day at Berklee Celebrates Composer Fil Eisler and Awards Scholarship to Stud...

22/10/2025

Tribeca Announces Star-Studded Lineup of Membership Events Featuring Tracy Morgan, Alex Rodriguez, and Lucy Liu

October 22nd, 2025 TRIBECA ANNOUNCES STAR-STUDDED LINEUP OF MEMBERSHIP EVENTS F...

22/10/2025

Rohde & Schwarz and TRUMPF cooperate in drone defense

Rohde & Schwarz and TRUMPF cooperate in drone defense Rohde & Schwarz and TRUMPF partner to deliver a comprehensive drone defense solution combining Rohde & S...

22/10/2025

ARTE Enhances Live Production Capabilities with Grass Valley's NativeIP Solutions

European Broadcaster Upgrades To Grass Valley's NativeIP LDX 135 Cameras And...

22/10/2025

Everyone Gets a Better Deal on Verizon with New FOX One Perk

Everyone Gets a Better Deal on Verizon with New FOX One Perk The $15 FOX One streaming service perk is yet another way Verizon continues to add savings for cu...

22/10/2025

First look promo Hidden Assets Series 3

First Look Hidden Assets Series 3 Premieres on 9th November 9:30pm on RT One & RT Player WATCH HERE Promo Link: Hidden Assets Series 3 | RT The Crimin...

21/10/2025

NAB New York 2025: Although AES Show Is on Its Own, Audio Will Be a Major Part of the East Coast Expo

NAB New York 2025: Although AES Show Is on Its Own, Audio Will Be a Major Part o...

21/10/2025

NAB New York 2025: Business of Broadcast and Media,' Future of Content' Are Themes for Two-Day Show

NAB New York 2025: Business of Broadcast and Media,' Future of Content'...

21/10/2025

SVG All-Stars: Ethan Folz, Senior Director, Digital Operations and Quality of Experience, DTC Products, Tech, and Operations, NBA

SVG All-Stars: Ethan Folz, Senior Director, Digital Operations and Quality of Ex...

21/10/2025

NBA on NBC/Peacock: Livestream Offers Graphic Overlays, Predictive Gaming, Ancillary Camera Angles, Real-Time Highlights

NBA on NBC/Peacock: Livestream Offers Graphic Overlays, Predictive Gaming, Ancil...

21/10/2025

NBA on NBC/Peacock: At the Front Bench With Producer Frank DiGraci and Director Pierre Moossa

NBA on NBC/Peacock: At the Front Bench With Producer Frank DiGraci and Director ...

21/10/2025

NBA on NBC/Peacock: NBC Sports, NEP Build Ultra-Flexible Production Plan That Seamlessly Blends Onsite, REMI Models

NBA on NBC/Peacock: NBC Sports, NEP Build Ultra-Flexible Production Plan That Se...

21/10/2025

What to Watch: 6 Indigenous-Led Contemporary Stories Supported by the Documentary Film Program

Indigenous storytelling has been at the heart of the work of the Sundance Instit...

21/10/2025

2026 Sundance Film Festival Announces Park City Legacy Program Featuring Archival Screenings, Special Talks, and Culmination Event

Top L-R: Mysterious Skin, American Dream Second Row L-R: Little Miss Sunshine, D...

21/10/2025

DEADBEAT' Delivered a Raw Communal Party With Tame Impala and The Dare in Brooklyn

Last week, Spotify and Columbia Records transformed Pier 4 at the Brooklyn Army ...

21/10/2025

SBS Learn's Dharug Ngurra resource empowers classrooms to meaningfully celebrate NSW Aboriginal Languages Week

SBS Learn's Dharug Ngurra resource empowers classrooms to meaningfully celeb...

21/10/2025

Turning Viewers into Promoters: What Rising NPS Scores Tell Us About the Power of Experience

As global operators simplify and evolve their digital platforms, NPS improvement...

21/10/2025

L3Harris Successfully Completes Important Milestone for Army SATCOM Program

Critical Design Review completion is a key milestone on the path toward Wideband Global Satellite Communications certification for the network in 2026, opening ...

21/10/2025

In the Undersea Domain - Reliability is Key

New Australian undersea training range to implement and improve warfighting tactics, proficiency and safety; enable joint/allied training that contributes to pr...

21/10/2025

WWTV Bridges 48 Miles with Bold IP Studio Upgrade, Powered by Clear-Com and Key Code Media

eds3_5_jq(document).ready(function($) { $(#eds_sliderM519).chameleonSlider_2_1({...

21/10/2025

The Gauge: Poland | September 2025

September, the beginning of autumn, brought an expected revival to the TV market, largely due to the new fall TV schedules. The time spent in front of the TV sc...

21/10/2025

Nielsen: September Football Blitz Leads to Historic Monthly Spike in Broadcast Viewership

Broadcast Booms with 20% Uptick vs. August, Achieving Largest Monthly Increase ...

21/10/2025

THE GAUGE: MEXICO SEPTEMBER 2025

During September, streaming's share of TV viewing in Mexico settled at 24.5%, a marginal shift of -0.5 share points from the previous month. Disclaimer: YU...

21/10/2025

BBright, GlobalM Successfully Trial End-to-End UHD Interoperability

RENNES, France BBright and GlobalM have conducted a technical trial validating Ultra HD interoperability across the entire contribution chain in the cloud, achi...

21/10/2025

Kokusai Denki To Debut New 4K Camera at 2025 NAB Show New York

Kokusai Denki Electric America will mark the U.S. debut of a new 4K camera at the 2025 NAB New York, Oct. 22-23. Now available, the Z-HD6500-S1 UHD/HD productio...

21/10/2025

Triveni Digital to Showcase Industrys Most Comprehensive...

Triveni Digital, a trusted leader in ATSC 1.0 and 3.0 service delivery, data broadcasting, and quality assurance solutions, will showcase its entire NEXTGEN TV ...

21/10/2025

Radio Azzurra Advances to Digital Workflow with DHD SX2 A...

Radio Azzurra FM, longest-running radio station in the province of Novara, northwest Italy, has invested in an DHD SX2 audio routing and mixing console for inte...

21/10/2025

Globecast Appoints G Morgan as EVP of Sales for Globecast...

Globecast, the leading provider of broadcast, media and entertainment managed services, has announced the appointment of G Morgan as Executive Vice President of...