Sony Pixel Power calrec Sony

KServe Providers Dish Up NIMble Inference in Clouds and Data Centers

02/06/2024

Deploying generative AI in the enterprise is about to get easier than ever.

NVIDIA NIM, a set of generative AI inference microservices, works with KServe, open-source software that automates putting AI models to work at the scale of a cloud computing application.

The combination ensures generative AI can be deployed like any other large enterprise application. It also makes NIM widely available through platforms from dozens of companies, such as Canonical, Nutanix and Red Hat.

The integration of NIM on KServe extends NVIDIA's technologies to the open-source community, ecosystem partners and customers. Through NIM, they can all access the performance, support and security of the NVIDIA AI Enterprise software platform with an API call - the push-button of modern programming.

Serving AI on Kubernetes KServe got its start as part of Kubeflow, a machine learning toolkit based on Kubernetes, the open-source system for deploying and managing software containers that hold all the components of large distributed applications.

As Kubeflow expanded its work on AI inference, what became KServe was born and ultimately evolved into its own open-source project.

Many companies have contributed to and adopted the KServe software that runs today at companies including AWS, Bloomberg, Canonical, Cisco, Hewlett Packard Enterprise, IBM, Red Hat, Zillow and NVIDIA.

Under the Hood With KServe KServe is essentially an extension of Kubernetes that runs AI inference like a powerful cloud application. It uses a standard protocol, runs with optimized performance and supports PyTorch, Scikit-learn, TensorFlow and XGBoost without users needing to know the details of those AI frameworks.

The software is especially useful these days, when new large language models (LLMs) are emerging rapidly.

KServe lets users easily go back and forth from one model to another, testing which one best suits their needs. And when an updated version of a model gets released, a KServe feature called canary rollouts automates the job of carefully validating and gradually deploying it into production.

Another feature, GPU autoscaling, efficiently manages how models are deployed as demand for a service ebbs and flows, so customers and service providers have the best possible experience.

An API Call to Generative AI The goodness of KServe is now available with the ease of NVIDIA NIM.

With NIM, a simple API call takes care of all the complexities. Enterprise IT admins get the metrics they need to ensure their application is running with optimal performance and efficiency, whether it's in their data center or on a remote cloud service - even if they change the AI models they're using.

NIM lets IT professionals become generative AI pros, transforming their company's operations. That's why a host of enterprises such as Foxconn and ServiceNow are deploying NIM microservices.

NIM Rides Dozens of Kubernetes Platforms Thanks to its integration with KServe, users will be able access NIM on dozens of enterprise platforms such as Canonical's Charmed KubeFlow and Charmed Kubernetes, Nutanix GPT-in-a-Box 2.0, Red Hat's OpenShift AI and many others.

Red Hat has been working with NVIDIA to make it easier than ever for enterprises to deploy AI using open source technologies, said KServe contributor Yuan Tang, a principal software engineer at Red Hat. By enhancing KServe and adding support for NIM in Red Hat OpenShift AI, we're able to provide streamlined access to NVIDIA's generative AI platform for Red Hat customers.

Through the integration of NVIDIA NIM inference microservices with Nutanix GPT-in-a-Box 2.0, customers will be able to build scalable, secure, high-performance generative AI applications in a consistent way, from the cloud to the edge, said the vice president of engineering at Nutanix, Debojyoti Dutta, whose team contributes to KServe and Kubeflow.

As a company that also contributes significantly to KServe, we're pleased to offer NIM through Charmed Kubernetes and Charmed Kubeflow, said Andreea Munteanu, MLOps product manager at Canonical. Users will be able to access the full power of generative AI, with the highest performance, efficiency and ease thanks to the combination of our efforts.

Dozens of other software providers can feel the benefits of NIM simply because they include KServe in their offerings.

Serving the Open-Source Community NVIDIA has a long track record on the KServe project. As noted in a recent technical blog, KServe's Open Inference Protocol is used in NVIDIA Triton Inference Server, which helps users run many AI models simultaneously across many GPUs, frameworks and operating modes.

With KServe, NVIDIA focuses on use cases that involve running one AI model at a time across many GPUs.

As part of the NIM integration, NVIDIA plans to be an active contributor to KServe, building on its portfolio of contributions to open-source software that includes Triton and TensorRT-LLM. NVIDIA is also an active member of the Cloud Native Computing Foundation, which supports open-source code for generative AI and other projects.

Try the NIM API on the NVIDIA API Catalog using the Llama 3 8B or Llama 3 70B LLM models today. Hundreds of NVIDIA partners worldwide are using NIM to deploy generative AI.

Watch NVIDIA founder and CEO Jensen Huang's COMPUTEX keynote to get the latest on AI and more.
LINK: https://blogs.nvidia.com/blog/kserve-nim-inference/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

23/03/2026

IBEW Calls for Scrutiny of Skydance-CBS Layoffs and Proposed CNN Merger

Share Copy link Facebook X Linkedin Bluesky Email...

23/03/2026

Bruno Mars Risk It All Music Video Captures Timeless Text...

Pro8mm, the Super 8 experts, provided cameras, Super 8 movie film, and scanning services for Bruno Mars' Risk It All music video. The debut single from Br...

23/03/2026

Matthews Intros Lightweight Aluminum Grid Clamps

Matthews, introduces their first aluminum grid clamp collection, engineered for the rigging needs of film, television and live production. Combining light weigh...

23/03/2026

RT Statement on the Death of Sports Broadcaster Michael Lyster

RT is sad today to learn of the death of legendary RT Sport broadcaster Michael Lyster, who died this morning aged 71 years. Kevin Bakhurst, Director-General...

23/03/2026

RT Documentary On One wins its first ever dedicated music award

RT Documentary On One has scooped its first ever dedicated music award. At the 2026 Icelandic Music Awards, composer lfur Eldj rn won Release of the Year in t...

23/03/2026

Czechia v Republic of Ireland live on RT2, RT Player and RT Radio 1

Inside Sport, Liveline, Morning Ireland and 2FM DRIVE will all be in Prague to bring fans to the heart of the action Every Moment, Every Generation RT | FIFA...

22/03/2026

VSL update Synchron Woodwinds & Strings

Free updates now available VSL have just released some free updates that add some existing features to a selection of libraries in their expansive Synchron ...

21/03/2026

MPG announce new Impact Award

Presented to War Child UK's HELP(2) project The MPG (Music Producers Guild) have announced the launch of the MPG Impact Award, a brand-new honour that w...

21/03/2026

Eduardo Tarilonte's Ancient ERA Persia from Best Service

Microtuning support for Arabic, Persian & Turkish scales The latest release from Best Service brings together a selection of string, wind and percussion ins...

21/03/2026

New campaign from NAATI and SBS CulturalConnect highlights how we all deserve to be understood'

New campaign from NAATI and SBS CulturalConnect highlights how we all deserve t...

21/03/2026

Statement regarding Rhoda Roberts AO

Statement regarding Rhoda Roberts AO 21 March, 2026 Media releases SBS is deeply saddened by the passing of Widjabul Wieybal woman from the Bundjalung Na...

21/03/2026

Survey: Fans Prefer Sports on Broadcast Over Streaming

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Graham Promotes Stephanie Slagle to VP, CRO & GM of WDIV Local 4

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Study: Repurposed Traditional TV Ads for CTV Is a Missed Opportunity

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Carr Backs Trump Army/Navy Game Executive Order

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Opponents File Emergency FCC Petition to Block Nexstar/Tegna Merger

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Eight States Ask for Court to Stop Nexstar/Tegna Merger

Share Copy link Facebook X Linkedin Bluesky Email...

21/03/2026

Cine Gear Connect NY Ramps Up for March 28 - 2026

Cine Gear Connect NY, presented by Universal Production Services, is filling in the slate for a full day of panels, peers, learning the latest, and mixing it up...

21/03/2026

Studio Technologies Debuts New StudioComm System at NAB 2026

Studio Technologies Debuts New StudioComm System at NAB 2026 Brie Clayton March 20, 2026 0 Comments StudioComm Model 794 Central Controller and Model ...

21/03/2026

Restoration Christian Fellowship Captures Worship Music Videos with PYXIS 12K

Restoration Christian Fellowship Captures Worship Music Videos with PYXIS 12K Brie Clayton March 20, 2026 0 Comments PYXIS' open gate provides cre...

20/03/2026

NAB 2026: Net Insight unveils Market-Leading JPEG XS at Scale for Live IP Media Production

Net Insight will introduce a JPEG XS solution for full IP environments at NAB Sh...

20/03/2026

NAB 2026: LTN and Harmonic Expand Partnership to Support FAST Growth and C-Band Migration

LTN has expanded its technology partnership with Harmonic ahead of the FCC's...

20/03/2026

NAB 2026: Solid State Logic to Preview SSL Live V6.2 with New SolidPitch Effect and Major Workflow Enhancements

Solid State Logic will preview SSL Live V6.2 at NAB Show, booth C6907. The softw...

20/03/2026

NAB 2026: Fujifilm Announces Availability of FUJINON UA22x4.8BERD 4K Broadcast Zoom Lens

FUJIFILM North America Corporation's Optical Devices Division has announced ...

20/03/2026

NAB 2026: Fujifilm Announces Development of FUJINON UA16x4BERD, UA30x7.3BERD, and UA94x8.7BESM 4K Broadcast Zooms

FUJIFILM North America Corporation's Optical Devices Division has announced ...

20/03/2026

TrueVisions NOW Chooses Bitmovin's Observability

TrueVisions NOW, a streaming platform in Thailand and part of the TrueVisions Group, has selected Bitmovin's Observability product for real-time video analy...

20/03/2026

Marquee Sports Network Expands Distribution to Hulu + Live TV, Prime Video

Marquee Sports Network has announced distribution agreements with Hulu + Live TV and Prime Video ahead of the 2026 MLB season. Marquee Sports Network is now av...

20/03/2026

NAB 2026: Software-Defined, AI-Powered Workflow Tells the Story of the New FOR-A America

FOR-A will exhibit software-defined and AI-driven solutions at NAB Show 2026, bo...

20/03/2026

TNA Wrestling and Eurosport India Announce New Multi-Year Exclusive Programming Agreement

TNA Wrestling and Eurosport India have entered into a multi-year exclusive progr...

20/03/2026

GameTime Productions Expands Technical Vision for Athletes Unlimited Basketball in Nashville

When Athletes Unlimited brought its professional women's basketball season t...

20/03/2026

Calrec Craft Interview with Senior Broadcast Audio A1 Engineer and Music Director Rick Bernier

In this craft interview, Rick Bernier reflects on a career that has taken him to...

20/03/2026

NAB 2026: IP Innovation, SoftwareBased Media Infrastructure & Dynamic Media Facility Workflows on Display for Lawo

Lawo will announce a new product ahead of NAB Show 2026 in Las Vegas, where it w...

20/03/2026

Ratings Roundup: 2026 World Baseball Classic is Most Watched WBC Telecast EVER with Over 10 Million Viewers

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

20/03/2026

MLB Names Polymarket Exclusive Prediction Market Exchange Partner and Signs Agreement with CFTC to Establish Integrity Framework

Major League Baseball (MLB) has named Polymarket as its Official Prediction Mark...

20/03/2026

How Big Tech AI Will Lead Broadcast Innovation: Lessons from the Enterprise Market

With AI now the industry-wide priority, Big Tech companies are uniquely position...

20/03/2026

SVG GameDay, Ep. 8: Los Angeles Angels' Davin Maske - SoCal Baseball in Anaheim

In-venue and creative video staffers at the professional and collegiate level ha...

20/03/2026

Fanatics Studios, FOX Sports Focus on Cavalcade of Stars in Fanatics Flag Football Classic

Abundant player mics and RF and other ground-level cameras will be used to captu...

20/03/2026

ESPN Ramps Up Production Levels for NCAA Women's Tournament as Popularity, Viewership Skyrocket

Regional sites also will receive big boost in production resources, including on...

20/03/2026

Give Me the Backstory: Get to Know Addison Heimann, the Writer-Director of Touch Me

By Jessica Herndon One of the most exciting things about the Sundance Film Fest...

20/03/2026

Spotify Marks 5 Years of EQUAL With EQUAL: The Podcast' and Global Events

In 2021, we launched EQUAL, a program designed to address an industry reality that persists: Women artists, songwriters, and producers too often face fewer oppo...

20/03/2026

Spotify's BTS Music Quiz Celebrates ARIRANG' and Puts ARMY Knowledge to the Test

BTS' long-awaited fifth studio album, ARIRANG, is finally here. To celebrate...

20/03/2026

Spotify and Kenia Os Bring K de Karma' From Streaming to Stage With Fan-First Experiences

A new era for Kenia Os has arrived, and Spotify marked the moment by putting fan...

20/03/2026

Spotify y Kenia Os llevan K de Karma' del streaming al escenario con experiencias nicas para Top Fans

Una nueva era para Kenia Os ha llegado, y Spotify marc el momento poniendo a lo...

20/03/2026

Sound Magic launch Supreme Drums Orange

Combines sampling & physical modelling Sound Magic have announced the launch of a comprehensive virtual drum instrument that's been designed to cater to...

20/03/2026

Mix Rescue: Ian Shepherd Video

How much difference should mastering make? In our latest Mix Rescue feature, SOS Editor in Chief Sam Inglis revisits a project from back in 2019, carrying o...