Sony Pixel Power calrec Sony

Decoding How NVIDIA AI Workbench Powers App Development

19/06/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible and showcases new hardware, software, tools and accelerations for NVIDIA RTX PC and workstation users.

The demand for tools to simplify and optimize generative AI development is skyrocketing. Applications based on retrieval-augmented generation (RAG) - a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from specified external sources - and customized models are enabling developers to tune AI models to their specific needs.

While such work may have required a complex setup in the past, new tools are making it easier than ever.

NVIDIA AI Workbench simplifies AI developer workflows by helping users build their own RAG projects, customize models and more. It's part of the RTX AI Toolkit - a suite of tools and software development kits for customizing, optimizing and deploying AI capabilities - launched at COMPUTEX earlier this month. AI Workbench removes the complexity of technical tasks that can derail experts and halt beginners.

What Is NVIDIA AI Workbench? Available for free, NVIDIA AI Workbench enables users to develop, experiment with, test and prototype AI applications across GPU systems of their choice - from laptops and workstations to data center and cloud. It offers a new approach for creating, using and sharing GPU-enabled development environments across people and systems.

A simple installation gets users up and running with AI Workbench on a local or remote machine in just minutes. Users can then start a new project or replicate one from the examples on GitHub. Everything works through GitHub or GitLab, so users can easily collaborate and distribute work. Learn more about getting started with AI Workbench.

How AI Workbench Helps Address AI Project Challenges Developing AI workloads can require manual, often complex processes, right from the start.

Setting up GPUs, updating drivers and managing versioning incompatibilities can be cumbersome. Reproducing projects across different systems can require replicating manual processes over and over. Inconsistencies when replicating projects, like issues with data fragmentation and version control, can hinder collaboration. Varied setup processes, moving credentials and secrets, and changes in the environment, data, models and file locations can all limit the portability of projects.

AI Workbench makes it easier for data scientists and developers to manage their work and collaborate across heterogeneous platforms. It integrates and automates various aspects of the development process, offering:

Ease of setup: AI Workbench streamlines the process of setting up a developer environment that's GPU-accelerated, even for users with limited technical knowledge.

Seamless collaboration: AI Workbench integrates with version-control and project-management tools like GitHub and GitLab, reducing friction when collaborating.

Consistency when scaling from local to cloud: AI Workbench ensures consistency across multiple environments, supporting scaling up or down from local workstations or PCs to data centers or the cloud.

RAG for Documents, Easier Than Ever NVIDIA offers sample development Workbench Projects to help users get started with AI Workbench. The hybrid RAG Workbench Project is one example: It runs a custom, text-based RAG web application with a user's documents on their local workstation, PC or remote system.

Every Workbench Project runs in a container - software that includes all the necessary components to run the AI application. The hybrid RAG sample pairs a Gradio chat interface frontend on the host machine with a containerized RAG server - the backend that services a user's request and routes queries to and from the vector database and the selected large language model.

This Workbench Project supports a wide variety of LLMs available on NVIDIA's GitHub page. Plus, the hybrid nature of the project lets users select where to run inference.

Workbench Projects let users version the development environment and code. Developers can run the embedding model on the host machine and run inference locally on a Hugging Face Text Generation Inference server, on target cloud resources using NVIDIA inference endpoints like the NVIDIA API catalog, or with self-hosting microservices such as NVIDIA NIM or third-party services.

The hybrid RAG Workbench Project also includes:

Performance metrics: Users can evaluate how RAG- and non-RAG-based user queries perform across each inference mode. Tracked metrics include Retrieval Time, Time to First Token (TTFT) and Token Velocity.

Retrieval transparency: A panel shows the exact snippets of text - retrieved from the most contextually relevant content in the vector database - that are being fed into the LLM and improving the response's relevance to a user's query.

Response customization: Responses can be tweaked with a variety of parameters, such as maximum tokens to generate, temperature and frequency penalty.

To get started with this project, simply install AI Workbench on a local system. The hybrid RAG Workbench Project can be brought from GitHub into the user's account and duplicated to the local system.

More resources are available in the AI Decoded user guide. In addition, community members provide helpful video tutorials, like the one from Joe Freeman below.

Customize, Optimize, Deploy Developers often seek to customize AI models for specific use cases. Fine-tuning, a technique that changes the model by training it with additional data, can be useful for style transfer or changing model behavior. AI Workbench helps with fine-tuning, as well.

The Llama-factory AI Workbench Project enables QLoRa, a fine-tuning method that minimizes memory requirements, for a variety of models, as well as
LINK: https://blogs.nvidia.com/blog/ai-decoded-workbench-hybrid-rag/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

11/06/2026

HBSs Johannes Franken on Digital Innovations, the Role of the Influencer at the 2026 FIFA World Cup

The immense size of the tourney and its Atlantic-spanning operation also disting...

11/06/2026

Nielsen: Soccer Fandom in North America Tops 136 Million, Up 10.9% in Five Years

Nielsen has released a new soccer fandom consumer research report, The Fans Behind The Game: FIFA World Cup 2026 Edition, examining the soccer audience in the...

11/06/2026

Telemundo Announces All-Day Opening Day Coverage for FIFA World Cup 2026 on June 11

Telemundo will launch its FIFA World Cup 2026 coverage on Thursday, June 11 with...

11/06/2026

Fubo Announces Distribution Agreement With NBCUniversal

FuboTV Inc. has announced a distribution agreement with NBCUniversal. Fubo customers can now stream Telemundo and Universo, with NBC Sports Network (NBCSN), NBC...

11/06/2026

DAZN Announces In-App Features for FIFA World Cup 2026 Coverage in Spain, Italy, and Japan

DAZN has announced its in-app features for FIFA World Cup 2026 coverage in Spain...

11/06/2026

Roblox Report: Sports Engagement on Platform Drives Real-World Fandom and Purchases

Roblox has released the 2026 Roblox Digital Expression Report: Wave 4 - Sports D...

11/06/2026

Andrea Bocelli, David Guetta, Megan Thee Stallion, and EJAE Release Official FIFA World Cup 2026 Anthem DNA'

FIFA has unveiled DNA, the Official FIFA World Cup 2026 Anthem, performed by A...

11/06/2026

ESPN Announces Extensive English- and Spanish-Language World Cup 2026 Coverage

ESPN will provide English- and Spanish-language news and information coverage of FIFA World Cup 2026 across its U.S. media platforms from June 11 through July 1...

11/06/2026

SVG Students To Watch: Teddy Batkin, Rochester Institute of Technology

The latest product of the outstanding RIT Sports Network program, this recent grad from Long Island is carving out a promising path in broadcast engineering In...

11/06/2026

DAZN and DSPORTS Announce Distribution Agreement Across Five Latin American Countries

DAZN has announced a multi-year agreement to make DSPORTS channels available to ...

11/06/2026

Resource Actors Throughout the Years at Sundance Institute's Directors Lab

Laura Dern at the 1986 Sundance Institute Directors Lab (Photo by Eric Edwards) By Lucy Spicer It takes a village to bring together the Sundance Institute lab...

11/06/2026

Introducing a New Standard for Podcast Plays and Upgraded Creator Analytics Experience

As podcast formats evolve in the streaming era, podcasting needs updated, transp...

11/06/2026

RADAR Italia Unveils 6 New Artists and a New Approach for 2026

As Spotify's global RADAR program enters its sixth year in Italy, a new class of artists is stepping into the spotlight. Today, we're announcing the six...

11/06/2026

5 Audiobooks that Amplify and Celebrate Queer Voices

Pride Month is a time for celebration, reflection, and amplifying the diverse stories and perspectives from the LGBTQIA+ community that enrich our world. To hel...

11/06/2026

VSL introduce Synchron Solo Violin 1 & Cello (sordino)

First in new line of muted string libraries VSL have just announced the launch of two new string libraries that represent the first two instalments in a new...

11/06/2026

Novation reveal the Launchkey 61 MK4 White

New colour option for 61-key Launchkey MK4 At Superbooth 2025, Novation introduced the Launchkey Mini 37 White and Launchkey 49 White, bringing an additiona...

11/06/2026

Arturia announce the MiniLab 37

Larger, but still compact! Arturia's popular compact MIDI controller keyboard is now available in a, well, slightly less compact version! The new MiniLa...

11/06/2026

Eurosatory 2026: Rohde & Schwarz shapes the new-generation battlefield

Eurosatory 2026: Rohde & Schwarz shapes the new-generation battlefield Rohde & Schwarz unveils next generation SIGINT/EW and CUAS solutions on uncrewed system...

11/06/2026

Rohde & Schwarz unveils NEMACS - Directional, ultra secure connectivity for the future battlefield

Rohde & Schwarz unveils NEMACS - Directional, ultra secure connectivity for the ...

11/06/2026

MTI FILM acquires Mango/New Edit

MTI FILM acquires Mango/New Edit Posted by MTI Film on June 10, 2026 LOS ANGELES, CA - June 2026 - MTI FILM, the multiple Emmy Award winning Hollywood post-p...

11/06/2026

Ungrounded LLM Fabricates Every Detail for Nearly 1 in 5 Movie and TV Titles Tested, New Gracenote Report Finds

Study underscores the need for authoritative content intelligence to build trust...

11/06/2026

PTZOptics, LayerJot Partner on AI-Powered PTZ at InfoComm 2026

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Chyron Unveils PAINT 10.4

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Maxon Brings Real-Time Architectural Visualization to AIA26 With New Redshift for Revit and Archicad Integration Beta

Maxon Brings Real-Time Architectural Visualization to AIA26 With New Redshift fo...

11/06/2026

ABC Kid's Caper Crew Shoots Australian Adventure with Blackmagic Design

ABC Kid's Caper Crew Shoots Australian Adventure with Blackmagic Design Brie Clayton June 11, 2026 0 Comments DP Judd Overton and team bring Wes A...

11/06/2026

PTZOptics and LayerJot demo Visual Reasoning at InfoComm...

PTZOptics, and LayerJot today announced live demonstrations at InfoComm 2026 showing how prompt-based AI, robotic camera control, and high-performance computing...

11/06/2026

Lightware launches GPIO Button to deliver simplified hard...

Lightware, an industry leader in signal management, announces the release of GPIO-Button-10S, a dedicated control interface enabling straightforward press-to-a...

11/06/2026

NABs LeGeyt Urges Congress to Limit NFL's Antitrust Exemption

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Fubo Inks New Distribution Agreement with NBCUniversal

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Kiloview to Showcase Broadcast-Grade AV-over-IP Solutions...

Kiloview, a leading innovator in AV-over-IP video solutions, will return to InfoComm 2026 (Booth# N8327) with broadcast-grade AV-over-IP solutions designed for ...

11/06/2026

Australian Games Industry Glossary of Terms

Australian Games Industry Glossary of Terms 10 June 2026 From DAU and EULA to COT and QADE, here's a list of game industry terms, industry jargon and their...

11/06/2026

Berklee's Tonya Butler Named Music Business Educator of the Year

Berklee's Tonya Butler Named Music Business Educator of the Year The Music Business Association honored Butler at its annual Bizzy Awards. June 10, 2026 ...

11/06/2026

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement The five-time Grammy-winning engineer and producer, known for her longstanding...

11/06/2026

Daisy May Cooper rallies the nation ahead of ICC Womens T20 World Cup

Thursday 11 June 2026 Daisy May Cooper rallies the nation ahead of ICC Women's T20 World CupTurn on cookies to view this content. Go to Privacy options and...

11/06/2026

Hadewych Minis and Geert van Rampelberg to Star in New Netflix Series Directed by Paula van der Oest

Back to All News Hadewych Minis and Geert van Rampelberg to Star in New Netflix...

11/06/2026

Official Trailer for Anime Adaptation of Thunder 3' Unveiled Ahead of July 9 Premiere

Back to All News Official Trailer for Anime Adaptation of Thunder 3' Unvei...

11/06/2026

RT Radio 1 and Irish Lights mark RT 100 with special broadcasts from Ireland's Lighthouses

Summer solstice shows from C il House and Late Date from 9pm on Saturday 20 Jun...

11/06/2026

Save Big and Play Bigger: GeForce NOW Summer Sale Brings Major Membership Savings

The GeForce NOW summer sale kicked off today with limited-time savings of up to ...

10/06/2026

SVG Sit-Down: Team Whistle's Joe Caporoso on Building World Cup Content Around Fans, Culture, IRL Experiences

DAZN-owned digital-media company launches three fan-first series leaning into cr...

10/06/2026

Clear-Com Appoints Jason Dino as Southwest Regional Sales Manager

Clear-Com has announced the appointment of Jason Dino as Southwest Regional Sales Manager USA, covering Southern California and the Southwest region. Dino joins...

10/06/2026

Caretta Research: 2026 World Cup Revenue Growth Due to More Matches; Rights Revenue Up 32%

An 11% decrease in number of global broadcast deals reflects the organization...

10/06/2026

Women Without Boundaries Awards Are Back!

The Women Without Boundaries Awards recognize women whose work is advancing the future of media, broadcast, AV, workplace technology, digital experience, and re...

10/06/2026

On Eve of World Cup Kickoff, FIFA and HBS Offer Deep Dive into IBC Operations, Commentary, and Ref Cam

Today is match day minus two for FIFA and HBS. On Thursday, there will be two ma...

10/06/2026

SES Supporting World's Biggest Soccer Tournament Broadcast Distribution Worldwide

SES is supporting broadcast distribution of the world's biggest football tou...

10/06/2026

BirdDog Achieves Full NDI 6.3 Compatibility Across Entire Product Line

NDI has announced that BirdDog has become the first hardware manufacturer to achieve full NDI 6.3 compatibility across its complete lineup of cameras, encoders,...

10/06/2026

Emmy Award-Winning Audio Team To Present at SVG Audio Symposium

Vince Caputo and Scott Carter, winners of the 2026 Sports Emmy for Outstanding Post Produced Audio have been announced as presenters for the 2026 SVG Advanced A...