Sony Pixel Power calrec Sony

Decoding How NVIDIA AI Workbench Powers App Development

19/06/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible and showcases new hardware, software, tools and accelerations for NVIDIA RTX PC and workstation users.

The demand for tools to simplify and optimize generative AI development is skyrocketing. Applications based on retrieval-augmented generation (RAG) - a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from specified external sources - and customized models are enabling developers to tune AI models to their specific needs.

While such work may have required a complex setup in the past, new tools are making it easier than ever.

NVIDIA AI Workbench simplifies AI developer workflows by helping users build their own RAG projects, customize models and more. It's part of the RTX AI Toolkit - a suite of tools and software development kits for customizing, optimizing and deploying AI capabilities - launched at COMPUTEX earlier this month. AI Workbench removes the complexity of technical tasks that can derail experts and halt beginners.

What Is NVIDIA AI Workbench? Available for free, NVIDIA AI Workbench enables users to develop, experiment with, test and prototype AI applications across GPU systems of their choice - from laptops and workstations to data center and cloud. It offers a new approach for creating, using and sharing GPU-enabled development environments across people and systems.

A simple installation gets users up and running with AI Workbench on a local or remote machine in just minutes. Users can then start a new project or replicate one from the examples on GitHub. Everything works through GitHub or GitLab, so users can easily collaborate and distribute work. Learn more about getting started with AI Workbench.

How AI Workbench Helps Address AI Project Challenges Developing AI workloads can require manual, often complex processes, right from the start.

Setting up GPUs, updating drivers and managing versioning incompatibilities can be cumbersome. Reproducing projects across different systems can require replicating manual processes over and over. Inconsistencies when replicating projects, like issues with data fragmentation and version control, can hinder collaboration. Varied setup processes, moving credentials and secrets, and changes in the environment, data, models and file locations can all limit the portability of projects.

AI Workbench makes it easier for data scientists and developers to manage their work and collaborate across heterogeneous platforms. It integrates and automates various aspects of the development process, offering:

Ease of setup: AI Workbench streamlines the process of setting up a developer environment that's GPU-accelerated, even for users with limited technical knowledge.

Seamless collaboration: AI Workbench integrates with version-control and project-management tools like GitHub and GitLab, reducing friction when collaborating.

Consistency when scaling from local to cloud: AI Workbench ensures consistency across multiple environments, supporting scaling up or down from local workstations or PCs to data centers or the cloud.

RAG for Documents, Easier Than Ever NVIDIA offers sample development Workbench Projects to help users get started with AI Workbench. The hybrid RAG Workbench Project is one example: It runs a custom, text-based RAG web application with a user's documents on their local workstation, PC or remote system.

Every Workbench Project runs in a container - software that includes all the necessary components to run the AI application. The hybrid RAG sample pairs a Gradio chat interface frontend on the host machine with a containerized RAG server - the backend that services a user's request and routes queries to and from the vector database and the selected large language model.

This Workbench Project supports a wide variety of LLMs available on NVIDIA's GitHub page. Plus, the hybrid nature of the project lets users select where to run inference.

Workbench Projects let users version the development environment and code. Developers can run the embedding model on the host machine and run inference locally on a Hugging Face Text Generation Inference server, on target cloud resources using NVIDIA inference endpoints like the NVIDIA API catalog, or with self-hosting microservices such as NVIDIA NIM or third-party services.

The hybrid RAG Workbench Project also includes:

Performance metrics: Users can evaluate how RAG- and non-RAG-based user queries perform across each inference mode. Tracked metrics include Retrieval Time, Time to First Token (TTFT) and Token Velocity.

Retrieval transparency: A panel shows the exact snippets of text - retrieved from the most contextually relevant content in the vector database - that are being fed into the LLM and improving the response's relevance to a user's query.

Response customization: Responses can be tweaked with a variety of parameters, such as maximum tokens to generate, temperature and frequency penalty.

To get started with this project, simply install AI Workbench on a local system. The hybrid RAG Workbench Project can be brought from GitHub into the user's account and duplicated to the local system.

More resources are available in the AI Decoded user guide. In addition, community members provide helpful video tutorials, like the one from Joe Freeman below.

Customize, Optimize, Deploy Developers often seek to customize AI models for specific use cases. Fine-tuning, a technique that changes the model by training it with additional data, can be useful for style transfer or changing model behavior. AI Workbench helps with fine-tuning, as well.

The Llama-factory AI Workbench Project enables QLoRa, a fine-tuning method that minimizes memory requirements, for a variety of models, as well as
LINK: https://blogs.nvidia.com/blog/ai-decoded-workbench-hybrid-rag/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

20/03/2026

FCC Approves Nexstar's Acquisition of Tegna

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

Icesi names Ronald David Reyes as the 2025 recipient of t...

Colombia's Icesi University and WSDG are proud to announce Ronald David Reyes as the recipient of the 2025 WSDG Excellence Scholarship, awarded to an outsta...

20/03/2026

Celebrating the greatest creators - One Battle After Ano...

Avid today celebrated the filmmakers, editors and sound teams that worked with Avid Media Composer and Pro Tools to create the vast majority of this year'...

20/03/2026

MRMC Names CP Communications Its Official U.S. Rental, Sales Partner

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

FOR-A To Feature Software-Defined, AI-Driven Solutions At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

2026 NAB Show Exhibitor Insight: Riedel Communications

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

DirecTV Files Suit to Block Nexstar/Tegna Deal

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

Fujifilm Announces Four New Broadcast Zoom Lenses

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

NAB 2026: Software-Defined, AI-Powered Workflow Tells the...

Real-time 9:16 AI-Generated Autocropping; Software-Defined Station in a Box; and Software Switcher with Unlimited Layering Are Among Show Highlights For the fi...

20/03/2026

Signiant Showcases New Content Innovations Driving Visibility, Access, and Action at NAB 2026

Signiant Showcases New Content Innovations Driving Visibility, Access, and Actio...

20/03/2026

Caffeine Relies on DaVinci Resolve Studio for End to End Post Workflow

Caffeine Relies on DaVinci Resolve Studio for End to End Post Workflow Brie Clayton March 19, 2026 0 Comments Blackmagic Cloud helps Mexican post faci...

20/03/2026

It's time to play The Money List! Baz Ashmawy is back at the helm as the quiz returns for a fourth season

It's time to play The Money List! Baz Ashmawy is back at the helm as the qui...

19/03/2026

The Rise of Streaming, Particularly for Sports, Revives Loudness Issues

Live sports production increases complexity, with dynamic audio levels and an overall philosophy that encourages transient volume spikes Fourteen years ago, Am...

19/03/2026

Advanced Systems Group Names Peter Thordarson as Technical Account Executive

Advanced Systems Group, a technology and services provider for media creatives and content owners, announced the appointment of Peter Thordarson to the newly cr...

19/03/2026

SVG Students To Watch: Arya Taymuree, University of Washington

For this senior from the Bay Area, the speed and pressure of live sports production play right into her strengths In the live-sports-video industry, the future...

19/03/2026

Grass Valley Expands Partnership with University of Pittsburgh Athletics, Upgrading Production Infrastructure to SMPTE ST 2110 IP

Grass Valley has expanded its long-term partnership with University of Pittsburg...

19/03/2026

Audio-Technica Debuts ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones

Audio-Technica has released the ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones, designed for use with DSLR, mirrorless SLR, and other cameras. The ATV-SG1...

19/03/2026

NAB 2026: Harmonic Enhances XOS Advanced Media Processor to Streamline Next-Generation Broadcast Distribution

Harmonic (booth W2831) announces updates to its XOS Advanced Media Processor aim...

19/03/2026

DAZN and Top Rank Sign Multi-Year Rights Deal to Bring Marquee Events and Historic Archive to the Global Home of Boxing

DAZN and Top Rank have announced a multi-year partnership that will bring Top Ra...

19/03/2026

IHSE and Cyviz Announce Strategic Partnership

IHSE, a provider of KVM systems, has announced a partnership with Cyviz AS, a provider of technology solutions for collaboration and mission-critical operations...

19/03/2026

Net Insight appoints Larissa Grner-Meeus as Chief Product Officer (CPO)

Net Insight has appointed Larissa G rner-Meeus as Chief Product Officer. She joins the company's executive management team. G rner-Meeus holds a Dipl-Ing. ...

19/03/2026

Leader Appoints Rob Stanley as Regional Sales Manager UK & Northern Europe

Leader Electronics of Europe has appointed Rob Stanley as Regional Sales Manager for the UK and Northern Europe. In the role, he will manage key accounts and ha...

19/03/2026

FIFA and YouTube Team Up in FIFA World Cup 2026 Preferred Platform Agreement

FIFA has announced that YouTube will be a Preferred Platform for the FIFA World Cup 2026. Under the agreement, FIFA's Media Partners will be able to publis...

19/03/2026

Upgrade to NCAA March Madness Live App Expands Multi-Game Viewing, Enhances Second-Screen Experience

New features across mobile, connected devices, and automotive platforms undersco...

19/03/2026

PSSI Global Services Welcomes Ben Bradshaw as Director of Product and Network Development

PSSI Global Services has appointed Ben Bradshaw as Director of Product and Netwo...

19/03/2026

NAB 2026: Cobalt Digital to Unveil Additions to End-to-End IPMX and ST 2110 Ecosystem

Cobalt Digital has announced its NAB 2026 product lineup, which includes additio...

19/03/2026

Sportradar Releases Industry Outlook on the Future of U.S. Sports Viewing

Sportradar has released a new report, Innovation in Sports Media: The Next Era of Sports Viewing, examining how the sports viewing experience in the U.S. is evo...

19/03/2026

Matrox Video's ConvertIP Awarded in Rai Framework Agreement Supporting IP Modernization Strategy

Matrox Video has been awarded a three-year framework agreement to supply its Con...

19/03/2026

Controlled Chaos: Inside the Mighty Production Engine Behind the NCAA Men's Basketball Tournament's First Week

CBS Sports' Jason Cohen and TNT Sports' Chris Brown lead the charge on n...

19/03/2026

Loud and Fun Is the Goal for NCAA Tourney Audio

A1 Dave Grundtvig and his team deploy plenty of mics to capture the sounds and energy from the stands as well the court March Madness is a tournament in which ...

19/03/2026

Spotify Marks 5 Years of EQUAL With EQUAL: The Podcast and Global Events

In 2021, we launched EQUAL, a program designed to address an industry reality that persists: Women artists, songwriters, and producers too often face fewer oppo...

19/03/2026

Toontrack release Transistor Organ EKX

Latest EZKeys 2 expansion arrives Toontrack's staggering collection of EZKeys 2 expansions has grown once again, and the latest instalment delivers a on...

19/03/2026

Roland preview Melody Flip

New generative AI plug-in due in May 2026 Roland have announced the upcoming launch of a new generative AI tool created in collaboration with Sony Computer ...

19/03/2026

Native Instruments CEO Statement

Nick Williams updates users on insolvency process Nick Williams, the CEO of Native Instruments, has released the following official statement regarding thei...

19/03/2026

Milab to restart production

Iconic Swedish mic manufacturer back in action Legendary Swedish microphone manufacturer Milab have announced that production is now fully underway, and mic...

19/03/2026

FT1-EMU plug-in from Freqport

Acclaimed saturation unit goes virtual Freqport's Freqtube FT1 (reviewed here in SOS February 2023) offers a convenient way to integrate real valve-base...

19/03/2026

SGL Carbon: Restructuring ensures earnings forecast and creates basis for new growth

The discontinuation of loss-making business activities as part of the restructur...

19/03/2026

Silicon Valley satire The Audacity premieres 15 April on SBS and SBS On Demand

Silicon Valley satire The Audacity premieres 15 April on SBS and SBS On Demand 19 March, 2026 Media releases From one of the writer/producers of Succession...

19/03/2026

SBS brings communities together at Bondi Pavilion for Harmony Week multilingual broadcast

SBS brings communities together at Bondi Pavilion for Harmony Week multilingual ...

19/03/2026

Clarification from SBS regarding Western Sydney expansion

Clarification from SBS regarding Western Sydney expansion 19 March, 2026 Media releases From an SBS spokesperson: SBS wishes to clarify some media coverag...

19/03/2026

Leader appoints Rob Stanley as Regional Sales Manager UK...

Test & measurement innovator, Leader Electronics of Europe, is pleased to announce the appointment of Rob Stanley as Regional Sales Manager - UK & Northern Euro...

19/03/2026

Accedo One and Magine Pro Officially Launch Leyra Deliver...

The recently announced joint venture between Accedo One and Magine Pro has been officially launched as Leyra. The new company will combine the two complementary...

19/03/2026

Lightware matrices are the go-to choice for signal manage...

Budapest, Hungary, March 2026 - Demand for traditional matrix switching remains strong across live events, rental and staging markets. With a reputation for rel...

19/03/2026

DPA Elevates 4097 Micro Shotgun With CORE Technology

DPA Microphones adds to its CORE microphone selection with the 4097 CORE Micro Shotgun, which delivers a new level of clarity, headroom and sonic transparency...

19/03/2026

Starfish highlights flexible TS Splicer releases and new...

Starfish Technologies will present the latest releases of its TS Splicer (Win) and TS Splicer (K8) at NAB Show 2026, together with a new Monitoring Dashboard de...