Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and which showcases new hardware, software, tools and accelerations for RTX PC users.NVIDIA's RTX AI platform includes tools and software development kits that help Windows developers create cutting-edge generative AI features to deliver the best performance on AI PCs and workstations.
At GTC - NVIDIA's annual technology conference - a dream team of industry luminaries, developers and researchers have come together to learn from one another, fueling what's next in AI and accelerated computing.
This special edition of AI Decoded from GTC spotlights the best AI tools currently available and looks at what's ahead for the 100 million RTX PC and workstation users and developers.
Chat with RTX, the tech demo and developer reference project that quickly and easily allows users to connect a powerful LLM to their own data, showcased new capabilities and new models in the GTC exhibit hall.
The winners of the Gen AI on RTX PCs contest were announced Monday. OutlookLLM, Rocket League BotChat and CLARA were highlighted in one of the AI Decoded talks in the generative AI theater and each are accelerated by NVIDIA TensorRT-LLM. Two other AI Decoded talks included using generative AI in content creation and a deep dive on Chat with RTX.
Developer frameworks and interfaces with TensorRT-LLM integration continue to grow as Jan.ai, Langchain, LlamaIndex and Oobabooga will all soon be accelerated - helping to grow the already more than 500 AI applications for RTX PCs and workstations.
NVIDIA NIM microservices are coming to RTX PCs and workstations. They provide pre-built containers, with industry standard APIs, enabling developers to accelerate deployment on RTX PCs and workstations. NVIDIA AI Workbench, an easy-to-use developer toolkit to manage AI model customization and optimization workflows, is now generally available for RTX developers.
These ecosystem integrations and tools will accelerate development of new Windows apps and features. And today's contest winners are an inspiring glimpse into what that content will look like.
Hear More, See More, Chat More Chat with RTX, or ChatRTX for short, uses retrieval-augmented generation, NVIDIA TensorRT-LLM software and NVIDIA RTX acceleration to bring local generative AI capabilities to RTX-powered Windows systems. Users can quickly and easily connect local files as a dataset to an open large language model like Mistral or Llama 2, enabling queries for quick, contextually relevant answers.
Moving beyond text, ChatRTX will soon add support for voice, images and new models.
Users will be able to talk to ChatRTX with Whisper - an automatic speech recognition system that uses AI to process spoken language. When the feature becomes available, ChatRTX will be able to understand spoken language, and provide text responses.
A future update will also add support for photos. By integrating OpenAI's CLIP - Contrastive Language-Image Pre-training - users will be able to search by words, terms or phrases to find photos in their private library.
In addition to Google's Gemma, ChatGLM will get support in a future update.
Developers can start with the latest version of the developer reference project on GitHub.
Generative AI for the Win The NVIDIA Generative AI on NVIDIA RTX developer contest prompted developers to build a Windows app or plug-in.
I found that playing against bots that react to game events with in-game messages in near real time adds a new level of entertainment to the game, and I'm excited to share my approach to incorporating AI into gaming as a participant in this developer contest. The target audience for my project is anyone who plays Rocket League with RTX hardware. - Brian Caffey, Rocket League BotChat developer
Submissions were judged on three criteria, including a short demo video posted to social media, relative impact and ease of use of the project, and how effectively NVIDIA's technology stack was used in the project. Each of the three winners received a pass to GTC, including a spot in the NVIDIA Deep Learning Institute GenAI/LLM courses, and a GeForce RTX 4090 GPU to power future development work.
OutlookLLM gives Outlook users generative AI features - such as email composition - securely and privately in their email client on RTX PCs and workstations. It uses a local LLM served via TensorRT-LLM.
Rocket League BotChat, for the popular Rocket League game, is a plug-in that allows bots to send contextual in-game chat messages based on a log of game events, such as scoring a goal or making a save. Designed to be used only in offline games against bot players, the plug-in is configurable in many ways via its settings menu.
CLARA (short for Command Line Assistant with RTX Acceleration) is designed to enhance the command line interface of PowerShell by translating plain English instructions into actionable commands. The extension runs locally, quickly and keeps users in their PowerShell context. Once it's enabled, users type their English instructions and press the tab button to invoke CLARA. Installation is straightforward, and there are options for both script-based and manual setup.
Congratulations to the #GenAIonRTX #DevContest winners:
CLARA Talk with computers in human language Matthew Yaeger
Rocket League BotChat Generate in-game chat messages Brian Caffey
Outlook LLM Compose emails securely with AI Francisco Gonzalez Blanch
pic.twitter.com/i52w5Pn1n9
NVIDIA AI Developer (@NVIDIAAIDev) March 19, 2024
From the Generative AI Theater GTC attendees can attend three AI Decoded talks on Wednesday, March 20 at the generative AI theater. These 15-minute sessions will guide the audience through ChatRTX and how developers can product










