
Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for GeForce RTX PC and NVIDIA RTX workstation users.
From games and content creation apps to software development and productivity tools, AI is increasingly being integrated into applications to enhance user experiences and boost efficiency.
Those efficiency boosts extend to everyday tasks, like web browsing. Brave, a privacy-focused web browser, recently launched a smart AI assistant called Leo AI that, in addition to providing search results, helps users summarize articles and videos, surface insights from documents, answer questions and more.
Leo AI helps users summarize articles and videos, surface insights from documents, answer questions and more. The technology behind Brave and other AI-powered tools is a combination of hardware, libraries and ecosystem software that's optimized for the unique needs of AI.
Why Software Matters NVIDIA GPUs power the world's AI, whether running in the data center or on a local PC. They contain Tensor Cores, which are specifically designed to accelerate AI applications like Leo AI through massively parallel number crunching - rapidly processing the huge number of calculations needed for AI simultaneously, rather than doing them one at a time.
But great hardware only matters if applications can make efficient use of it. The software running on top of GPUs is just as critical for delivering the fastest, most responsive AI experience.
The first layer is the AI inference library, which acts like a translator that takes requests for common AI tasks and converts them to specific instructions for the hardware to run. Popular inference libraries include NVIDIA TensorRT, Microsoft's DirectML and the one used by Brave and Leo AI via Ollama, called llama.cpp.
Llama.cpp is an open-source library and framework. Through CUDA - the NVIDIA software application programming interface that enables developers to optimize for GeForce RTX and NVIDIA RTX GPUs - provides Tensor Core acceleration for hundreds of models, including popular large language models (LLMs) like Gemma, Llama 3, Mistral and Phi.
On top of the inference library, applications often use a local inference server to simplify integration. The inference server handles tasks like downloading and configuring specific AI models so that the application doesn't have to.
Ollama is an open-source project that sits on top of llama.cpp and provides access to the library's features. It supports an ecosystem of applications that deliver local AI capabilities. Across the entire technology stack, NVIDIA works to optimize tools like Ollama for NVIDIA hardware to deliver faster, more responsive AI experiences on RTX.
Applications like Brave's Leo AI can access RTX-powered AI acceleration to enhance user experiences. NVIDIA's focus on optimization spans the entire technology stack - from hardware to system software to the inference libraries and tools that enable applications to deliver faster, more responsive AI experiences on RTX.
Local vs. Cloud Brave's Leo AI can run in the cloud or locally on a PC through Ollama.
There are many benefits to processing inference using a local model. By not sending prompts to an outside server for processing, the experience is private and always available. For instance, Brave users can get help with their finances or medical questions without sending anything to the cloud. Running locally also eliminates the need to pay for unrestricted cloud access. With Ollama, users can take advantage of a wider variety of open-source models than most hosted services, which often support only one or two varieties of the same AI model.
Users can also interact with models that have different specializations, such as bilingual models, compact-sized models, code generation models and more.
RTX enables a fast, responsive experience when running AI locally. Using the Llama 3 8B model with llama.cpp, users can expect responses up to 149 tokens per second - or approximately 110 words per second. When using Brave with Leo AI and Ollama, this means snappier responses to questions, requests for content summaries and more.
NVIDIA internal throughput performance measurements on NVIDIA GeForce RTX GPUs, featuring a Llama 3 8B model with an input sequence length of 100 tokens, generating 100 tokens. Get Started With Brave With Leo AI and Ollama Installing Ollama is easy - download the installer from the project's website and let it run in the background. From a command prompt, users can download and install a wide variety of supported models, then interact with the local model from the command line.
For simple instructions on how to add local LLM support via Ollama, read the company's blog. Once configured to point to Ollama, Leo AI will use the locally hosted LLM for prompts and queries. Users can also switch between cloud and local models at any time.
Brave with Leo AI running on Ollama and accelerated by RTX is a great way to get more out of your browsing experience. You can even summarize and ask questions about AI Decoded blogs! Developers can learn more about how to use Ollama and llama.cpp in the NVIDIA Technical Blog.
Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.
North America Stories
29/10/2025
MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Aus...
29/10/2025
SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLA...
29/10/2025
World Series 2025: How Audio Is Transported Around the Sites and BeyondThe signals also move not just between two countries but around the globeBy Dan Daley, Au...
29/10/2025
A still from 306 Hollywood, a film by sibling filmmakers Jonathan Bogar n and El...
29/10/2025
WUPPERTAL, Germany Riedel Communications has hired Ulrich Voigt as director, live production solutions, taking over the leadership of its SimplyLive business fr...
29/10/2025
OKLAHOMA CITY and TULSA, Okla. Sinclair has named Mark Martin as vice president and general manager of KOKH-KOCB Oklahoma City and KTUL Tulsa....
29/10/2025
BELLEVUE, Wash. Julie Van Ullen has joined cross-platform TV ad measurement company iSpot as president and chief revenue officer....
29/10/2025
Brutal g et, a Swiss broadcast services provider, has rolled out a state-of-the-art outside broadcast (OB) vehicle built on a Lawo AoIP (audio-over-internet pro...
29/10/2025
WASHINGTON FCC Commissioner Olivia Trusty has announced a temporary staff change in her office....
29/10/2025
Berklee Valencia Talent Helps Score Alejandro Amen bar's El cautivo Faculty and alumni from Berklee Valencia's scoring for film, television, and video...
29/10/2025
The Walt Disney Company today announced they have closed their transaction to combine Fubo's business with Disney's Hulu + Live TV business....
29/10/2025
WUPPERTAL, Germany Riedel Communications has hired Ulrich Voigt as director, live production solutions, taking over the leadership of its SimplyLive business fr...
29/10/2025
LOS ANGELES Major League Soccer will introduce a broadcast audio enhancement tonight during the LAFC vs. Austin FC playoff match....
29/10/2025
ESPN said it will produce animated telecasts for NFL, NHL, NBA and WNBA games across The Walt Disney Co. and ESPN platforms during the 2025-26 season under an a...
29/10/2025
WASHINGTON Despite the government shutdown, the Federal Communications Commission has passed, with some revisions, a previously announced Notice of Proposed Rul...
29/10/2025
Back to All News
Start of Filming for Daniel S nchez Ar valos New Netflix Movie
Entertainment
29 October 2025
GlobalSpain
Link copied to clipboard
The fil...
29/10/2025
Editor's note: This post is part of Into the Omniverse, a series focused on ...
28/10/2025
ESPN Announces Monsters Funday Football', Its Latest Real-Time Animated Bro...
28/10/2025
SVG All-Stars: Catherine Chalfant, Manager, Remote Operations, ESPNThe Ole Miss alum is an operational force behind ESPN's extensive college-football catalo...
28/10/2025
Elevating the experience: AI and data take Ryder Cup to the next level By Joe OHalloran
Tuesday, October 28, 2025 - 10:25
Print This Story
NBC produced th...
28/10/2025
Conquering the Air (waves): Taking a close up look at the IBC Accelerator Priva...
28/10/2025
World Series 2025: Spectrum SportsNet LA Brings Dodgers Fans Closer to the Actio...
28/10/2025
Dylan Southern and Benedict Cumberbatch at the premiere of The Thing with Feathers (photo by George Pimentel / Shutterstock for Sundance Film Festival)...
28/10/2025
Disney, NBCUniversal, FOX, Paramount Each Achieve Double-Digit Monthly Growth
...
28/10/2025
CINCINNATI The E.W. Scripps Company has announced an agreement to sell WRTV, its local ABC-affiliated station in Indianapolis, to Circle City Broadcasting for $...
28/10/2025
Berklee College of Music and Berklee Valencia Named to Billboards 2025 Top Music...
28/10/2025
NEW YORK As AI usage continues to spike, a new study from IAB delves into an important aspect of how AI is transforming the advertising business with new data s...
28/10/2025
Charlie Jablonski, a broadcast tech pioneer who helped shape the modern era of Olympics television coverage, died Oct. 25 at his home in Lake George N.Y., the N...
28/10/2025
VIENNA, Austria Bitmovin has launched Bitmovin Observability, a new stand-alone video data solution that delivers real-time insights into video playback. The so...
28/10/2025
LOS ANGELES LucidLink, the file streaming platform, has announced a Frame.io integration and expanded mobile capabilities at Adobe Max....
28/10/2025
Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, today announced that it has joined the Ama...
28/10/2025
Lightware, an industry leader in connectivity and signal management solutions, has announced a major update to its Taurus
platform, which now delivers flawless...
28/10/2025
Following a successful mid-September International Broadcasting Convention in Amsterdam, Hiltron Communications will promote its full range of satellite communi...
28/10/2025
Open Broadcast Systems has chosen MC&S (Media Consulting & Services) as a reseller to help strengthen its presence in France.
With over twenty years of experi...
28/10/2025
Bitmovin, leading provider of video streaming solutions, has launched Bitmovin Observability, a new stand-alone video data solution that delivers real-time insi...
28/10/2025
Ease Live, the leader in interactive TV technology, today announced the successful launch of interactive graphical overlays for UEFA Champions League matches fo...
28/10/2025
LucidLink, the file streaming platform, today at Adobe MAX announced a Frame.io integration and expanded mobile capabilities, streamlining collaboration and hel...
28/10/2025
ATLANTA Gray Media has promoted Nick Hasenecz to general manager of WNDU, its NBC affiliate in the South Bend-Elkhart, Ind., market....
28/10/2025
Applications Open for Berklee Fenway Neighborhood Improvement Grant Boston nonprofits can apply by December 12 for funding to support community projects that ...
28/10/2025
From live broadcast innovation to post-production intelligence, NAB New York 2025 showcased how rapidly media workflows are evolving toward AI-driven, hybrid, a...
28/10/2025
Governments everywhere are racing to harness the power of AI - but legacy infras...
28/10/2025
AI is moving from the digital world into the physical one. Across factory floors...
28/10/2025
NVIDIA is delivering the telecom industry a major boost in open-source software for building AI-native 5G and 6G networks.
NVIDIA Aerial software will soon be ...
28/10/2025
The race to bottle a star now runs on AI.
NVIDIA, General Atomics and a team of international partners have built a high-fidelity, AI-enabled digital twin for ...
28/10/2025
Along the Pacific Ocean in Monterey, California, the Naval Postgraduate School (...
28/10/2025
To democratize access to AI technology nationwide, AI education and deployment c...
28/10/2025
Leading technology companies in aerospace and automotive are accelerating their ...
28/10/2025
Scripps Research awarded $4 million to advance platform for neurodevelopmental disorders The California Institute for Regenerative Medicine (CIRM) grant support...
27/10/2025
You can touch this: Haptics becoming central to the virtual live experience By Adrian Pennington
Friday, October 24, 2025 - 09:12
Print This Story
The vid...