Sony Pixel Power calrec Sony

Brave New World: Leo AI and Ollama Bring RTX-Accelerated Local LLMs to Brave Browser Users

02/10/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for GeForce RTX PC and NVIDIA RTX workstation users.

From games and content creation apps to software development and productivity tools, AI is increasingly being integrated into applications to enhance user experiences and boost efficiency.

Those efficiency boosts extend to everyday tasks, like web browsing. Brave, a privacy-focused web browser, recently launched a smart AI assistant called Leo AI that, in addition to providing search results, helps users summarize articles and videos, surface insights from documents, answer questions and more.

Leo AI helps users summarize articles and videos, surface insights from documents, answer questions and more. The technology behind Brave and other AI-powered tools is a combination of hardware, libraries and ecosystem software that's optimized for the unique needs of AI.

Why Software Matters NVIDIA GPUs power the world's AI, whether running in the data center or on a local PC. They contain Tensor Cores, which are specifically designed to accelerate AI applications like Leo AI through massively parallel number crunching - rapidly processing the huge number of calculations needed for AI simultaneously, rather than doing them one at a time.

But great hardware only matters if applications can make efficient use of it. The software running on top of GPUs is just as critical for delivering the fastest, most responsive AI experience.

The first layer is the AI inference library, which acts like a translator that takes requests for common AI tasks and converts them to specific instructions for the hardware to run. Popular inference libraries include NVIDIA TensorRT, Microsoft's DirectML and the one used by Brave and Leo AI via Ollama, called llama.cpp.

Llama.cpp is an open-source library and framework. Through CUDA - the NVIDIA software application programming interface that enables developers to optimize for GeForce RTX and NVIDIA RTX GPUs - provides Tensor Core acceleration for hundreds of models, including popular large language models (LLMs) like Gemma, Llama 3, Mistral and Phi.

On top of the inference library, applications often use a local inference server to simplify integration. The inference server handles tasks like downloading and configuring specific AI models so that the application doesn't have to.

Ollama is an open-source project that sits on top of llama.cpp and provides access to the library's features. It supports an ecosystem of applications that deliver local AI capabilities. Across the entire technology stack, NVIDIA works to optimize tools like Ollama for NVIDIA hardware to deliver faster, more responsive AI experiences on RTX.

Applications like Brave's Leo AI can access RTX-powered AI acceleration to enhance user experiences. NVIDIA's focus on optimization spans the entire technology stack - from hardware to system software to the inference libraries and tools that enable applications to deliver faster, more responsive AI experiences on RTX.

Local vs. Cloud Brave's Leo AI can run in the cloud or locally on a PC through Ollama.

There are many benefits to processing inference using a local model. By not sending prompts to an outside server for processing, the experience is private and always available. For instance, Brave users can get help with their finances or medical questions without sending anything to the cloud. Running locally also eliminates the need to pay for unrestricted cloud access. With Ollama, users can take advantage of a wider variety of open-source models than most hosted services, which often support only one or two varieties of the same AI model.

Users can also interact with models that have different specializations, such as bilingual models, compact-sized models, code generation models and more.

RTX enables a fast, responsive experience when running AI locally. Using the Llama 3 8B model with llama.cpp, users can expect responses up to 149 tokens per second - or approximately 110 words per second. When using Brave with Leo AI and Ollama, this means snappier responses to questions, requests for content summaries and more.

NVIDIA internal throughput performance measurements on NVIDIA GeForce RTX GPUs, featuring a Llama 3 8B model with an input sequence length of 100 tokens, generating 100 tokens. Get Started With Brave With Leo AI and Ollama Installing Ollama is easy - download the installer from the project's website and let it run in the background. From a command prompt, users can download and install a wide variety of supported models, then interact with the local model from the command line.

For simple instructions on how to add local LLM support via Ollama, read the company's blog. Once configured to point to Ollama, Leo AI will use the locally hosted LLM for prompts and queries. Users can also switch between cloud and local models at any time.

Brave with Leo AI running on Ollama and accelerated by RTX is a great way to get more out of your browsing experience. You can even summarize and ask questions about AI Decoded blogs! Developers can learn more about how to use Ollama and llama.cpp in the NVIDIA Technical Blog.

Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/rtx-ai-brave-browser/...
See more stories from nvidia

North America Stories

29/10/2025

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Austin FC Playoff Match

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Aus...

29/10/2025

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLAY

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLA...

29/10/2025

World Series 2025: How Audio Is Transported Around the Sites and Beyond

World Series 2025: How Audio Is Transported Around the Sites and BeyondThe signals also move not just between two countries but around the globeBy Dan Daley, Au...

29/10/2025

Inside the Archives: Celebrating Archives Month Through Sundance Film Festival Films

A still from 306 Hollywood, a film by sibling filmmakers Jonathan Bogar n and El...

29/10/2025

Riedel Names Ulrich Voigt Director of Live Production Solutions

WUPPERTAL, Germany Riedel Communications has hired Ulrich Voigt as director, live production solutions, taking over the leadership of its SimplyLive business fr...

29/10/2025

Sinclair Taps Mark Martin to Lead Stations in Oklahoma

OKLAHOMA CITY and TULSA, Okla. Sinclair has named Mark Martin as vice president and general manager of KOKH-KOCB Oklahoma City and KTUL Tulsa....

29/10/2025

iSpot Taps Julie Van Ullen as President and Chief Revenue Officer

BELLEVUE, Wash. Julie Van Ullen has joined cross-platform TV ad measurement company iSpot as president and chief revenue officer....

29/10/2025

Lawo Delivers Audio, IP Infrastructure for New Swiss OB Vehicle

Brutal g et, a Swiss broadcast services provider, has rolled out a state-of-the-art outside broadcast (OB) vehicle built on a Lawo AoIP (audio-over-internet pro...

29/10/2025

FCC Commissioner Olivia Trusty Announces Temporary Staff Changes

WASHINGTON FCC Commissioner Olivia Trusty has announced a temporary staff change in her office....

29/10/2025

Berklee Valencia Talent Helps Score Alejandro Amenbar's El cautivo

Berklee Valencia Talent Helps Score Alejandro Amen bar's El cautivo Faculty and alumni from Berklee Valencia's scoring for film, television, and video...

29/10/2025

Disney Takes Ownership of Fubo

The Walt Disney Company today announced they have closed their transaction to combine Fubo's business with Disney's Hulu + Live TV business....

29/10/2025

Ulrich Voigt

WUPPERTAL, Germany Riedel Communications has hired Ulrich Voigt as director, live production solutions, taking over the leadership of its SimplyLive business fr...

29/10/2025

MLS To Unveil Immersive Embodied Sound' During Playoff Match

LOS ANGELES Major League Soccer will introduce a broadcast audio enhancement tonight during the LAFC vs. Austin FC playoff match....

29/10/2025

ESPN, Sony Ink Deal to Expand Animated Altcasts for 2025-26

ESPN said it will produce animated telecasts for NFL, NHL, NBA and WNBA games across The Walt Disney Co. and ESPN platforms during the 2025-26 season under an a...

29/10/2025

FCC Approves Notice of Proposed Rulemaking on NextGen TV

WASHINGTON Despite the government shutdown, the Federal Communications Commission has passed, with some revisions, a previously announced Notice of Proposed Rul...

29/10/2025

Start of Filming for Daniel Snchez Arvalo's New Netflix Movie

Back to All News Start of Filming for Daniel S nchez Ar valos New Netflix Movie Entertainment 29 October 2025 GlobalSpain Link copied to clipboard The fil...

29/10/2025

Into the Omniverse: Open World Foundation Models Generate Synthetic Worlds for Physical AI Development

Editor's note: This post is part of Into the Omniverse, a series focused on ...

28/10/2025

SVG All-Stars: Catherine Chalfant, Manager, Remote Operations, ESPN

SVG All-Stars: Catherine Chalfant, Manager, Remote Operations, ESPNThe Ole Miss alum is an operational force behind ESPN's extensive college-football catalo...

28/10/2025

Elevating the Experience: AI and Data Take Ryder Cup to the Next Level

Elevating the experience: AI and data take Ryder Cup to the next level By Joe OHalloran Tuesday, October 28, 2025 - 10:25 Print This Story NBC produced th...

28/10/2025

Conquering the Air (Waves): Taking a Close Up Look at the IBC Accelerator Private 5G from Land to Sea to Sky'

Conquering the Air (waves): Taking a close up look at the IBC Accelerator Priva...

28/10/2025

World Series 2025: Spectrum SportsNet LA Brings Dodgers Fans Closer to the Action With Pre/Postgame Coverage

World Series 2025: Spectrum SportsNet LA Brings Dodgers Fans Closer to the Actio...

28/10/2025

The Thing with Feathers Brings the Horror of Grief to the Screen

Dylan Southern and Benedict Cumberbatch at the premiere of The Thing with Feathers (photo by George Pimentel / Shutterstock for Sundance Film Festival)...

28/10/2025

Football Scores Extra Points for Multi-Platform Companies in Nielsen's September Media Distributor Gauge

Disney, NBCUniversal, FOX, Paramount Each Achieve Double-Digit Monthly Growth ...

28/10/2025

Scripps to Sell WRTV to Circle City Broadcasting for $83 million

CINCINNATI The E.W. Scripps Company has announced an agreement to sell WRTV, its local ABC-affiliated station in Indianapolis, to Circle City Broadcasting for $...

28/10/2025

Berklee College of Music and Berklee Valencia Named to Billboards 2025 Top Music Business Schools List

Berklee College of Music and Berklee Valencia Named to Billboards 2025 Top Music...

28/10/2025

Survey: Consumers Rank AI as a Major Influence on Their Shopping Decisions

NEW YORK As AI usage continues to spike, a new study from IAB delves into an important aspect of how AI is transforming the advertising business with new data s...

28/10/2025

Broadcast Tech Pioneer Charlie Jablonski Has Died

Charlie Jablonski, a broadcast tech pioneer who helped shape the modern era of Olympics television coverage, died Oct. 25 at his home in Lake George N.Y., the N...

28/10/2025

Bitmovin Unveils Real-Time Observability Solution for Video Streaming

VIENNA, Austria Bitmovin has launched Bitmovin Observability, a new stand-alone video data solution that delivers real-time insights into video playback. The so...

28/10/2025

LucidLink Now Integrated With Adobe Frame.io

LOS ANGELES LucidLink, the file streaming platform, has announced a Frame.io integration and expanded mobile capabilities at Adobe Max....

28/10/2025

Mediagenix Joins AWS ISV Accelerate Program

Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, today announced that it has joined the Ama...

28/10/2025

Lightware Taurus product family introduces 5K support

Lightware, an industry leader in connectivity and signal management solutions, has announced a major update to its Taurus platform, which now delivers flawless...

28/10/2025

Hiltron to Promote its Broad Range of Satcom Products and...

Following a successful mid-September International Broadcasting Convention in Amsterdam, Hiltron Communications will promote its full range of satellite communi...

28/10/2025

Open Broadcast Systems Selects Media Consulting and Servi...

Open Broadcast Systems has chosen MC&S (Media Consulting & Services) as a reseller to help strengthen its presence in France. With over twenty years of experi...

28/10/2025

Bitmovin Unveils Real-Time Observability Solution for Vid...

Bitmovin, leading provider of video streaming solutions, has launched Bitmovin Observability, a new stand-alone video data solution that delivers real-time insi...

28/10/2025

Ease Live Powers Interactive Champions League Viewer Expe...

Ease Live, the leader in interactive TV technology, today announced the successful launch of interactive graphical overlays for UEFA Champions League matches fo...

28/10/2025

LucidLink unveils Frame io integration and expanded mobil...

LucidLink, the file streaming platform, today at Adobe MAX announced a Frame.io integration and expanded mobile capabilities, streamlining collaboration and hel...

28/10/2025

Nick Hascenez Named GM of WNDU South Bend

ATLANTA Gray Media has promoted Nick Hasenecz to general manager of WNDU, its NBC affiliate in the South Bend-Elkhart, Ind., market....

28/10/2025

Applications Open for Berklee Fenway Neighborhood Improvement Grant

Applications Open for Berklee Fenway Neighborhood Improvement Grant Boston nonprofits can apply by December 12 for funding to support community projects that ...

28/10/2025

NAB New York 2025 Recap: AI, Cloud, and Hybrid Take Center Stage

From live broadcast innovation to post-production intelligence, NAB New York 2025 showcased how rapidly media workflows are evolving toward AI-driven, hybrid, a...

28/10/2025

NVIDIA and US Technology Leaders Unveil AI Factory Design to Modernize Government and Secure the Nation

Governments everywhere are racing to harness the power of AI - but legacy infras...

28/10/2025

NVIDIA IGX Thor Robotics Processor Brings Real-Time Physical AI to the Industrial and Medical Edge

AI is moving from the digital world into the physical one. Across factory floors...

28/10/2025

NVIDIA Open Sources Aerial Software to Accelerate AI-Native 6G

NVIDIA is delivering the telecom industry a major boost in open-source software for building AI-native 5G and 6G networks. NVIDIA Aerial software will soon be ...

28/10/2025

NVIDIA and General Atomics Advance Commercial Fusion Energy

The race to bottle a star now runs on AI. NVIDIA, General Atomics and a team of international partners have built a high-fidelity, AI-enabled digital twin for ...

28/10/2025

NVIDIA, NPS Commission the Navy's AI Flagship for Training Tomorrow's Leaders

Along the Pacific Ocean in Monterey, California, the Naval Postgraduate School (...

28/10/2025

Fueling Economic Development Across the US: How NVIDIA Is Empowering States, Municipalities and Universities to Drive Innovation

To democratize access to AI technology nationwide, AI education and deployment c...

28/10/2025

NVIDIA AI Physics Transforms Aerospace and Automotive Design, Accelerating Engineering by 500x

Leading technology companies in aerospace and automotive are accelerating their ...

28/10/2025

October 27, 2025

Scripps Research awarded $4 million to advance platform for neurodevelopmental disorders The California Institute for Regenerative Medicine (CIRM) grant support...

27/10/2025

You Can Touch This: Haptics Becoming Central to the Virtual Live Experience

You can touch this: Haptics becoming central to the virtual live experience By Adrian Pennington Friday, October 24, 2025 - 09:12 Print This Story The vid...