Sony Pixel Power calrec Sony

Brave New World: Leo AI and Ollama Bring RTX-Accelerated Local LLMs to Brave Browser Users

02/10/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for GeForce RTX PC and NVIDIA RTX workstation users.

From games and content creation apps to software development and productivity tools, AI is increasingly being integrated into applications to enhance user experiences and boost efficiency.

Those efficiency boosts extend to everyday tasks, like web browsing. Brave, a privacy-focused web browser, recently launched a smart AI assistant called Leo AI that, in addition to providing search results, helps users summarize articles and videos, surface insights from documents, answer questions and more.

Leo AI helps users summarize articles and videos, surface insights from documents, answer questions and more. The technology behind Brave and other AI-powered tools is a combination of hardware, libraries and ecosystem software that's optimized for the unique needs of AI.

Why Software Matters NVIDIA GPUs power the world's AI, whether running in the data center or on a local PC. They contain Tensor Cores, which are specifically designed to accelerate AI applications like Leo AI through massively parallel number crunching - rapidly processing the huge number of calculations needed for AI simultaneously, rather than doing them one at a time.

But great hardware only matters if applications can make efficient use of it. The software running on top of GPUs is just as critical for delivering the fastest, most responsive AI experience.

The first layer is the AI inference library, which acts like a translator that takes requests for common AI tasks and converts them to specific instructions for the hardware to run. Popular inference libraries include NVIDIA TensorRT, Microsoft's DirectML and the one used by Brave and Leo AI via Ollama, called llama.cpp.

Llama.cpp is an open-source library and framework. Through CUDA - the NVIDIA software application programming interface that enables developers to optimize for GeForce RTX and NVIDIA RTX GPUs - provides Tensor Core acceleration for hundreds of models, including popular large language models (LLMs) like Gemma, Llama 3, Mistral and Phi.

On top of the inference library, applications often use a local inference server to simplify integration. The inference server handles tasks like downloading and configuring specific AI models so that the application doesn't have to.

Ollama is an open-source project that sits on top of llama.cpp and provides access to the library's features. It supports an ecosystem of applications that deliver local AI capabilities. Across the entire technology stack, NVIDIA works to optimize tools like Ollama for NVIDIA hardware to deliver faster, more responsive AI experiences on RTX.

Applications like Brave's Leo AI can access RTX-powered AI acceleration to enhance user experiences. NVIDIA's focus on optimization spans the entire technology stack - from hardware to system software to the inference libraries and tools that enable applications to deliver faster, more responsive AI experiences on RTX.

Local vs. Cloud Brave's Leo AI can run in the cloud or locally on a PC through Ollama.

There are many benefits to processing inference using a local model. By not sending prompts to an outside server for processing, the experience is private and always available. For instance, Brave users can get help with their finances or medical questions without sending anything to the cloud. Running locally also eliminates the need to pay for unrestricted cloud access. With Ollama, users can take advantage of a wider variety of open-source models than most hosted services, which often support only one or two varieties of the same AI model.

Users can also interact with models that have different specializations, such as bilingual models, compact-sized models, code generation models and more.

RTX enables a fast, responsive experience when running AI locally. Using the Llama 3 8B model with llama.cpp, users can expect responses up to 149 tokens per second - or approximately 110 words per second. When using Brave with Leo AI and Ollama, this means snappier responses to questions, requests for content summaries and more.

NVIDIA internal throughput performance measurements on NVIDIA GeForce RTX GPUs, featuring a Llama 3 8B model with an input sequence length of 100 tokens, generating 100 tokens. Get Started With Brave With Leo AI and Ollama Installing Ollama is easy - download the installer from the project's website and let it run in the background. From a command prompt, users can download and install a wide variety of supported models, then interact with the local model from the command line.

For simple instructions on how to add local LLM support via Ollama, read the company's blog. Once configured to point to Ollama, Leo AI will use the locally hosted LLM for prompts and queries. Users can also switch between cloud and local models at any time.

Brave with Leo AI running on Ollama and accelerated by RTX is a great way to get more out of your browsing experience. You can even summarize and ask questions about AI Decoded blogs! Developers can learn more about how to use Ollama and llama.cpp in the NVIDIA Technical Blog.

Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/rtx-ai-brave-browser/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

20/04/2026

Google Cloud Embraces the Rise of Agentic Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Creators Go All in on AI, Niche Content

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

NBC Sports' Jon Miller: Broadcast Is Having a Moment'

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Beyond the Lift and Shift': Cloud Migration's New Mandate

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Virtual Production Finds Its Footing

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Corporate Creators: All Companies Are Media Companies Now

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

IABM Rebrands as the International Association of MediaTech

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CBS Detroit Debuts New AR/VR Technology-Driven Studio

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Fox Sports Taps Appear X Platform for Remote Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CueScript and Lighting Design Group Expand Customer Oppor...

CueScript and Lighting Design Group Expand Customer Opportunities Through New Partnership Find both companies at 2026 NAB Show in CueScript Booth # C 4720 ...

20/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-En...

[Sydney, NSW, 20 April 2026] - Layercake, the company behind the intelligent media orchestration platform Streamcake, today announced the formalisation of its i...

20/04/2026

FOX Sports selects Appear X Platform for next-generation...

Deployment spans FOX Sports' REMI infrastructure, IP production for a major global soccer event, and its Jewel Events production systems Appear, a global l...

20/04/2026

Pro Sound Effects Launches the Industry's First and Only Native Sound Effects Integration for Avid Media Composer at NAB 2026

Pro Sound Effects Launches the Industry's First and Only Native Sound Effect...

20/04/2026

SBE Elevates Fred Willard to SBE Fellow

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Blackmagic Design Announces Blackmagic Camera for iOS 3.3 Update

Blackmagic Design Announces Blackmagic Camera for iOS 3.3 Update Brie Clayton April 20, 2026 0 Comments New update adds camera control and monitoring ...

20/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush and Cinema 4D

Maxon Announces Free Tools and Mobile Expansion of ZBrush and Cinema 4D Brie Clayton April 20, 2026 0 Comments Cinema 4D brings professional 3D workfl...

20/04/2026

Vizrt AI Keyer kills the green screen and creates virtual scenes in any environment

Vizrt AI Keyer kills the green screen and creates virtual scenes in any environm...

20/04/2026

Register now - Market & Audience Department Ask Me Anything (AMA) Session

Register now - Market & Audience Department Ask Me Anything (AMA) Session 11 February 2026 Screen Australia Head of Market & Audience Rakel Tansley Talking to...

20/04/2026

Screen Australia appoints Tanya Phegan as Narrative Content Head of Development

Screen Australia appoints Tanya Phegan as Narrative Content Head of Development 17 March 2026 Tanya Phegan Screen Australia has today announced the appointmen...

20/04/2026

Applications Open for Skip Ahead 11

Applications Open for Skip Ahead 11 19 March 2026 Past Skip Ahead recipients (L-R): Macfarlane Bros, Rainbow Bop, Lyanna Kea. Screen Australia and YouTube Aus...

20/04/2026

Screen Australia empowers the next games generation, including new creatives from neighbouring disciplines

Screen Australia empowers the next games generation, including new creatives fro...

20/04/2026

Official Co-production Ask Me Anything (AMA) Session

Official Co-production Ask Me Anything (AMA) Session 26 March 2026 Image (L-R): Mix Tape, Michele McDonald, Flower & Flour. Interested in international Co-pro...

20/04/2026

Screen Australia announces Narrative Content funding for 91 projects, including four short films paired with industry mentors

Screen Australia announces Narrative Content funding for 91 projects, including ...

20/04/2026

Production Infrastructure and Capacity Analysis (PICA) pinpoints four key workforce challenges in the Australian screen industry

Production Infrastructure and Capacity Analysis (PICA) pinpoints four key workfo...

20/04/2026

Australians in Film and Screen Australia Announce the 2026 Participants in the Talent Gateway and Global Producers Program

Australians in Film and Screen Australia Announce the 2026 Participants in the T...

20/04/2026

Screen Australia relaunches website with new tools and improved user experience

Screen Australia relaunches website with new tools and improved user experience 16 April 2026 Screen Australia relaunches website Screen Australia has relaun...

20/04/2026

Ikegami Announces VFE-P07D Monocular OLED Viewfinder

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

EVS Launches Choreon Robotic Control Solution

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Ross Video Showcases End-To-End Production Ecosystem at 2026 Nab Show

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Heidi Steffen to Become President of TitanTV

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

NVIDIA and Partners Showcase the Future of AI-Driven Manufacturing at Hannover Messe 2026

Manufacturing is at an inflection point. Across every major industrial economy, ...

20/04/2026

Autonomous AI at Scale: Adobe Agents Unlock Breakthrough Creative Intelligence With NVIDIA and WPP

AI agents are transforming how work gets done across all industries, acceleratin...

19/04/2026

NAB Show 2026 Is Here! Follow All of our Live Coverage!

Blackmagic Design has announced the ATEM 4 M/E Constellation IP and ATEM 4 M/E Constellation IP Plus, two SMPTE 2110-native live production switchers. The ATEM ...

19/04/2026

Live From NAB 2026: Grass Valley CEO Jon Wilson on AMPPs Explosive Growth, Hybrid Workflows, and Whats New at the Show

Grass Valley is finding the right balance between its hardware heritage with an ...

19/04/2026

Live From NAB 2026: Oracles Kip Schauer on Why OCI Is Doubling Down on Media, Sports, and Broadcast

Oracle's strategy rests on the foundational strengths of Oracle Cloud Infras...

19/04/2026

Live From NAB 2026: Program Productions Jess Kowatch on Whats New with ProCrewz and the Impact of AI on Crewing

Program Productions, the live sports production industry's leading crewer, i...

19/04/2026

Live From NAB 2026: Aggrekos Joe Scionti on Powering the Super Bowl, PGA Championship, and the Road to the FIFA World Cup

At the 2026 NAB Show in Las Vegas, SVG sat down with Joe Scionti, Account Manage...

19/04/2026

NAB 2026: Evertz to highlight evertz.io XChange for live event management and market switching

Evertz (Booth N817) is set to present new services within its evertz.io platform...

19/04/2026

NAB 2026: Evertz to showcase IPMX-certified NUCLEUS and MMA platforms for AV and ST 2110 integration

Evertz (Booth N817) will showcase its IPMX-certified NUCLEUS platform alongside ...

19/04/2026

NAB 2026: Evertz to showcase ENX media core for hybrid SDI and IP facilities

Evertz (Booth N817) is set to showcase ENX at NAB 2026, a media core platform designed to support hybrid SDI and IP infrastructures in production facilities and...

19/04/2026

NAB 2026: Evertz introduces Studer VistaVUE Touch for broadcast control

Evertz (Booth N817) will introduce Studer VistaVUE Touch at NAB 2026, a control surface designed to integrate audio, video and control workflows within a custom...

19/04/2026

NAB 2026: Evertz highlights X-CALIBER high-density encoding platform for media transport

Evertz (Booth N817) will highlight X-CALIBER at NAB 2026, an encoding and decodi...