Sony Pixel Power calrec Sony

Brave New World: Leo AI and Ollama Bring RTX-Accelerated Local LLMs to Brave Browser Users

02/10/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for GeForce RTX PC and NVIDIA RTX workstation users.

From games and content creation apps to software development and productivity tools, AI is increasingly being integrated into applications to enhance user experiences and boost efficiency.

Those efficiency boosts extend to everyday tasks, like web browsing. Brave, a privacy-focused web browser, recently launched a smart AI assistant called Leo AI that, in addition to providing search results, helps users summarize articles and videos, surface insights from documents, answer questions and more.

Leo AI helps users summarize articles and videos, surface insights from documents, answer questions and more. The technology behind Brave and other AI-powered tools is a combination of hardware, libraries and ecosystem software that's optimized for the unique needs of AI.

Why Software Matters NVIDIA GPUs power the world's AI, whether running in the data center or on a local PC. They contain Tensor Cores, which are specifically designed to accelerate AI applications like Leo AI through massively parallel number crunching - rapidly processing the huge number of calculations needed for AI simultaneously, rather than doing them one at a time.

But great hardware only matters if applications can make efficient use of it. The software running on top of GPUs is just as critical for delivering the fastest, most responsive AI experience.

The first layer is the AI inference library, which acts like a translator that takes requests for common AI tasks and converts them to specific instructions for the hardware to run. Popular inference libraries include NVIDIA TensorRT, Microsoft's DirectML and the one used by Brave and Leo AI via Ollama, called llama.cpp.

Llama.cpp is an open-source library and framework. Through CUDA - the NVIDIA software application programming interface that enables developers to optimize for GeForce RTX and NVIDIA RTX GPUs - provides Tensor Core acceleration for hundreds of models, including popular large language models (LLMs) like Gemma, Llama 3, Mistral and Phi.

On top of the inference library, applications often use a local inference server to simplify integration. The inference server handles tasks like downloading and configuring specific AI models so that the application doesn't have to.

Ollama is an open-source project that sits on top of llama.cpp and provides access to the library's features. It supports an ecosystem of applications that deliver local AI capabilities. Across the entire technology stack, NVIDIA works to optimize tools like Ollama for NVIDIA hardware to deliver faster, more responsive AI experiences on RTX.

Applications like Brave's Leo AI can access RTX-powered AI acceleration to enhance user experiences. NVIDIA's focus on optimization spans the entire technology stack - from hardware to system software to the inference libraries and tools that enable applications to deliver faster, more responsive AI experiences on RTX.

Local vs. Cloud Brave's Leo AI can run in the cloud or locally on a PC through Ollama.

There are many benefits to processing inference using a local model. By not sending prompts to an outside server for processing, the experience is private and always available. For instance, Brave users can get help with their finances or medical questions without sending anything to the cloud. Running locally also eliminates the need to pay for unrestricted cloud access. With Ollama, users can take advantage of a wider variety of open-source models than most hosted services, which often support only one or two varieties of the same AI model.

Users can also interact with models that have different specializations, such as bilingual models, compact-sized models, code generation models and more.

RTX enables a fast, responsive experience when running AI locally. Using the Llama 3 8B model with llama.cpp, users can expect responses up to 149 tokens per second - or approximately 110 words per second. When using Brave with Leo AI and Ollama, this means snappier responses to questions, requests for content summaries and more.

NVIDIA internal throughput performance measurements on NVIDIA GeForce RTX GPUs, featuring a Llama 3 8B model with an input sequence length of 100 tokens, generating 100 tokens. Get Started With Brave With Leo AI and Ollama Installing Ollama is easy - download the installer from the project's website and let it run in the background. From a command prompt, users can download and install a wide variety of supported models, then interact with the local model from the command line.

For simple instructions on how to add local LLM support via Ollama, read the company's blog. Once configured to point to Ollama, Leo AI will use the locally hosted LLM for prompts and queries. Users can also switch between cloud and local models at any time.

Brave with Leo AI running on Ollama and accelerated by RTX is a great way to get more out of your browsing experience. You can even summarize and ask questions about AI Decoded blogs! Developers can learn more about how to use Ollama and llama.cpp in the NVIDIA Technical Blog.

Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/rtx-ai-brave-browser/...
See more stories from nvidia

Most recent headlines

11/12/2025

AI for Sustainability: Lessons from Sarajevo

Thomson and the Center for News, Technology and Innovation (CNTI) convened a two-day workshop in Sarajevo bringing together more than 35 journalists, editors, p...

11/12/2025

ESPN's Aims for Spectacular With Heisman Trophy Show

ESPN's Aims for Spectacular With Heisman Trophy ShowEvent firsts include 1080p HDR production airing on both national broadcast and cableBy Dan Daley, Audio...

11/12/2025

SVG Students To Watch: Frankie Patton, University of Colorado

SVG Students To Watch: Frankie Patton, University of ColoradoThe 2025 grad is hitting the ground running as a PA on national broadcastsBy Brandon Costa, Directo...

11/12/2025

SVG Summit 2025 Technology Exhibits Preview, Part 3

SVG Summit 2025 Technology Exhibits Preview, Part 3By SVG Staff Thursday, December 11, 2025 - 7:24 am Print This Story | Subscribe Story Highlights The 2...

11/12/2025

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Goes Inside the 2025 Fan Engagement Survey

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Go...

11/12/2025

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tuesday Tech Talks'

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tues...

11/12/2025

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of All Sports-Media Trades

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of Al...

11/12/2025

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with Sam McCleery and John Kosner

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with...

11/12/2025

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to Amazon's NBA on Prime Video'

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to A...

11/12/2025

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of Monsters, Inc.'

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of ...

11/12/2025

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Truck Fleet to 1080p, HDR, and ST 2110

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Tr...

11/12/2025

Spotify and The Game Awards Debut Gaming-Inspired Spotify Singles From Labrinth, Evanescence x GUNSHIP, and Bilmuri

Having the right song soundtrack your moves can make all the difference when gam...

11/12/2025

Celebrate Taylor Swift's Record-Breaking Year and New Docuseries with Exclusive Playlist Cover Art Stickers

It's been a big year for Taylor Swift. Her highly anticipated album The Life...

11/12/2025

L3Harris Ramps Up Production of Next-Gen Missile Tracking Satellites at Expanded Florida Facility

New satellites for the SDA Tranche 1 Tracking program in production at L3Harris&...

11/12/2025

L3Harris Delivers First Meadowlands Production Unit to US Space Force

The Meadowlands system, a compact and mobile version of the CCS, uses ground-based radio frequency units to disrupt satellite communications....

11/12/2025

L3Harris Demonstrates Interoperable Network to Unify Department of War and U.S. Government Agencies

The L3Harris demonstration united tactical communications devices, counter-UAS c...

11/12/2025

2025: L3Harris Year in Review

Throughout 2025, L3Harris delivered innovative solutions to U.S. and allied warfighters across every domain. With an unrelenting commitment to excellence, our...

11/12/2025

Nielsen reveals exclusive new data and insights in annual Tops of Sports report

A Majority of the World's Population (51%) Identify As Soccer Fans The 2025 MLB postseason notched 58.2 billion viewing minutes, up +24% from the prior y...

11/12/2025

Zixi Names Roi Sasson Vice President, Engineering

WALTHAM, Mass. Video-over-IP software provider Zixi said Roi Sasson has joined the company as vice president, engineering....

11/12/2025

LG Ad Solutions Expands Local CTV Data Coverage

MOUNTAIN VIEW, Calif. In a move that highlights the growing competition between broadcasters and CTV platforms for local advertising, LG Ad Solutions has announ...

11/12/2025

Boston Conservatory Earns Several Best of Accolades in 2025

Boston Conservatory Earns Several Best of Accolades in 2025 Highlights include a faculty Grammy win, a seventh consecutive year on Playbill's list of co...

11/12/2025

Lawo, SMPTE To Conduct ST 2110 Practical Lab

RASTATT, Germany Lawo and the Society of Motion Picture and Television Engineers (SMPTE) have partnered to launch the SMPTE ST 2110 Practical Lab, an immersive ...

11/12/2025

Comcast's Xfinity Revamps National Video Plans

PHILADELPHIA Comcasts Xfinity operating brand has announced the launch of new national video plans with all-in pricing that the operator said will provide custo...

11/12/2025

Analyst: Pay TV Video Subs Rise for First Time Since 2017

After eight years of declines, MoffettNathansons new Cord Cutting Monitor for Q3 2025 shows that pay TV subscribers to linear TV packages rose by 303,000, the f...

11/12/2025

Happy Holidays from Berklee

Happy Holidays from Berklee Enjoy this years holiday student-performance video. December 10, 2025 By Office of the President Dear Berklee community, As w...

11/12/2025

VEON's Banglalink Receives Regulatory Approval to Launch Digital Payment Services to Advance Financial Inclusion in Bangladesh

11 Dec 2025 VEON's Banglalink Receives Regulatory Approval to Launch Digita...

11/12/2025

Sky Arts paints a picture of Britains beauty as Landscape Artist of the Year returns on 14 January

Thursday 11 December 2025 Sky Arts paints a picture of Britain's beauty as ...

11/12/2025

Meet the Most Relatable Hero This Holiday: Main Trailer and Poster Unveiled for Cashero'

Back to All News Meet the Most Relatable Hero This Holiday: Main Trailer and Po...

11/12/2025

'High Tides': Netflix Shares Release Date Final Season

Back to All News High Tides: Netflix Shares Release Date Final Season Credit: Netflix / Thomas Nolf Entertainment 11 December 2025 GlobalNetherlandsBelgium...

11/12/2025

Made in Texas: How Netflix House Dallas Is Leaving a Lasting Footprint

Back to All News Made in Texas: How Netflix House Dallas Is Leaving a Lasting Footprint From left to right: America's Sweethearts: Dallas Cowboys Cheerl...

11/12/2025

The Second Part of 'One Hundred Years of Solitude' Will Arrive in August 2026

Back to All News The Second Part of One Hundred Years of Solitude Will Arrive i...

11/12/2025

Website launch for pianist Georgina

It's not every day we're asked to create a website for a classical musician! So, we were delighted to help pianist Georgina Duncan as she embarks on he...

11/12/2025

2026 HPA Tech Retreat Brings Together Visionaries to Navigate Media's Next Chapter

The Hollywood Professional Association (HPA) today announced additional programm...

11/12/2025

Shane Lowry, Oti Mabuse and Brendan Fraser among the guests on this week's Late Late Show Christmas Special!

It's beginning to look a lot like Christmas! Irish golfing hero Shane Lowry...

11/12/2025

The RT Sport Sportsperson of the Year Nominees 2025 Revealed

RT Sport Awards 2025 live on RT One and RT Player at 8:05pm on Saturday 20 December On Saturday 20 December live on RT One and RT Player at the earlier ti...

11/12/2025

Ride Into Adventure With Capcom's Monster Hunter Stories' Series in the Cloud

Hunters, saddle up - adventure awaits in the cloud. Journey into the world of M...

11/12/2025

Fair City star Stephanie Kelly revealed as fifth contestant for Dancing with the Stars 2026

Sash is ready to samba as Fair City star Stephanie Kelly is revealed as the late...

10/12/2025

Sound-Alike Commercials Are Part of Sports' Soundtrack

Sound-Alike Commercials Are Part of Sports' Soundtrack Johnny Cash for Coca-Cola is the latest in a long litany of sonic approximationsBy Dan Daley, Audio ...

10/12/2025

Immersive Sound Is Logical Next Step for Sports Venues

Immersive Sound Is Logical Next Step for Sports VenuesSound-systems suppliers are sanguine, but the market has its challengesBy Dan Daley, Audio Editor Wednes...

10/12/2025

The Romans Built Arenas for Immersive Sound 2,000 Years Ago

The Romans Built Arenas for Immersive Sound 2,000 Years AgoThe historic Arena of Nimes in France is still in use todayBy Dan Daley, Audio Editor Wednesday, De...

10/12/2025

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next-Gen Streaming Workflows

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next...

10/12/2025

SVG Summit 2025 Technology Exhibits Preview: Audio Spotlight

SVG Summit 2025 Technology Exhibits Preview: Audio SpotlightBy SVG Staff Wednesday, December 10, 2025 - 8:21 am Print This Story | Subscribe Story Highlig...

10/12/2025

SVG Europe Audio: Listening to the Sounds of Powder and Ice at Milano Cortina with a Behind the Scenes Tour of OBS and NBC's Audio Set Ups

SVG Europe Audio: Listening to the sounds of powder and ice at Milano Cortina wi...

10/12/2025

Advancements in Audio Technology: Capturing the Atmosphere of Live Sports

Advancements in audio technology: Capturing the atmosphere of live sports By David Davies Tuesday, November 25, 2025 - 09:27 Print This Story Although wor...

10/12/2025

Everything Smelled of Popcorn: The Art of Bringing the Complex Sound of Esports to Fans With Sound Supervisor Matt Gilbert

Everything smelled of popcorn: The art of bringing the complex sound of esports ...

10/12/2025

2026 Sundance Film Festival Unveils 97 Projects Selected for the Feature Film and Episodic Program

Top L-R: Ha-Chan, Shake Your Booty!, Hanging by a Wire, Broken English, Buddy C...

10/12/2025

You're in Control: Spotify Lets You Steer the Algorithm

For the first time, Spotify is giving users the power to steer the algorithm. Gustav S derstr m, Spotify's Co-President, CPO, and CTO, shares the vision beh...

10/12/2025

L3Harris to Produce Additional Solid Rocket Motors for Precision-Guided Artillery System

L3Harris' new contract for Guided Multiple Launch Rocket System Insensitive ...

10/12/2025

US Space Force Expands Offensive Space Programs Through L3Harris Foreign Sales

L3Harris Meadowlands system has been designed with an open architecture software system that allows for more flexible and efficient software updates. This capab...