
NVIDIA today announced optimizations across all its platforms to accelerate Meta Llama 3, the latest generation of the large language model (LLM).
The open model combined with NVIDIA accelerated computing equips developers, researchers and businesses to innovate responsibly across a wide variety of applications.
Trained on NVIDIA AI
Meta engineers trained Llama 3 on computer clusters packing 24,576 NVIDIA H100 Tensor Core GPUs, linked with RoCE and NVIDIA Quantum-2 InfiniBand networks.
To further advance the state of the art in generative AI, Meta recently described plans to scale its infrastructure to 350,000 H100 GPUs.
Putting Llama 3 to Work
Versions of Llama 3, accelerated on NVIDIA GPUs, are available today for use in the cloud, data center, edge and PC.
From a browser, developers can try Llama 3 at ai.nvidia.com. It's packaged as an NVIDIA NIM microservice with a standard application programming interface that can be deployed anywhere.
Businesses can fine-tune Llama 3 with their data using NVIDIA NeMo, an open-source framework for LLMs that's part of the secure, supported NVIDIA AI Enterprise platform. Custom models can be optimized for inference with NVIDIA TensorRT-LLM and deployed with NVIDIA Triton Inference Server.
Taking Llama 3 to Devices and PCs
Llama 3 also runs on NVIDIA Jetson Orin for robotics and edge computing devices, creating interactive agents like those in the Jetson AI Lab.
What's more, NVIDIA RTX and GeForce RTX GPUs for workstations and PCs speed inference on Llama 3. These systems give developers a target of more than 100 million NVIDIA-accelerated systems worldwide.
Get Optimal Performance with Llama 3
Best practices in deploying an LLM for a chatbot involve a balance of low latency, good reading speed and optimal GPU use to reduce costs.
Such a service needs to deliver tokens (the rough equivalent of words to an LLM) at about 10 tokens/second, which is about twice a user's reading speed.
Applying these metrics, a single NVIDIA H200 Tensor Core GPU generated about 3,000 tokens/second - enough to serve about 300 simultaneous users - in an initial test using the version of Llama 3 with 70 billion parameters.
That means a single NVIDIA HGX server with eight H200 GPUs could deliver 24,000 tokens/second, further optimizing costs by supporting more than 2,400 users at the same time.
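The capacity figures above follow from simple division, which the short Python sketch below reproduces. It is an illustration only: the helper name `concurrent_users` is hypothetical, and the calculation assumes that throughput is shared evenly among users and scales linearly across the eight GPUs, as the article's estimate implies. Real deployments may differ.

```python
# Per-user delivery target quoted in the article: about 10 tokens/second.
TARGET_TOKENS_PER_SECOND_PER_USER = 10


def concurrent_users(throughput_tokens_per_second,
                     per_user_rate=TARGET_TOKENS_PER_SECOND_PER_USER):
    """Estimate how many users a given token throughput can serve.

    Hypothetical helper: assumes throughput is split evenly among users.
    """
    if per_user_rate <= 0:
        raise ValueError("per_user_rate must be positive")
    return int(throughput_tokens_per_second // per_user_rate)


# Figures quoted in the article for the 70-billion-parameter Llama 3.
h200_tokens_per_second = 3_000
gpus_per_hgx_server = 8

single_gpu_users = concurrent_users(h200_tokens_per_second)

# Assumption: throughput scales linearly across the eight GPUs.
hgx_tokens_per_second = h200_tokens_per_second * gpus_per_hgx_server
hgx_users = concurrent_users(hgx_tokens_per_second)

print(f"Single H200: {single_gpu_users} users")
print(f"HGX server:  {hgx_tokens_per_second} tokens/second, {hgx_users} users")
```

Changing the per-user rate shows how sensitive the user count is to the reading-speed assumption: halving the target to 5 tokens/second doubles the estimate.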
For edge devices, the version of Llama 3 with eight billion parameters generated up to 40 tokens/second on Jetson AGX Orin and 15 tokens/second on Jetson Orin Nano.
Advancing Community Models
An active open-source contributor, NVIDIA is committed to optimizing community software that helps users address their toughest challenges. Open-source models also promote AI transparency and let users broadly share work on AI safety and resilience.
Learn more about NVIDIA's AI inference platform, including how NIM, TensorRT-LLM and Triton use state-of-the-art techniques such as low-rank adaptation to accelerate the latest LLMs.