
Large language model development is about to reach supersonic speed thanks to a collaboration between NVIDIA and Anyscale.
At its annual Ray Summit developers conference, Anyscale, the company behind Ray, the fast-growing open-source unified compute framework for scalable computing, announced today that it is bringing NVIDIA AI to Ray open source and the Anyscale Platform. NVIDIA AI will also be integrated into Anyscale Endpoints, a new service announced today that makes it easy for application developers to cost-effectively embed LLMs in their applications using the most popular open-source models.
These integrations can dramatically speed generative AI development and efficiency while boosting security for production AI, from proprietary LLMs to open models such as Code Llama, Falcon, Llama 2, SDXL and more.
Developers will have the flexibility to deploy open-source NVIDIA software with Ray or opt for NVIDIA AI Enterprise software running on the Anyscale Platform for a fully supported and secure production deployment.
Ray and the Anyscale Platform are widely used by developers building advanced LLMs for generative AI applications capable of powering intelligent chatbots, coding copilots and powerful search and summarization tools.
NVIDIA and Anyscale Deliver Speed, Savings and Efficiency
Generative AI applications are captivating the attention of businesses around the globe. Fine-tuning, augmenting and running LLMs requires significant investment and expertise. Together, NVIDIA and Anyscale can help reduce costs and complexity for generative AI development and deployment with a number of application integrations.
NVIDIA TensorRT-LLM, new open-source software announced last week, will support Anyscale offerings to supercharge LLM performance and efficiency and deliver cost savings. Also supported in the NVIDIA AI Enterprise software platform, TensorRT-LLM automatically scales inference to run models in parallel over multiple GPUs, which can provide up to 8x higher performance when running on NVIDIA H100 Tensor Core GPUs, compared to prior-generation GPUs.
TensorRT-LLM also includes custom GPU kernels and optimizations for a wide range of popular LLMs. It implements the new FP8 numerical format available in the NVIDIA H100 Tensor Core GPU Transformer Engine and offers an easy-to-use and customizable Python interface.
NVIDIA Triton Inference Server software supports inference across cloud, data center, edge and embedded devices on GPUs, CPUs and other processors. Its integration can enable Ray developers to boost efficiency when deploying AI models from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS XGBoost and more.
With the NVIDIA NeMo framework, Ray users will be able to easily fine-tune and customize LLMs with business data, paving the way for LLMs that understand the unique offerings of individual businesses.
NeMo is an end-to-end, cloud-native framework to build, customize and deploy generative AI models anywhere. It features training and inferencing frameworks, guardrailing toolkits, data curation tools and pretrained models, offering enterprises an easy, cost-effective and fast way to adopt generative AI.
Options for Open-Source or Fully Supported Production AI
Ray open source and the Anyscale Platform enable developers to effortlessly move from open source to deploying production AI at scale in the cloud.
The Anyscale Platform provides fully managed, enterprise-ready unified computing that makes it easy to build, deploy and manage scalable AI and Python applications using Ray, helping customers bring AI products to market faster at significantly lower cost.
Whether developers use Ray open source or the supported Anyscale Platform, Anyscale's core functionality helps them easily orchestrate LLM workloads. The NVIDIA AI integration can help developers build, train, tune and scale AI with even greater efficiency.
Ray and the Anyscale Platform run on accelerated computing from leading clouds, with the option to run on hybrid or multi-cloud computing. This helps developers easily scale up as they need more computing to power a successful LLM deployment.
The collaboration will also enable developers to begin building models on their workstations through NVIDIA AI Workbench and scale them easily across hybrid or multi-cloud accelerated computing once it's time to move to production.
NVIDIA AI integrations with Anyscale are in development and expected to be available by the end of the year.
Developers can sign up to get the latest news on this integration as well as a free 90-day evaluation of NVIDIA AI Enterprise.
To learn more, attend the Ray Summit in San Francisco this week.
See this notice regarding NVIDIA's software roadmap.