
Large language model development is about to reach supersonic speed thanks to a collaboration between NVIDIA and Anyscale.
At its annual Ray Summit developers conference, Anyscale - the company behind the fast growing open-source unified compute framework for scalable computing - announced today that it is bringing NVIDIA AI to Ray open source and the Anyscale Platform. It will also be integrated into Anyscale Endpoints, a new service announced today that makes it easy for application developers to cost-effectively embed LLMs in their applications using the most popular open source models.
These integrations can dramatically speed generative AI development and efficiency while boosting security for production AI, from proprietary LLMs to open models such as Code Llama, Falcon, Llama 2, SDXL and more.
Developers will have the flexibility to deploy open-source NVIDIA software with Ray or opt for NVIDIA AI Enterprise software running on the Anyscale Platform for a fully supported and secure production deployment.
Ray and the Anyscale Platform are widely used by developers building advanced LLMs for generative AI applications capable of powering intelligent chatbots, coding copilots and powerful search and summarization tools.
NVIDIA and Anyscale Deliver Speed, Savings and Efficiency Generative AI applications are captivating the attention of businesses around the globe. Fine-tuning, augmenting and running LLMs requires significant investment and expertise. Together, NVIDIA and Anyscale can help reduce costs and complexity for generative AI development and deployment with a number of application integrations.
NVIDIA TensorRT-LLM, new open-source software announced last week, will support Anyscale offerings to supercharge LLM performance and efficiency to deliver cost savings. Also supported in the NVIDIA AI Enterprise software platform, Tensor-RT LLM automatically scales inference to run models in parallel over multiple GPUs, which can provide up to 8x higher performance when running on NVIDIA H100 Tensor Core GPUs, compared to prior-generation GPUs.
TensorRT-LLM automatically scales inference to run models in parallel over multiple GPUs and includes custom GPU kernels and optimizations for a wide range of popular LLM models. It also implements the new FP8 numerical format available in the NVIDIA H100 Tensor Core GPU Transformer Engine and offers an easy-to-use and customizable Python interface.
NVIDIA Triton Inference Server software supports inference across cloud, data center, edge and embedded devices on GPUs, CPUs and other processors. Its integration can enable Ray developers to boost efficiency when deploying AI models from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS XGBoost and more.
With the NVIDIA NeMo framework, Ray users will be able to easily fine-tune and customize LLMs with business data, paving the way for LLMs that understand the unique offerings of individual businesses.
NeMo is an end-to-end, cloud-native framework to build, customize and deploy generative AI models anywhere. It features training and inferencing frameworks, guardrailing toolkits, data curation tools and pretrained models, offering enterprises an easy, cost-effective and fast way to adopt generative AI.
Options for Open-Source or Fully Supported Production AI Ray open source and the Anyscale Platform enable developers to effortlessly move from open source to deploying production AI at scale in the cloud.
The Anyscale Platform provides fully managed, enterprise-ready unified computing that makes it easy to build, deploy and manage scalable AI and Python applications using Ray, helping customers bring AI products to market faster at significantly lower cost.
Whether developers use Ray open source or the supported Anyscale Platform, Anyscale's core functionality helps them easily orchestrate LLM workloads. The NVIDIA AI integration can help developers build, train, tune and scale AI with even greater efficiency.
Ray and the Anyscale Platform run on accelerated computing from leading clouds, with the option to run on hybrid or multi-cloud computing. This helps developers easily scale up as they need more computing to power a successful LLM deployment.
The collaboration will also enable developers to begin building models on their workstations through NVIDIA AI Workbench and scale them easily across hybrid or multi-cloud accelerated computing once it's time to move to production.
NVIDIA AI integrations with Anyscale are in development and expected to be available by the end of the year.
Developers can sign up to get the latest news on this integration as well as a free 90-day evaluation of NVIDIA AI Enterprise.
To learn more, attend the Ray Summit in San Francisco this week or watch the demo video below.
See this notice regarding NVIDIA's software roadmap.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
06/11/2025
MELVILLE, N.Y. Canon USA has launched the CR-N400 and CR-N350, two new Pan-Tilt-Zoom (PTZ) cameras designed to deliver high image quality, versatile connectivit...
06/11/2025
NEW YORK Fox Sports said its coverage of the Los Angeles Dodgers' win over the Toronto Blue Jays in the decisive Game 7 of the 2025 World Series delivered 2...
06/11/2025
NEW YORK DoubleVerify and Roku, Inc. are reporting that they have seen a marked reduction in fraudulent ad requests imitating Roku device traffic across the s...
06/11/2025
CAMBRIDGE, England Faced with rising inflation and worries that the economy is weakening, consumers are prioritizing their spending on popular streaming service...
06/11/2025
STAMFORD, Conn. NBC Sports is mounting mics in unconventional places to bring the sounds of NASCAR into the living rooms of sports fans....
06/11/2025
BRANSON, Mo. Link Electronics has partnered with Aberdeen Broadcast Services to provide a real-time, dual-stream translation and captioning service....
06/11/2025
Scripps Research team identifies sugar molecules that trigger placental formation Study reveals how sugar-protein interactions are critical for the placenta dur...
05/11/2025
College Hoops Preview 2025: ESPN Remote-Ops Team Preps for Massive Slate of Men&...
05/11/2025
University of Iowa Centralizes Video Production With Dual Control Rooms at Carve...
05/11/2025
On Monday night, Ed Sheeran and Spotify lit up The Royal Dublin Society in Dublin for a one-night-only performance. The occasion? The third installment of Billi...
05/11/2025
Cumbia has long been woven into daily life in Argentina, and its popularity on S...
05/11/2025
La cumbia forma parte del d a a d a de los argentinos desde siempre, y su popula...
05/11/2025
Earlier this year, our in-house publishing imprint, Spotify Audiobooks, put out ...
05/11/2025
SBS showcases next generation of pro cyclists in extended broadcast deal with Pr...
05/11/2025
Photo credit: CIVMEC...
05/11/2025
A Nielsen survey of 500 directors and millions of monday.com workflows reveals t...
05/11/2025
Prime Video's VoD offerings included for the first time in standardized AGF Measurement
Frankfurt, October 28, 2025. AGF Videoforschung, together with Ama...
05/11/2025
A 13-year-old company that serves as a master control/disaster recovery hub for PBS stations says it has seen little financial impact from cuts to public broadc...
05/11/2025
To no one's surprise, the NFL's most dominant franchise in recent years attracts the most TV viewers, according to new research by S&P Global Market Int...
05/11/2025
MELBOURNE, Australia Atomos today introduced Ninja TX GO, a new HDMI monitor-recorder that combines a brighter screen, advanced monitoring tools, professional c...
05/11/2025
WASHINGTON Despite the ongoing government shutdown, Federal Communications Commission Chairman Brendan Carr has announced a tentative agenda for the agency'...
05/11/2025
The College Football Playoff (CFP), ESPN and TNT Sports have announced kick times and broadcast information for the 2025 CFP First Round, which will launch the ...
05/11/2025
NEW YORK IAB Tech Lab, the global digital advertising technical standards-setting body, has announced the release of device attestation support in the industry ...
05/11/2025
SINGAPORE Appear, an Oslo-based provider of live production technology, is opening a new facility in Singapore as part of the company's expansion into the A...
05/11/2025
DALLAS Parks Associates has released new data showing just how far the dramatic shift to streaming services has gone in recent years. Currently, more than nine ...
05/11/2025
Supports existing services while powering 16 new products for government and ent...
05/11/2025
Wednesday 5 November 2025
To view this content, please enable our use of cookie...
05/11/2025
Wednesday 5 November 2025
To view this content, please enable our use of cookie...
05/11/2025
Rohde & Schwarz Mobile Test Summit 2025 on the future of wireless communications...
05/11/2025
Wuppertal November 5, 2025
Riedel RefCam and Easy5G to Make Handball Debut at the Men's EHF EURO 2026The European Handball Federation (EHF) will introduce...
05/11/2025
Back to All News
Netflix's Third Season of Ads and a Look Ahead at Whats Next
Amy Reinhard
President, Advertising
Business
05 November 2025
United Sta...
05/11/2025
Comscore and Polaris I/O Partner to Automate Audience Insights in MarketView for...
05/11/2025
New schedule will be live on-air Monday 10 November
Brand-new Today with David McCullagh from 9am
Oliver Callan in all-new extended show from 11am to 1pm
Kie...
05/11/2025
Explore the future with Science Week on RT
Dive into a week of innovative, themed programming and content across RT television, radio and online
Includes a ...
05/11/2025
Get ready for six weeks of United FC, a brand-new, feel-good teen docuseries kic...
04/11/2025
SVG Sit-Down: Why Professional Fight League CEO John Martin Believes Growth Is I...
04/11/2025
SVG All-Stars: David Koppett, Executive Producer, Live Sports and Studio, NESN a...
04/11/2025
From concept to kick-off: How TAMS could transform sports workflows By Paul Markham
Tuesday, October 28, 2025 - 09:43
Print This Story
Techex tx darwin pr...
04/11/2025
College Hoops Preview 2025: The CW Tips Off Third Season of ACC Men's/Women&...
04/11/2025
College Hoops Preview 2025: Big Ten Network Heats Up for Busy Season With 500 Me...
04/11/2025
College Hoops Preview 2025: CBS Sports Readies 300+ Game Broadcasts Across Its P...
04/11/2025
College Hoops Preview 2025: NBC Sports Slate Features 200+ Big Ten, BIG EAST, an...
04/11/2025
College Hoops Preview 2025: ESPN Remote-Ops Team Preps for Massive Slate of 7,40...
04/11/2025
Never-before-seen footage of Selena Quintanilla and her family's band offers...
04/11/2025
Joel Edgerton at Train Dreams Park City premiere (photo by Soul Brother / Shutterstock for Sundance Film Festival)...
04/11/2025
Today, we announced our third quarter 2025 earnings, marking strong momentum as we surpassed 700 million Monthly Active Users and achieved double-digit subscrib...
04/11/2025
Idag rapporterar vi v rt resultat f r det tredje kvartalet 2025, vilket markerar en stark och fortsatt tillv xt d vi passerade 700 miljoner m natliga aktiva an...
04/11/2025
SBS calls for bold, thought-provoking factual ideas: up to $50,000 in developmen...
04/11/2025
Tomorrow's fight will demand networks that deliver both capacity and survivability, the speed to move mission applications at scale, and the resilience to e...