Sony Pixel Power calrec Sony

NVIDIA Launches Array of New CUDA Libraries to Expand Accelerated Computing and Deliver Order-of-Magnitude Speedup to Science and Industrial Applications

26/08/2024

News summary: New libraries in accelerated computing deliver order-of-magnitude speedups and reduce energy consumption and costs in data processing, generative AI, recommender systems, AI data curation, data processing, 6G research, AI-physics and more. They include:

LLM applications: NeMo Curator, to create custom datasets, adds image curation and Nemotron-4 340B for high-quality synthetic data generation

Data processing: cuVS for vector search to build indexes in minutes instead of days and a new Polars GPU Engine in open beta

Physical AI: For physics simulation, Warp accelerates computations with a new TIle API. For wireless network simulation, Aerial adds more map formats for ray tracing and simulation. And for link-level wireless simulation, Sionna adds a new toolchain for real-time inference

Companies around the world are increasingly turning to NVIDIA accelerated computing to speed up applications they first ran on CPUs only. This has enabled them to achieve extreme speedups and benefit from incredible energy savings.

In Houston, CPFD makes computational fluid dynamics simulation software for industrial applications, like its Barracuda Virtual Reactor software that helps design next-generation recycling facilities. Plastic recycling facilities run CPFD software in cloud instances powered by NVIDIA accelerated computing. With a CUDA GPU-accelerated virtual machine, they can efficiently scale and run simulations 400x faster and 140x more energy efficiently than using a CPU-based workstation.

Bottles being loaded into a plastics recycling facility. AI-generated image. A popular video conferencing application captions several hundred thousand virtual meetings an hour. When using CPUs to create live captions, the app could query a transformer-powered speech recognition AI model three times a second. After migrating to GPUs in the cloud, the application's throughput increased to 200 queries per second - a 66x speedup and 25x energy-efficiency improvement.

In homes across the globe, an e-commerce website connects hundreds of millions of shoppers a day to the products they need using an advanced recommendation system powered by a deep learning model, running on its NVIDIA accelerated cloud computing system. After switching from CPUs to GPUs in the cloud, it achieved significantly lower latency with a 33x speedup and nearly 12x energy-efficiency improvement.

With the exponential growth of data, accelerated computing in the cloud is set to enable even more innovative use cases.

NVIDIA Accelerated Computing on CUDA GPUs Is Sustainable Computing NVIDIA estimates that if all AI, HPC and data analytics workloads that are still running on CPU servers were CUDA GPU-accelerated, data centers would save 40 terawatt-hours of energy annually. That's the equivalent energy consumption of 5 million U.S. homes per year.

Accelerated computing uses the parallel processing capabilities of CUDA GPUs to complete jobs orders of magnitude faster than CPUs, improving productivity while dramatically reducing cost and energy consumption.

Although adding GPUs to a CPU-only server increases peak power, GPU acceleration finishes tasks quickly and then enters a low-power state. The total energy consumed with GPU-accelerated computing is significantly lower than with general-purpose CPUs, while yielding superior performance.

GPUs achieve 20x greater energy efficiency compared to traditional computing on CPU-only servers because they deliver greater performance per watt, completing more tasks in less time. In the past decade, NVIDIA AI computing has achieved approximately 100,000x more energy efficiency when processing large language models. To put that into perspective, if the efficiency of cars improved as much as NVIDIA has advanced the efficiency of AI on its accelerated computing platform, they'd get 500,000 miles per gallon. That's enough to drive to the moon, and back, on less than a gallon of gasoline.

In addition to these dramatic boosts in efficiency on AI workloads, GPU computing can achieve incredible speedups over CPUs. Customers of the NVIDIA accelerated computing platform running workloads on cloud service providers saw speedups of 10-180x across a gamut of real-world tasks, from data processing to computer vision, as the chart below shows.

Speedups of 10-180x achieved in real-world implementations by cloud customers across workloads with the NVIDIA accelerated computing platform. As workloads continue to demand exponentially more computing power, CPUs have struggled to provide the necessary performance, creating a growing performance gap and driving compute inflation. The chart below illustrates a multiyear trend of how data growth has far outpaced the growth in compute performance per watt of CPUs.

The widening gap between data growth and the lagging compute performance per watt of CPUs. The energy savings of GPU acceleration frees up what would otherwise have been wasted cost and energy.

With its massive energy-efficiency savings, accelerated computing is sustainable computing.

The Right Tools for Every Job GPUs cannot accelerate software written for general-purpose CPUs. Specialized algorithm software libraries are needed to accelerate specific workloads. Just like a mechanic would have an entire toolbox from a screwdriver to a wrench for different tasks, NVIDIA provides a diverse set of libraries to perform low-level functions like parsing and executing calculations on data.

Each NVIDIA CUDA library is optimized to harness hardware features specific to NVIDIA GPUs. Combined, they encompass the power of the NVIDIA platform.

New updates continue to be added on the CUDA platform roadmap, expanding across diverse use cases:

LLM Applications NeMo Curator gives developers the flexibility to quickly create custom datasets in large language model (LLM) use cases. Recently
LINK: https://blogs.nvidia.com/blog/cuda-accelerated-computing-energy-effici...
See more stories from nvidia

More from Nvidia

08/10/2024

Accelerated Computing Key to Quantum Research

A recently released joint research paper by NVIDIA, Moderna and Yale reviews how techniques from quantum machine learning (QML) may enhance drug discovery metho...

08/10/2024

Pittsburgh Steels Itself for Innovation With Launch of NVIDIA AI Tech Community

Serving as a bridge for academia, industry and public-sector groups to partner on artificial intelligence innovation, NVIDIA is launching its inaugural AI Tech ...

08/10/2024

TSMC and NVIDIA Transform Semiconductor Manufacturing With Accelerated Computing

TSMC, the world leader in semiconductor manufacturing, is moving to production with NVIDIA's computational lithography platform, called cuLitho, to accelera...

08/10/2024

SETI Institute Researchers Engage in World's First Real-Time AI Search for Fast Radio Bursts

This summer, scientists supercharged their tools in the hunt for signs of life b...

08/10/2024

A Not-So-Secret Agent: NVIDIA Unveils NIM Blueprint for Cybersecurity

Artificial intelligence is transforming cybersecurity with new generative AI tools and capabilities that were once the stuff of science fiction. And like many o...

08/10/2024

US Healthcare System Deploys AI Agents, From Research to Rounds

The U.S. healthcare system is adopting digital health agents to harness AI across the board, from research laboratories to clinical settings. The latest AI-acc...

07/10/2024

Foxconn to Build Taiwan's Fastest AI Supercomputer With NVIDIA Blackwell

NVIDIA and Foxconn are building Taiwan's largest supercomputer, marking a milestone in the island's AI advancement. The project, Hon Hai Kaohsiung Supe...

03/10/2024

No Tricks, Just Games: GeForce NOW Thrills With 22 Games in October

The air is crisp, the pumpkins are waiting to be carved, and GFN Thursday is ready to deliver some gaming thrills. GeForce NOW is unleashing a monster mash of ...

03/10/2024

How AI and Accelerated Computing Drive Energy Efficiency

AI isn't just about building smarter machines. It's about building a greener world. From optimizing energy use to reducing emissions, AI and accelerate...

02/10/2024

Brave New World: Leo AI and Ollama Bring RTX-Accelerated Local LLMs to Brave Browser Users

Editor's note: This post is part of the AI Decoded series, which demystifies...

01/10/2024

NVIDIA AI Summit DC: Industry Leaders Gather to Showcase AI's Real-World Impact

Washington, D.C., is where possibility has always met policy, and AI presents un...

27/09/2024

Bon Voyage: NIO Unveils ONVO L60 Smart Electric SUV, Built on NVIDIA DRIVE Orin

NIO's smart EV brand, ONVO, has unveiled the L60 flagship mid-size family SUV, built on the NVIDIA DRIVE Orin system-on-a-chip. Earlier this year, the auto...

26/09/2024

A Whole New World: GreedFall II: The Dying World' Joins GeForce NOW

Whether looking for a time-traveling adventure, strategic roleplay or epic action, anyone can find something to play on GeForce NOW, with over 2,000 games in th...

25/09/2024

Decoding How AI Can Accelerate Data Science Workflows

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

23/09/2024

To Save Lives, and Energy, Wellcome Sanger Institute Speeds Cancer Research With NVIDIA Accelerated Computing

The Wellcome Sanger Institute, a key contributor to the international Human Geno...

23/09/2024

NVIDIA Partners for Globally Inclusive AI in U.S. Government Initiative

NVIDIA is joining the U.S. government's launch of the Partnership for Global Inclusivity on AI (PGIAI), providing Deep Learning Institute training, GPU cred...

23/09/2024

High-Speed AI: Hitachi Rail Advances Real-Time Railway Analysis Using NVIDIA Technology

Hitachi Rail, a global transportation company powering railway systems in over 5...

20/09/2024

Medical Centers Tap AI, Federated Learning for Better Cancer Detection

A committee of experts from top U.S. medical centers and research institutes is harnessing NVIDIA-powered federated learning to evaluate the impact of federated...

19/09/2024

We've Fused Signal Processing and AI': NVIDIA CEO Outlines Future of Telecom at T-Mobile's Capital Markets Day

In a surprise appearance at T-Mobile's Capital Markets Day, NVIDIA founder a...

19/09/2024

Climate Week Forecast: Outlook Improving With AI, Accelerated Computing

All the electricity that powers NVIDIA's global operations will come from renewable sources by the end of January. It's the right fuel for the company&...

19/09/2024

FINAL FANTASY XVI' Soars Into the Cloud With GeForce NOW

GeForce NOW makes gamers' fantasies a reality by bringing top titles to the cloud. This week, the award-winning FINAL FANTASY XVI is available for members t...

18/09/2024

NVIDIA AI Aerial Launches to Optimize Wireless Networks, Deliver New Generative AI Experiences on One Platform

Telecommunications providers are transforming beyond voice and data services wit...

18/09/2024

How SonicJobs Uses AI Agents to Connect the Internet, Starting with Jobs

Companies in the US spend $15bn annually on talent acquisition. The most important metric in recruitment advertising is the conversion from the paid click on th...

17/09/2024

New AI Innovation Hub in Tunisia Drives Technological Advancement Across Africa

A new AI innovation hub for developers across Tunisia launched today in Novation City, a technology park that's designed to cultivate a vibrant, innovation ...

17/09/2024

Upgrade Livestreams With Twitch Enhanced Broadcasting and the NVIDIA Encoder

At TwitchCon - a global convention for the Twitch livestreaming platform-livestreamers and content creators this week can experience the latest technologies for...

12/09/2024

GeForce NOW to Bring Dead Rising Deluxe Remaster' to the Cloud at Launch

Rise and shine - Capcom's latest action-adventure game, Dead Rising Deluxe Remaster, heads to the cloud at launch next week. It's part of nine new titl...

11/09/2024

AI on the Air: Behind the Scenes at IBC With Holoscan for Media

AI is transforming the broadcast industry by enhancing the way content is created, distributed and consumed - but integrating the technology can be challenging....

11/09/2024

NVIDIA and Oracle to Accelerate AI and Data Processing for Enterprises

Enterprises are looking for increasingly powerful compute to support their AI workloads and accelerate data processing. The efficiency gained can translate to b...

11/09/2024

Ready to Roll: Nuro to License Its Autonomous Driving System

To accelerate autonomous vehicle development and deployment timelines, Nuro announced today it will license its Nuro Driver autonomous driving system directly t...

09/09/2024

Live Media Reimagined: NVIDIA Holoscan for Media Now Available for Production

Companies in broadcast, sports and streaming are transitioning to software-defined infrastructure to benefit from flexible deployment and to more easily adopt t...

06/09/2024

How AI Is Personalizing Customer Service Experiences Across Industries

Customer service departments across industries are facing increased call volumes, high customer service agent turnover, talent shortages and shifting customer e...

05/09/2024

19 New Games to Drop for GeForce NOW in September

Fall will be here soon, so leaf it to GeForce NOW to bring the games, with 19 joining the cloud in September. Get started with the seven games available to str...

05/09/2024

Three Ways to Ride the Flywheel of Cybersecurity AI

The business transformations that generative AI brings come with risks that AI itself can help secure in a kind of flywheel of progress. Companies who were qui...

04/09/2024

Volvo Cars EX90 SUV Rolls Out, Built on NVIDIA Accelerated Computing and AI

Volvo Cars' new, fully electric EX90 is making its way from the automaker's assembly line in Charleston, South Carolina, to dealerships around the U.S. ...

04/09/2024

Do the Math: New RTX AI PC Hardware Delivers More AI, Faster

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

04/09/2024

Hammer Time: Machina Labs' Edward Mehr on Autonomous Blacksmith Bots and More

Edward Mehr works where AI meets the anvil. The company he cofounded, Machina L...

04/09/2024

Manufacturing Intelligence: Deltia AI Delivers Assembly Line Gains With NVIDIA Metropolis and Jetson

It all started at Berlin's Merantix venture studio in 2022, when Silviu Homo...

29/08/2024

From RAG to Richness: Startup Uplevels Retrieval-Augmented Generation for Enterprises

Well before OpenAI upended the technology industry with its release of ChatGPT i...

29/08/2024

Crystal-Clear Gaming: Visions of Mana' Sharpens on GeForce NOW

It's time to mana-fest the spirit of adventure with Square Enix's highly anticipated action role-playing game, Visions of Mana, launching today in the c...

28/08/2024

NVIDIA Blackwell Sets New Standard for Generative AI in MLPerf Inference Debut

As enterprises race to adopt generative AI and bring new services to market, the demands on data center infrastructure have never been greater. Training large l...

28/08/2024

More Than Fine: Multi-LoRA Support Now Available in NVIDIA RTX AI Toolkit

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

27/08/2024

From Prototype to Prompt: NVIDIA NIM Agent Blueprints Fast-Forward Next Wave of Enterprise Generative AI

The initial wave of generative AI was driven by its use in internet services tha...

27/08/2024

Better Molecules, Faster: NVIDIA NIM Agent Blueprint Redefines Hit Identification With Generative AI-Based Virtual Screening

Aiming at making the process faster and smarter, NVIDIA on Wednesday released th...

26/08/2024

NVIDIA Launches NIM Microservices for Generative AI in Japan, Taiwan

Nations around the world are pursuing sovereign AI to produce artificial intelligence using their own computing infrastructure, data, workforce and business net...

23/08/2024

NVIDIA to Present Innovations at Hot Chips That Boost Data Center Performance and Energy Efficiency

A deep technology conference for processor and system architects from industry a...

22/08/2024

Straight Out of Gamescom and Into Xbox PC Games, GeForce NOW Newly Supports Automatic Xbox Sign-In

Straight out of Gamescom, NVIDIA introduced GeForce NOW support for Xbox automat...

21/08/2024

How Snowflake Is Unlocking the Value of Data With Large Language Models

Snowflake is using AI to help enterprises transform data into insights and applications. In this episode of NVIDIA's AI Podcast, host Noah Kravitz and Baris...

21/08/2024

Lightweight Champ: NVIDIA Releases Small Language Model With State-of-the-Art Accuracy

Developers of generative AI typically face a tradeoff between model size and acc...

21/08/2024

SLMming Down Latency: How NVIDIA's First On-Device Small Language Model Makes Digital Humans More Lifelike

Editor's note: This post is part of the AI Decoded series, which demystifies...