Sony Pixel Power calrec Sony

NVIDIA RTX AI Accelerates FLUX.1 Kontext - Now Available for Download

02/07/2025

Black Forest Labs, one of the world's leading AI research labs, just changed the game for image generation.

The lab's FLUX.1 image models have earned global attention for delivering high-quality visuals with exceptional prompt adherence. Now, with its new FLUX.1 Kontext model, the lab is fundamentally changing how users can guide and refine the image generation process.

To get their desired results, AI artists today often use a combination of models and ControlNets - AI models that help guide the outputs of an image generator. This commonly involves combining multiple ControlNets or using advanced techniques like the one used in the NVIDIA AI Blueprint for 3D-guided image generation, where a draft 3D scene is used to determine the composition of an image.

The new FLUX.1 Kontext model simplifies this by providing a single model that can perform both image generation and editing, using natural language.

NVIDIA has collaborated with Black Forest Labs to optimize FLUX.1 Kontext [dev] for NVIDIA RTX GPUs using the NVIDIA TensorRT software development kit and quantization to deliver faster inference with lower VRAM requirements.

For creators and developers alike, TensorRT optimizations mean faster edits, smoother iteration and more control - right from their RTX-powered machines.

The FLUX.1 Kontext [dev] Flex: In-Context Image Generation Black Forest Labs in May introduced the FLUX.1 Kontext family of image models which accept both text and image prompts.

These models allow users to start from a reference image and guide edits with simple language, without the need for fine-tuning or complex workflows with multiple ControlNets.

FLUX.1 Kontext is an open-weight generative model built for image editing using a guided, step-by-step generation process that makes it easier to control how an image evolves, whether refining small details or transforming an entire scene. Because the model accepts both text and image inputs, users can easily reference a visual concept and guide how it evolves in a natural and intuitive way. This enables coherent, high-quality image edits that stay true to the original concept.

FLUX.1 Kontext's key capabilities include:

Character Consistency: Preserve unique traits across multiple scenes and angles.

Localized Editing: Modify specific elements without altering the rest of the image.

Style Transfer: Apply the look and feel of a reference image to new scenes.

Real-Time Performance: Low-latency generation supports fast iteration and feedback.

Black Forest Labs last week released FLUX.1 Kontext weights for download in Hugging Face, as well as the corresponding TensorRT-accelerated variants.

Three side-by-side images of the same graphic of coffee and snacks on a table with flowers, showing an example of multi-turn editing possible with the FLUX.1 Kontext [dev] model. The original image (left); the first edit transforms it into a Bauhaus style image (middle) and the second edit changes the color style of the image with a pastel palette (right).Traditionally, advanced image editing required complex instructions and hard-to-create masks, depth maps or edge maps. FLUX.1 Kontext [dev] introduces a much more intuitive and flexible interface, blending step-by-step edits with cutting-edge optimization for diffusion model inference.

The [dev] model emphasizes flexibility and control. It supports capabilities like character consistency, style preservation and localized image adjustments, with integrated ControlNet functionality for structured visual prompting.

FLUX.1 Kontext [dev] is already available in ComfyUI and the Black Forest Labs Playground, with an NVIDIA NIM microservice version expected to release in August.

Optimized for RTX With TensorRT Acceleration FLUX.1 Kontext [dev] accelerates creativity by simplifying complex workflows. To further streamline the work and broaden accessibility, NVIDIA and Black Forest Labs collaborated to quantize the model - reducing the VRAM requirements so more people can run it locally - and optimized it with TensorRT to double its performance.

The quantization step enables the model size to be reduced from 24GB to 12GB for FP8 (Ada) and 7GB for FP4 (Blackwell). The FP8 checkpoint is optimized for GeForce RTX 40 Series GPUs, which have FP8 accelerators in their Tensor Cores. The FP4 checkpoint is optimized for GeForce RTX 50 Series GPUs for the same reason and uses a new method called SVDQuant, which preserves high image quality while reducing model size.

TensorRT - a framework to access the Tensor Cores in NVIDIA RTX GPUs for maximum performance - provides over 2x acceleration compared with running the original BF16 model with PyTorch.

Speedup compared with BF16 GPU (left, higher is better) and memory usage required to run FLUX.1 Kontext [dev] in different precisions (right, lower is better).Learn more about NVIDIA optimizations and how to get started with FLUX.1 Kontext [dev] on the NVIDIA Technical Blog.

Get Started With FLUX.1 Kontext FLUX.1 Kontext [dev] is available on Hugging Face (Torch and TensorRT).

AI enthusiasts interested in testing these models can download the Torch variants and use them in ComfyUI. Black Forest Labs has also made available an online playground for testing the model.

For advanced users and developers, NVIDIA is working on sample code for easy integration of TensorRT pipelines into workflows. Check out the DemoDiffusion repository to come later this month.

But Wait, There's More Google last week announced the release of Gemma 3n, a new multimodal small language model ideal for running on NVIDIA GeForce RTX GPUs and the NVIDIA Jetson platform for edge AI and robotics.

AI enthusiasts can use Gemma 3n models with RTX accelerations in Ollama and Llama.cpp with their favorite apps, such as AnythingLLM and LM Studio.

Performance tested in June 2025 with Gemma 3n in Ollama, with 4 b
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-flux-kontext-nim-tensorrt/...
See more stories from nvidia

More from Nvidia

28/08/2025

Drop Into the Battle: Gears of War: Reloaded Unleashed' Launches on GeForce NOW

Brace yourself, COGs - the Locusts aren't the only thing rising up. The Coal...

28/08/2025

Game On: How Modders Reimagine Classic Games With NVIDIA RTX Remix and Generative AI

Last week at Gamescom, NVIDIA announced the winners of the NVIDIA and ModDB RTX ...

27/08/2025

How Do You Teach an AI Model to Reason? With Humans

AI models are advancing at a rapid rate and scale. But what might they lack that (most) humans don't? Common sense: an understanding, developed through rea...

25/08/2025

NVIDIA Jetson Thor Unlocks Real-Time Reasoning for General Robotics and Physical AI

Robots around the world are about to get a lot smarter as physical AI developers...

25/08/2025

Take It for a Spin: NVIDIA Rolls Out DRIVE AGX Thor Developer Kit to World's Automotive Developers

As autonomous vehicle systems rapidly grow in complexity, equipped with reasonin...

22/08/2025

Inside NVIDIA Blackwell Ultra: The Chip Powering the AI Factory Era

As the latest member of the NVIDIA Blackwell architecture family, the NVIDIA Blackwell Ultra GPU builds on core innovations to accelerate training and AI reason...

22/08/2025

Hot Topics at Hot Chips: Inference, Networking, AI Innovation at Every Scale - All Built on NVIDIA

AI reasoning, inference and networking will be top of mind for attendees of next...

21/08/2025

RIKEN, Japan's Leading Science Institute, Taps Fujitsu and NVIDIA for Next Flagship Supercomputer

Japan is once again building a landmark high-performance computing system - not ...

21/08/2025

Think SMART: How to Optimize AI Factory Inference Performance

From AI assistants doing deep research to autonomous vehicles making split-second navigation decisions, AI adoption is exploding across industries. Behind ever...

21/08/2025

Gearing Up for the Gigawatt Data Center Age

Across the globe, AI factories are rising - massive new data centers built not to serve up web pages or email, but to train and deploy intelligence itself. Inte...

21/08/2025

GeForce NOW Brings RTX 5080 Power to the Ultimate Membership

Get a glimpse into the future of gaming. The NVIDIA Blackwell RTX architecture is coming to GeForce NOW in September, marking the service's biggest upgrade...

20/08/2025

Into the Omniverse: How OpenUSD and Digital Twins Are Powering Industrial and Physical AI

Editor's note: This blog is a part of Into the Omniverse, a series focused o...

18/08/2025

At Gamescom 2025, NVIDIA DLSS 4 and Ray Tracing Come to This Year's Biggest Titles

With over 175 games now supporting NVIDIA DLSS 4 - a suite of advanced, AI-power...

18/08/2025

New Lightweight AI Model for Project G-Assist Brings Support for 6GB NVIDIA GeForce RTX and RTX PRO GPUs

At Gamescom, NVIDIA is releasing its first major update to Project G Assist - an...

15/08/2025

Now We're Talking: NVIDIA Releases Open Dataset, Models for Multilingual Speech AI

Of around 7,000 languages in the world, a tiny fraction are supported by AI lang...

14/08/2025

NVIDIA, National Science Foundation Support Ai2 Development of Open AI Models to Drive U.S. Scientific Leadership

NVIDIA is partnering with the U.S. National Science Foundation (NSF) to create a...

14/08/2025

Warhammer 40,000: Dawn of War - Definitive Edition' Storms GeForce NOW at Launch

Warhammer 40,000: Dawn of War - Definitive Edition is marching onto GeForce NOW,...

13/08/2025

FLUX.1 Kontext NVIDIA NIM Microservice Now Available for Download

Black Forest Labs' FLUX.1 Kontext [dev] image editing model is now available as an NVIDIA NIM microservice. FLUX.1 models allow users to edit existing imag...

11/08/2025

Amazon Devices & Services Achieves Major Step Toward Zero-Touch Manufacturing With NVIDIA AI and Digital Twins

Using NVIDIA digital twin technologies, Amazon Devices & Services is powering bi...

11/08/2025

Mini Footprint, Mighty AI: NVIDIA Blackwell Architecture Powers AI Acceleration in Compact Workstations

Packing the power of the NVIDIA Blackwell architecture in compact, energy-effici...

11/08/2025

Making Safer Spaces: NVIDIA and Partners Bring Physical AI to Cities and Industrial Infrastructure

Physical AI is becoming the foundation of smart cities, facilities and industria...

07/08/2025

The Saga Continues: Stream 2K's Mafia: The Old Country' at Launch on GeForce NOW

This GFN Thursday brings an offer members can't refuse - 2K's highly ant...

05/08/2025

OpenAI and NVIDIA Propel AI Innovation With New Open Models Optimized for the World's Largest AI Inference Infrastructure

Two new open-weight AI reasoning models from OpenAI released today bring cutting...

05/08/2025

OpenAI's New Open Models Accelerated Locally on NVIDIA GeForce RTX and RTX PRO GPUs

In collaboration with OpenAI, NVIDIA has optimized the company's new open-so...

05/08/2025

Delivering 1.5M TPS Inference on NVIDIA GB200 NVL72, NVIDIA Accelerates OpenAI gpt-oss Models From Cloud to Edge

NVIDIA and OpenAI began pushing the boundaries of AI with the launch of NVIDIA D...

05/08/2025

No Backdoors. No Kill Switches. No Spyware.

NVIDIA GPUs are at the heart of modern computing. They're used across industries - from healthcare and finance to scientific research, autonomous systems an...

31/07/2025

Embark on Epic Adventures in August With a Dozen New Games Coming to GeForce NOW

August brings new levels of gaming excitement on GeForce NOW, with 2,300 titles now available to stream in the cloud. Grab a controller and get ready for epic ...

31/07/2025

Wired for Action: Langflow Enables Local AI Agent Creation on NVIDIA RTX PCs

Interest in generative AI is continuing to grow, as new models include more capabilities. With the latest advancements, even enthusiasts without a developer bac...

29/07/2025

FourCastNet 3 Enables Fast and Accurate Large Ensemble Weather Forecasting With Scalable Geometric ML

FourCastNet3 (FCN3) is the latest AI global weather forecasting system from NVID...

28/07/2025

How New GB300 NVL72 Features Provide Steady Power for AI

The electrical grid is designed to support loads that are relatively steady, such as lighting, household appliances, and industrial machines that operate at con...

24/07/2025

Creative Agency Black Mixture Creates Stunning Visuals With Generative AI Powered by NVIDIA RTX

For media company Black Mixture, AI isn't just a tool - it's an entire p...

24/07/2025

WUCHANG: Fallen Feathers' Lands in the Cloud

Sharpen the blade and brace for a journey steeped in myth and mystery. WUCHANG: Fallen Feathers has launched in the cloud. Ride in style with skateboarding leg...

23/07/2025

Into the Omniverse: How Global Brands Are Scaling Personalized Advertising With AI and 3D Content Generation

In today's fast-evolving digital landscape, marketing teams face increasing ...

22/07/2025

AI On: How Financial Services Companies Use Agentic AI to Enhance Productivity, Efficiency and Security

Editor's note: This post is part of the AI On blog series, which explores th...

17/07/2025

GeForce NOW Delivers Justice With RoboCop: Rogue City - Unfinished Business'

Listen up citizens, the law is back and patrolling the cloud. Nacon's RoboCop Rogue City - Unfinished Business launches today in the cloud, bringing justice...

15/07/2025

Deadline Extended - Create a Project G-Assist Plug-In for a Chance to Win an NVIDIA GeForce RTX GPU and Laptop

Submissions for NVIDIA's Plug and Play: Project G-Assist Plug-In Hackathon a...

14/07/2025

NVIDIA CEO Jensen Huang Promotes AI in Washington, DC and China

This month, NVIDIA founder and CEO Jensen Huang promoted AI in both Washington, D.C. and Beijing - emphasizing the benefits that AI will bring to business and s...

11/07/2025

A Gaming GPU Helps Crack the Code on a Thousand-Year Cultural Conversation

Ceramics - the humble mix of earth, fire and artistry - have been part of a global conversation for millennia. From Tang Dynasty trade routes to Renaissance pa...

10/07/2025

From Terabytes to Turnkey: AI-Powered Climate Models Go Mainstream

In the race to understand our planet's changing climate, speed and accuracy are everything. But today's most widely used climate simulators often strugg...

10/07/2025

Indonesia on Track to Achieve Sovereign AI Goals With NVIDIA, Cisco and IOH

As one of the world's largest emerging markets, Indonesia is making strides toward its Golden 2045 Vision - an initiative tapping digital technologies and...

10/07/2025

Reach the PEAK' on GeForce NOW

Grab a friend and climb toward the clouds - PEAK is now available on GeForce NOW, enabling members to try the hugely popular indie hit on virtually any device. ...

10/07/2025

How to Run Coding Assistants for Free on RTX AI PCs and Workstations

Coding assistants or copilots - AI-powered assistants that can suggest, explain and debug code - are fundamentally changing how software is developed for both e...

08/07/2025

Asking an Encyclopedia-Sized Question: How To Make the World Smarter with Multi-Million Token Real-Time Inference

Modern AI applications increasingly rely on models that combine huge parameter c...

03/07/2025

GeForce NOW's 20 July Games Bring the Heat to the Cloud

The forecast this month is showing a 100% chance of epic gaming. Catch the scorching lineup of 20 titles coming to the cloud, which gamers can play whether indo...

02/07/2025

NVIDIA RTX AI Accelerates FLUX.1 Kontext - Now Available for Download

Black Forest Labs, one of the world's leading AI research labs, just changed the game for image generation. The lab's FLUX.1 image models have earned g...

01/07/2025

How AI Factories Can Help Relieve Grid Stress

In many parts of the world, including major technology hubs in the U.S., there's a yearslong wait for AI factories to come online, pending the buildout of n...

26/06/2025

Run Google DeepMind's Gemma 3n on NVIDIA Jetson and RTX

As of today, NVIDIA now supports the general availability of Gemma 3n on NVIDIA RTX and Jetson. Gemma, previewed by Google DeepMind at Google I/O last month, in...

26/06/2025

Into the Omniverse: World Foundation Models Advance Autonomous Vehicle Simulation and Safety

Editor's note: This blog is a part of Into the Omniverse, a series focused o...

26/06/2025

Startup Uses NVIDIA RTX-Powered Generative AI to Make Coolers, Cooler

Mark Theriault founded the startup FITY envisioning a line of clever cooling products: cold drink holders that come with freezable pucks to keep beverages cold ...