You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

By clicking "Agree", you agree to the FLUX Non-Commercial License Agreement and acknowledge the Acceptable Use Policy.

FLUX.2 [klein] 9B-KV is an optimized variant of FLUX.2 [klein] 9B with KV-cache support for accelerated multi-reference editing. This variant caches key-value pairs from reference images during the first denoising step, eliminating redundant computation in subsequent steps for significantly faster multi-image editing workflows.

For more information about FLUX.2 [klein], please read our blog post.

Key Features

KV-Cache Optimization: Reference image KV pairs are computed once and cached, reducing computation and speeding up inference by up to 2.5 times for multi-reference editing tasks.
All capabilities of FLUX.2 [klein] 9B: sub-second generation, text-to-image, and multi-reference editing in a single unified model.
Ideal for interactive applications and real-time editing pipelines where the same reference images are used across multiple generations.
9B flow model with 8B Qwen3 text embedder, step-distilled to 4 inference steps.
Available for non-commercial use.

How KV-Caching Works

In standard image editing, reference image tokens are processed at every denoising step. With KV-caching:

Step 0: Full forward pass processes reference tokens and extracts their key-value pairs into a cache.
Steps 1-3: Cached KV pairs are reused, skipping redundant reference token computation.

This is particularly beneficial when:

Editing with multiple reference images
Generating variations with the same references
Building interactive editing applications

Usage

We provide a reference implementation in our GitHub repository.

API Endpoints

FLUX.2 [klein] 9B-KV is available via the BFL API at bfl.ai.

Using with Diffusers 🧨

To use FLUX.2 [klein] 9B-KV with the 🧨 Diffusers python library, first install or upgrade diffusers:

pip install git+https://github.com/huggingface/diffusers.git

Then you can use Flux2KleinKVPipeline to run the model:

import torch
from diffusers import Flux2KleinKVPipeline

device = "cuda"
dtype = torch.bfloat16
model_path = "black-forest-labs/FLUX.2-klein-9b-kv"

pipe = Flux2KleinKVPipeline.from_pretrained(model_path, torch_dtype=dtype)
pipe.to(device)

# Text-to-image (no reference image)
print("Generating text-to-image...")
image = pipe(
    prompt="A cat holding a sign that says hello world",
    height=1024,
    width=1024,
    num_inference_steps=4,
    generator=torch.Generator(device=device).manual_seed(0),
).images[0]
image.save("t2i_output.png")
print("Saved t2i_output.png")

# Image-to-image with KV cache (using the generated image as reference)
print("Generating image-to-image with KV cache...")
image_kv = pipe(
    prompt="A cat dressed like a wizard",
    image=image,
    height=1024,
    width=1024,
    num_inference_steps=4,
    generator=torch.Generator(device=device).manual_seed(0),
).images[0]
image_kv.save("kv_output.png")
print("Saved kv_output.png")

Limitations

This model is not intended or able to provide factual information.
While the model can output text, text rendered may be inaccurate or subject to distortion.
As a statistical model, this checkpoint may represent or amplify biases observed in the training data.
The model may fail to generate output that matches the prompts.
Prompt following is heavily influenced by the prompting style.

Out-of-Scope Use

This model and its derivatives may not be used outside the scope of the license, including for unlawful, fraudulent, defamatory, abusive, or otherwise violative purposes as further explained in our Usage Policies.

Hardware

The FLUX.2 [klein] 9B-KV model fits in ~29GB VRAM and is accessible on NVIDIA RTX 5090 and above.

Responsible AI Development

Black Forest Labs is committed to responsible model development and deployment. Prior to releasing FLUX.2 [klein] 9B-KV, we evaluated and mitigated a number of risks, including child sexual abuse material (CSAM) and nonconsensual intimate imagery (NCII). For detailed information about our mitigations, evaluation processes, content provenance features, and policies, please see our post: Capable, Open, and Safe: Combating AI Misuse.

To report safety concerns, contact safety@blackforestlabs.ai.

License

This model falls under the FLUX Non-Commercial License.

Trademarks & IP

This project may contain trademarks or logos for projects, products, or services. Use of Black Forest Labs and FLUX trademarks or logos in modified versions of this project must not cause confusion or imply sponsorship or endorsement. Any use of third-party trademarks, intellectual property or logos are subject to those third-party's policies.

Downloads last month: 10,341

Model tree for black-forest-labs/FLUX.2-klein-9b-kv

Finetunes

1 model

Quantizations

6 models

Spaces using black-forest-labs/FLUX.2-klein-9b-kv 38

Collection including black-forest-labs/FLUX.2-klein-9b-kv

FLUX.2

Collection

Our second generation of FLUX • 21 items • Updated Apr 6 • 229