Donglin Yang1,3 · Yongxing Zhang2 · Xin Yu1 · Liang Hou3 · Xin Tao3 · Pengfei Wan3 · Xiaojuan Qi1 · Renjie Liao2

1 HKU · 2 UBC · 3 Kling Team, Kuaishou Technology
[project page] [arXiv] [🤗 checkpoints]
Summary — By explicitly characterizing the variance of flow matching, we identify 1) a high-variance regime near the prior, where optimization is challenging, and 2) a low-variance regime near the data distribution, where conditional and marginal velocities nearly coincide. Leveraging this insight, we propose Stable Velocity, a unified framework that improves both training and sampling. For training, we introduce Stable Velocity Matching (StableVM), an unbiased variance-reduction objective, along with Variance-Aware Representation Alignment (VA-REPA), which adaptively strengthens auxiliary supervision in the low-variance regime. For inference, we show that the dynamics in the low-variance regime admit closed-form simplifications, enabling Stable Velocity Sampling (StableVS), a finetuning-free acceleration method.
Conditional Flow Matching (CFM) trains neural velocity fields by regressing onto single-sample conditional velocities. While these targets are unbiased estimates of the marginal velocity, they can exhibit high variance, especially when the marginal distribution is still close to the prior. Empirically, this variance is highly non-uniform over time.
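For concreteness, with a linear interpolation path (our notation here; t = 0 at the prior, t = 1 at the data), the CFM objective regresses onto the single-sample conditional velocity x_1 - x_0:

```math
x_t = (1 - t)\,x_0 + t\,x_1, \qquad
\mathcal{L}_{\mathrm{CFM}}(\theta) = \mathbb{E}_{t,\,x_0,\,x_1}\,
\bigl\| v_\theta(x_t, t) - (x_1 - x_0) \bigr\|^2 .
```

The conditional expectation E[x_1 - x_0 | x_t] is the marginal velocity the network ultimately learns; the gap between the single-sample target and this expectation is exactly the variance studied here.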
Flow Matching naturally decomposes into two regimes:

- Low-variance regime (near the data) — the posterior concentrates on a single reference sample, and conditional and marginal velocities nearly coincide.
- High-variance regime (near the prior) — the posterior spreads over many samples, leading to large variance in conditional velocity targets.
This regime structure becomes even more pronounced for high-dimensional data such as ImageNet latents. Why does this happen? Near the prior, many distinct data samples are consistent with the same noisy input, so their conditional velocities disagree; near the data, the posterior collapses onto essentially one sample.
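The toy probe below (entirely our own construction: a 1-D two-mode Gaussian mixture under a linear path with data at t = 1; all names are hypothetical) makes this concrete by estimating the posterior variance of the conditional velocity target at several times:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_data(n):
    """Toy 1-D data: a two-mode Gaussian mixture (modes at ±4, std 0.3)."""
    modes = rng.choice([-4.0, 4.0], size=n)
    return modes + 0.3 * rng.standard_normal(n)

def target_variance(x_t, t, n=200_000):
    """Monte-Carlo estimate of Var[u_t | x_t] for the linear path
    x_t = (1 - t) x_0 + t x_1 with x_0 ~ N(0, 1), where the conditional
    velocity is u_t = (x_1 - x_t) / (1 - t). Since p(x_t | x_1) is
    N(t x_1, (1 - t)^2), data samples are weighted by that likelihood."""
    x1 = sample_data(n)
    logw = -0.5 * ((x_t - t * x1) / (1.0 - t)) ** 2
    w = np.exp(logw - logw.max())
    w /= w.sum()                            # self-normalized posterior weights
    u = (x1 - x_t) / (1.0 - t)              # conditional velocities
    mean = np.sum(w * u)
    return np.sum(w * (u - mean) ** 2)

# Probe a point that tracks the +4 mode along the path.
for t in [0.05, 0.30, 0.60, 0.90, 0.99]:
    print(f"t = {t:4.2f}   Var[u_t] ≈ {target_variance(4.0 * t, t):8.2f}")
```

Running this, the estimated variance is largest at small t, where both modes carry non-negligible posterior weight, and decays as the posterior concentrates near the data end.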
Stable Velocity is a unified framework that leverages this variance structure to improve both training and sampling.
StableVM replaces the single-sample target with a multi-sample, self-normalized estimator under a multi-sample conditional path (see the sketch after the list below).
- Unbiased estimator of the true marginal velocity
- Strictly lower variance than standard CFM
- Compatible with general stochastic interpolants
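To convey the flavor of such an estimator, here is a minimal sketch under a linear Gaussian path (data at t = 1). Everything here is our simplification: the function name and the memory-bank interface are hypothetical, and the exact StableVM estimator, including the multi-sample conditional path that keeps it unbiased, is defined in the paper.

```python
import torch

def multi_sample_velocity_target(x_t, t, x1_bank):
    """Self-normalized, multi-sample velocity target (our sketch, not the
    exact StableVM estimator). Assumes the linear path
    x_t = (1 - t) x_0 + t x_1 with x_0 ~ N(0, I), so that
    p(x_t | x_1) = N(t x_1, (1 - t)^2 I) and
    u_t(x_t | x_1) = (x_1 - x_t) / (1 - t).

    x_t: (d,) current noisy point; x1_bank: (K, d) candidate data samples,
    e.g. drawn from a per-class memory bank.
    """
    diff = x_t - t * x1_bank                            # (K, d)
    logw = -0.5 * diff.pow(2).sum(-1) / (1.0 - t) ** 2  # log p(x_t | x_1) + const
    w = torch.softmax(logw, dim=0)                      # posterior weights over the bank
    u = (x1_bank - x_t) / (1.0 - t)                     # per-sample conditional velocities
    return (w.unsqueeze(-1) * u).sum(dim=0)             # ≈ marginal velocity at (x_t, t)
```

With K = 1 the weighted average degenerates to the standard single-sample CFM target; larger banks average out posterior disagreement, which is where the variance reduction comes from.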
Representation alignment methods (e.g., REPA) are effective only when the noisy input retains semantic information. From a variance perspective, this occurs exclusively in the low-variance regime.
Applying representation alignment uniformly along the diffusion trajectory therefore introduces noisy supervision. VA-REPA activates alignment only in the low-variance regime, leading to consistent improvements over REPA and its variants.
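As a sketch of what such gating can look like (the function and its parameters are our own guess, loosely mirroring the `--proj-weight-schedule=sigmoid` and `--proj-tau=0.7` flags in the training command below):

```python
import torch

def va_repa_weight(t, tau=0.7, temp=0.05):
    """Hypothetical sigmoid gate on the alignment loss (names and shape are
    our assumption). With t = 1 at the data end, the weight is ~0 in the
    high-variance regime near the prior and ~1 once the posterior has
    concentrated in the low-variance regime."""
    return torch.sigmoid((t - tau) / temp)

# total_loss = flow_matching_loss + va_repa_weight(t) * alignment_loss
```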
In the low-variance regime, the probability flow dynamics admit closed-form simplifications. StableVS exploits this structure to enable finetuning-free acceleration of pretrained models, achieving more than 2× faster sampling in the low-variance regime without perceptible degradation in sample quality.
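To illustrate the kind of simplification involved (our derivation, under a linear path with data at t = 1; the exact StableVS update rule is in the paper): the marginal velocity can be written as u_s(x) = (E[x_1 | x_s = x] - x) / (1 - s). In the low-variance regime the posterior mean is nearly frozen at some point x̂_1, and freezing it makes the probability-flow ODE solvable in closed form:

```math
\frac{\mathrm{d}x_s}{\mathrm{d}s} = \frac{\hat{x}_1 - x_s}{1 - s}
\quad\Longrightarrow\quad
x_s = \hat{x}_1 + \frac{1 - s}{1 - t}\,\bigl(x_t - \hat{x}_1\bigr), \qquad t \le s \le 1 .
```

The remaining trajectory is a straight line into x̂_1, so once this regime is reached the sampler can take cheap analytic substeps instead of further full network evaluations, consistent with the Euler + StableVS step splits in the demos below.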
Prompt: "A turquoise river winds through a lush canyon. Thick moss and dense ferns blanket the rocky walls; multiple waterfalls cascade from above, enveloped in mist. At noon, sunlight filters through the dense canopy, dappling the river surface with shimmering light. The atmosphere is humid and fresh, pulsing with primal jungle vitality. No humans, text, or artificial traces present."
| Euler (30 steps) | Euler (20 steps) | Euler (11 steps) + StableVS (9 steps) |
|---|---|---|
| ![]() | ![]() | ![]() |
Prompt: "A cat holding a sign that says Stable Velocity"
| Euler (30 steps) | Euler (20 steps) | Euler (11 steps) + StableVS (9 steps) |
|---|---|---|
| ![]() | ![]() | ![]() |
Prompt: "A 20-year-old East Asian girl with delicate, charming features and large, bright brown eyes—expressive and lively, with a cheerful or subtly smiling expression. Her naturally wavy long hair is either loose or tied in twin ponytails. She has fair skin and light makeup accentuating her youthful freshness. She wears a modern, cute dress or relaxed outfit in bright, soft colors—lightweight fabric, minimalist cut. She stands indoors at an anime convention, surrounded by banners, posters, or stalls. Lighting is typical indoor illumination—no staged lighting—and the image resembles a casual iPhone snapshot: unpretentious composition, yet brimming with vivid, fresh, youthful charm."
| Euler (30 steps) | Euler (17 steps) | Euler (8 steps) + StableVS (9 steps) |
|---|---|---|
| ![]() | ![]() | ![]() |
Prompt: "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage."
| UniPC (30 steps) | UniPC (20 steps) | UniPC (11 steps) + StableVS (9 steps) |
|---|---|---|
| ![]() | ![]() | ![]() |
Prompt: "A horse jumps over a fence."
| UniPC (30 steps) | UniPC (20 steps) | UniPC (11 steps) + StableVS (9 steps) |
|---|---|---|
| ![]() | ![]() | ![]() |
```bash
cd StableVM
conda create -n stablevm python=3.12 -y
conda activate stablevm
pip install -r requirements.txt
```

Training — Launch StableVM + VA-REPA:
```bash
accelerate launch train.py --model=SiT-XL/2 --data-dir=/path/to/data \
  --loss-type=stablevm --bank-capacity-per-class=256 --prefill-bank-fully \
  --use-proj-loss --proj-weight-schedule=sigmoid --proj-tau=0.7 \
  --allow-tf32 --mixed-precision=fp16
```

Evaluation — Generate 50K samples for FID:
```bash
torchrun --nnodes=1 --nproc_per_node=4 generate.py --model SiT-XL/2 \
  --ckpt /path/to/checkpoint.pt --num-fid-samples 50000 --use-projector \
  --mode=sde --num-steps=250 --cfg-scale=1.8 --guidance-high=0.7
```

📖 For the full list of options, toy GMM experiments, and more details, see the StableVM README.
StableVS provides drop-in custom schedulers for the diffusers library, enabling finetuning-free accelerated sampling on pretrained models.
```bash
cd StableVS
conda create -n stablevs python=3.10 -y
conda activate stablevs

# Install PyTorch (adjust cu121 to match your CUDA version)
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121

# Install diffusers and StableVS
pip install "diffusers[test]"
pip install -e .
```

Basic Usage — Replace the scheduler in any diffusers pipeline:
```python
import torch
from stablevs import StableVSFlowMatchScheduler
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16)

# Swap in the StableVS scheduler; the rest of the pipeline is unchanged.
pipe.scheduler = StableVSFlowMatchScheduler(
    num_train_timesteps=1000,
    use_fast_low_schedule=True,
    fast_low_split_point=0.85,
    fast_low_low_substeps=9,
    low_region_noise_factor=0.0,
)

image = pipe("a beautiful landscape", num_inference_steps=30, guidance_scale=3.5).images[0]
```

Quick Demos — Compare baseline vs StableVS schedulers:
```bash
# Text-to-Image (SD3.5, Flux, Qwen-Image)
python examples/t2i_demo.py --models sd35,flux,qwen --output-dir ./figures

# Text-to-Video (Wan2.2)
python examples/t2v_demo.py --output-dir ./videos

# Print sigma schedules only (no GPU needed)
python examples/t2i_demo.py --print-sigmas-only
python examples/t2v_demo.py --print-sigmas-only
```

📖 For the full list of supported schedulers, benchmark scripts, and parameter details, see the StableVS README.
If you find our paper or code useful, please consider citing:
```bibtex
@misc{yang2026stablevelocityvarianceperspective,
  title={Stable Velocity: A Variance Perspective on Flow Matching},
  author={Donglin Yang and Yongxing Zhang and Xin Yu and Liang Hou and Xin Tao and Pengfei Wan and Xiaojuan Qi and Renjie Liao},
  year={2026},
  eprint={2602.05435},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2602.05435},
}
```

- StableVM and VA-REPA code release
- Model checkpoints for StableVM and VA-REPA
- StableVS code release
- Project page release
Feel free to contact Donglin Yang or open a GitHub issue if you find any bugs.

















