Flow Matching

June 18, 2025

by Leonardo

1. Flow Matching (FM) and Conditional Flow Matching (CFM)

Symbol	Description	Type/Dimension
$𝑝_{0} (𝑥)$	Base distribution (typically simple, e.g., Gaussian)	Probability density
$𝑝_{1} (𝑥)$	Target data distribution	Probability density
$𝑝_{𝑡} (𝑥)$	Probability path at time $𝑡$ , connecting $𝑝_{0}$ and $𝑝_{1}$	Probability density
$𝑝_{𝑡} (𝑥 \| 𝑧)$	Conditional probability path given conditioning variable $𝑧$	Conditional density
$𝑧$	Conditioning variable for constructing conditional flows	Random variable
$𝑢_{𝑡} (𝑥)$	Velocity field at time $𝑡$ (marginal)	$ℝ^{𝑑}$
$𝑢_{𝑡} (𝑥 \| 𝑧)$	Conditional velocity field given condition $𝑧$	$ℝ^{𝑑}$
$𝑢_{𝑡}^{𝜃} (𝑥)$	Neural network velocity field with parameters $𝜃$	$ℝ^{𝑑}$
$𝜓_{𝑡} (𝑥_{0} \| 𝑧)$	Conditional flow map from initial point $𝑥_{0}$ to time $𝑡$	$ℝ^{𝑑}$
$𝑡$	Time variable, typically $𝑡 \in [0, 1]$	$ℝ$
$𝑥$	Spatial position variable	$ℝ^{𝑑}$
$𝑥_{0}$	Position at initial time $(𝑡 = 0)$	$ℝ^{𝑑}$
$𝑥_{1}$	Position at final time $(𝑡 = 1)$	$ℝ^{𝑑}$
$𝜃$	Neural network parameters	Parameter space
$𝐿^{CFM} (𝜃)$	Conditional Flow Matching loss function	$ℝ_{+}$
$𝐿^{FM} (𝜃)$	Flow Matching loss function (intractable)	$ℝ_{+}$

1.1. Flow Matching Overview

Flow Matching is a framework for training continuous normalizing flows by learning velocity fields that transform a simple base distribution into a target data distribution. The key insight is to parameterize the transformation through a time-dependent velocity field $𝑢_{𝜃} (𝑥, 𝑡)$ that defines an ordinary differential equation.

Given a probability path $𝑝_{𝑡} (𝑥)$ that interpolates between $𝑝_{0}$ (base distribution) and $𝑝_{1}$ (data distribution), the velocity field must satisfy the continuity equation:

\frac{\partial 𝑝_{𝑡} (𝑥)}{\partial 𝑡} + \nabla \cdot (𝑝_{𝑡} (𝑥) 𝑢_{𝜃} (𝑥, 𝑡)) = 0

However, directly learning $𝑢_{𝜃} (𝑥, 𝑡)$ from this equation is challenging because:

We don't know the true $𝑝_{𝑡} (𝑥)$ for intermediate times
The continuity equation provides insufficient supervision
There are infinitely many velocity fields satisfying the boundary conditions

CFM solves these issues by introducing conditional probability paths that enable tractable training.

1.2. Conditional Flow Matching (CFM)

The goal of CFM is to find a velocity field $𝑢_{𝜃} (𝑥, 𝑡)$ . However, there exists an infinite number of velocity fields that can satisfy the continuity equation for a given probability path. In order to get supervision for all $𝑡$ , one must fully specify a probability path and its corresponding velocity field.

1.2.1. How to fully specify a probability path $𝑝_{𝑡}$ and velocity field $𝑢_{𝑡}$ ?

The key challenge is that solving the continuity equation $\partial_{𝑡} 𝑝_{𝑡} + \nabla \cdot (𝑢_{𝑡} 𝑝_{𝑡}) = 0$ for $𝑢_{𝑡}$ given $𝑝_{𝑡}$ has infinitely many solutions. CFM's core idea is to avoid this difficulty by constructively defining both the probability path and velocity field through:

Choose a conditioning variable $𝑧$
Design conditional probability paths $𝑝_{𝑡} (𝑥 | 𝑧)$ with known flow maps $𝜓_{𝑡} (𝑥_{0} | 𝑧)$
Obtain the velocity field analytically via $𝑢_{𝑡} (𝑥 | 𝑧) = \partial_{𝑡} 𝜓_{𝑡} (𝑥_{0} | 𝑧)$

We want to ensure two conditions are met:

The induced global probability $𝑝_{𝑡} (𝑥) = 𝐸_{𝑧} [𝑝_{𝑡} (𝑥 | 𝑧)]$ transforms $𝑝_{0}$ into $𝑝_{1}$ .
The associated velocity field $𝑢_{𝑡} (𝑥 | 𝑧)$ has an analytic form obtained from the flow construction.

1.2.2. Linear Interpolation

1.2.2.1. Conditioning on Base and Target Points

The conditional variable $𝑧$ is defined as

𝑧 \overset{choice}{=} (𝑥_{0}, 𝑥_{1}) \sim 𝑝_{0} \times 𝑝_{1}

1.2.2.2. Flow Construction and Velocity Field

We construct a deterministic linear flow between $𝑥_{0}$ and $𝑥_{1}$ :

𝜓_{𝑡} (𝑥_{0} | 𝑧 = (𝑥_{0}, 𝑥_{1})) \overset{def}{=} (1 - 𝑡) \cdot 𝑥_{0} + 𝑡 \cdot 𝑥_{1}

This induces the probability path:

𝑝_{𝑡} (𝑥 | 𝑧 = (𝑥_{0}, 𝑥_{1})) \overset{def}{=} 𝛿_{(1 - 𝑡) \cdot 𝑥_{0} + 𝑡 \cdot 𝑥_{1}} (𝑥)

The velocity field is obtained by differentiating the flow map:

𝑢_{𝑡} (𝑥 | 𝑧 = (𝑥_{0}, 𝑥_{1})) = \partial_{𝑡} 𝜓_{𝑡} (𝑥_{0} | 𝑧) = 𝑥_{1} - 𝑥_{0}

We can verify that this velocity field satisfies the continuity equation.

1.2.3. Conical Gaussian Paths

1.2.3.1. Alternative Conditioning Choice

We can make other choices for the conditional variable:

𝑧 \overset{choice}{=} 𝑥_{1} \sim 𝑝_{1}

1.2.3.2. Flow Construction

We construct a flow that starts from a standard Gaussian and converges to the target point:

𝜓_{𝑡} (𝑥_{0} | 𝑧 = 𝑥_{1}) \overset{def}{=} 𝑡 𝑥_{1} + (1 - 𝑡) 𝑥_{0}, 𝑥_{0} \sim 𝑁 (0, 𝐼)

This induces the conditional probability path:

𝑝_{𝑡} (𝑥 | 𝑧 = 𝑥_{1}) \overset{def}{=} 𝑁 (𝑡 𝑥_{1}, {(1 - 𝑡)}^{2} 𝐼 𝑑)

The corresponding velocity field is:

𝑢_{𝑡} (𝑥 | 𝑧 = 𝑥_{1}) = \partial_{𝑡} 𝜓_{𝑡} (𝑥_{0} | 𝑧) = \frac{𝑥_{1} - 𝑥}{1 - 𝑡}

where we use the fact that $𝑥_{0} = \frac{𝑥 - 𝑡 𝑥_{1}}{1 - 𝑡}$ by inverting the flow map.

1.2.4. General construction of conditional probability paths

The general CFM construction follows these steps:

First, choose a conditioning variable $𝑧$ (independent of $𝑡$ )
Second, design flow maps $𝜓_{𝑡} (𝑥_{0} | 𝑧)$ that connect the source and target distributions
Third, obtain conditional probability paths $𝑝_{𝑡} (𝑥 | 𝑧)$ as the push-forward of the source under the flow
Fourth, derive velocity fields $𝑢_{𝑡} (𝑥 | 𝑧) = \partial_{𝑡} 𝜓_{𝑡} (𝑥_{0} | 𝑧)$ analytically

The conditional probability paths must satisfy the boundary conditions:

\begin{matrix} \forall 𝑥, 𝐸_{𝑧} [𝑝_{0} (𝑥 | 𝑧)] = 𝑝_{0} (𝑥), \\ \forall 𝑥, 𝐸_{𝑧} [𝑝_{1} (𝑥 | 𝑧)] = 𝑝_{1} (𝑥) \end{matrix}

This construction ensures that:

We avoid solving the ill-posed continuity equation
The velocity field has an analytical, tractable form
The global probability path correctly interpolates between $𝑝_{0}$ and $𝑝_{1}$

1.2.5. From Conditional to Unconditional Velocity

Figure 1: A Flow is represented with a Velocity field defining a random process generating a Probability path. The main idea of Flow Matching is to break down the construction of a complex flow satisfying the desired Boundary conditions to conditional flows satisfying simpler Boundary conditions and consequently easier to solve. The arrows indicate dependencies between different objects: Blue arrows signify relationships employed by the Flow Matching framework.

Theorem: Let $𝑧$ be any random variable independent of $𝑡$ . Choose conditional probability paths $𝑝_{𝑡} (𝑥 | 𝑧)$ , and let $𝑢_{𝑡} (𝑥 | 𝑧)$ be the velocity field associated to these paths. Then the marginal velocity field $𝑢_{𝑡} (𝑥)$ associated to the probability path $𝑝_{𝑡} (𝑥) = 𝐸_{𝑧} [𝑝_{𝑡} (𝑥 | 𝑧)]$ has a closed-form formula:

\forall 𝑡, 𝑥, 𝑢_{𝑡} (𝑥) = 𝐸_{𝑧 | 𝑥} [𝑢_{𝑡} (𝑥 | 𝑧)]

This is intractable in general, so we use a neural network $𝑢_{𝑡}^{𝜃} (𝑥)$ to estimate. The training objective is to minimize the tractable Conditional Flow Matching (CFM) loss:

𝐿^{CFM} (𝜃) = 𝐸_{𝑡, 𝑧, 𝑥} {‖ 𝑢_{𝑡}^{𝜃} (𝑥) - 𝑢_{𝑡} (𝑥 | 𝑧) ‖}^{2}

We can use the above loss because it's equivalent to directly regressing against the intractable unknown vector field $𝑢_{𝑡} (𝑥)$ ¹:

𝐿^{CFM} (𝜃) = 𝐸_{𝑥, 𝑡} {‖ 𝑢_{𝑡}^{𝜃} (𝑥) - 𝑢_{𝑡} (𝑥) ‖}^{2} + 𝐶 = 𝐿^{FM} (𝜃) + 𝐶

2. Rectified Flow

Rectified Flow is a specific and powerful instantiation of the Flow Matching framework. It simplifies the construction of the probability path and velocity field by focusing on creating the straightest possible trajectories between points from the source and target distributions. This approach not only provides a clear and simple training objective but also leads to highly efficient generative models.

The core idea is to "rectify" the coupling between the base distribution $𝑝_{0} (𝑥)$ and the target distribution $𝑝_{1} (𝑥)$ . Instead of arbitrary or complex conditional paths, Rectified Flow learns an Ordinary Differential Equation (ODE) that transports mass along straight lines.

2.1. 1-Rectified Flow: The Direct Path

The initial Rectified Flow, often called the 1-rectified flow, is constructed in a manner very similar to the linear interpolation method in CFM.

2.1.1. Construction

We start by creating the simplest possible coupling between the base and target distributions: an independent coupling. We draw a pair of samples, $𝑥_{0} \sim 𝑝_{0} (𝑥)$ and $𝑥_{1} \sim 𝑝_{1} (𝑥)$ , and define a straight-line path between them.

The flow map is a direct linear interpolation:

𝜓_{𝑡} (𝑥_{0}, 𝑥_{1}) = (1 - 𝑡) 𝑥_{0} + 𝑡 𝑥_{1}

This is the same as the "Linear Interpolation" above. The key difference in Rectified Flow is the focus on this specific construction and its iterative refinement.

The velocity field for this path is constant with respect to time for a given pair $(𝑥_{0}, 𝑥_{1})$ :

𝑢_{𝑡} (𝑥 | 𝑧 = (𝑥_{0}, 𝑥_{1})) = \partial_{𝑡} 𝜓_{𝑡} (𝑥_{0}, 𝑥_{1}) = 𝑥_{1} - 𝑥_{0}

2.1.2. The "Rectified" Velocity Field

While individual paths are straight, the marginal velocity field $𝑢_{𝑡} (𝑥)$ at a point $𝑥$ is the average velocity of all straight-line paths that pass through $𝑥$ at time $𝑡$ . This averaging process is what "rectifies" the flow. The resulting marginal velocity field is generally non-linear and complex, and it defines a deterministic flow that transforms $𝑝_{0}$ to $𝑝_{1}$ .

The training objective for a neural network $𝑢_{𝑡}^{𝜃} (𝑥)$ is a straightforward regression problem, identical to the CFM loss but with this specific choice of conditional velocity:

𝐿^{RF1} (𝜃) = 𝐸_{𝑡, 𝑥_{0}, 𝑥_{1}} {‖ 𝑢_{𝑡}^{𝜃} ((1 - 𝑡) 𝑥_{0} + 𝑡 𝑥_{1}) - (𝑥_{1} - 𝑥_{0}) ‖}^{2}

This loss aims to learn the expected velocity $𝐸 [𝑥_{1} - 𝑥_{0} | 𝑥_{𝑡} = 𝑥]$ where $𝑥_{𝑡} = (1 - 𝑡) 𝑥_{0} + 𝑡 𝑥_{1}$ .

The resulting trained model $𝑢_{𝑡}^{𝜃} (𝑥)$ can then be used to generate samples by solving the ODE $\frac{𝑑 𝑥_{𝑡}}{𝑑 𝑡} = 𝑢_{𝑡}^{𝜃} (𝑥_{𝑡})$ from $𝑡 = 0$ to $𝑡 = 1$ , starting with a sample $𝑥_{0} \sim 𝑝_{0} (𝑥)$ .

2.2. 2-Rectified Flow (and beyond): The "Reflow" Procedure

A key innovation of Rectified Flow is the reflow procedure. While the 1-rectified flow is a significant step, its trajectories are only perfectly straight if the model perfectly learns the conditional expectation. In practice, the generated paths from the 1-rectified flow model will have some curvature.

The reflow procedure aims to iteratively straighten these paths.

2.2.1. The Reflow Algorithm

Train the 1-Rectified Flow: First, train a velocity field $𝑢_{𝑡}^{1} (𝑥)$ using the direct straight-line paths between $𝑝_{0}$ and $𝑝_{1}$ as described above.
Generate a New Paired Dataset: Use the trained model $𝑢_{𝑡}^{1} (𝑥)$ to generate a new set of paired samples.
- Sample $𝑧_{0} \sim 𝑝_{0} (𝑥)$ .
- Solve the ODE $\frac{𝑑 𝑧_{𝑡}}{𝑑 𝑡} = 𝑢_{𝑡}^{1} (𝑧_{𝑡})$ from $𝑡 = 0$ to $𝑡 = 1$ to obtain the corresponding endpoint $𝑧_{1}$ .
- This creates a new dataset of pairs $(𝑧_{0}, 𝑧_{1})$ that represent a deterministic coupling induced by the 1-rectified flow.
Train the 2-Rectified Flow: Train a new velocity field $𝑢_{𝑡}^{2} (𝑥)$ using the same loss function, but on the new data pairs $(𝑧_{0}, 𝑧_{1})$ .

𝐿^{RF2} (𝜃) = 𝐸_{𝑡, 𝑧_{0}, 𝑧_{1}} {‖ 𝑢_{𝑡}^{𝜃} ((1 - 𝑡) 𝑧_{0} + 𝑡 𝑧_{1}) - (𝑧_{1} - 𝑧_{0}) ‖}^{2}

This process can be repeated to create 3-rectified flows and so on, with each iteration producing increasingly straight trajectories.

Proof that $𝐿^{CFM} (𝜃) = 𝐸_{𝑥, 𝑡} {‖ 𝑢_{𝑡}^{𝜃} (𝑥) - 𝑢_{𝑡} (𝑥) ‖}^{2} + 𝐶$ :

We need to show that minimizing the CFM loss is equivalent to minimizing the intractable loss up to a constant. We prove this by showing the gradients are equal.

Let $𝐷 (𝑎, 𝑏) = {‖ 𝑎 - 𝑏 ‖}^{2}$ be the squared L2 distance. Then:
$\begin{matrix} 𝐿^{CFM} (𝜃) & = 𝐸_{𝑡, 𝑧, 𝑥} 𝐷 (𝑢_{𝑡}^{𝜃} (𝑥), 𝑢_{𝑡} (𝑥 | 𝑧)) \\ 𝐿^{FM} (𝜃) & = 𝐸_{𝑥, 𝑡} 𝐷 (𝑢_{𝑡}^{𝜃} (𝑥), 𝑢_{𝑡} (𝑥)) \end{matrix}$
Taking gradients:
$\begin{matrix} \nabla_{𝜃} 𝐿^{FM} (𝜃) & = \nabla_{𝜃} 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} 𝐷 (𝑢_{𝑡} (𝑋_{𝑡}), 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡})) \\ = 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} \nabla_{𝜃} 𝐷 (𝑢_{𝑡} (𝑋_{𝑡}), 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡})) \\ = 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} \nabla_{𝑣} 𝐷 (𝑢_{𝑡} (𝑋_{𝑡}), 𝑣) |_{𝑣 = 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡})} \nabla_{𝜃} 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡}) \\ = 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} \nabla_{𝑣} 𝐷 (𝐸_{𝑍 \sim 𝑝_{𝑍 | 𝑡} (\cdot | 𝑋_{𝑡})} [𝑢_{𝑡} (𝑋_{𝑡} | 𝑍)], 𝑣) |_{𝑣 = 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡})} \nabla_{𝜃} 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡}) \\ = 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} 𝐸_{𝑍 \sim 𝑝_{𝑍 | 𝑡} (\cdot | 𝑋_{𝑡})} [\nabla_{𝑣} 𝐷 (𝑢_{𝑡} (𝑋_{𝑡} | 𝑍), 𝑣) |_{𝑣 = 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡})}] \nabla_{𝜃} 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡}) \\ = 𝐸_{𝑡, 𝑋_{𝑡} \sim 𝑝_{𝑡}} 𝐸_{𝑍 \sim 𝑝_{𝑍 | 𝑡} (\cdot | 𝑋_{𝑡})} [\nabla_{𝜃} 𝐷 (𝑢_{𝑡} (𝑋_{𝑡} | 𝑍), 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡}))] \\ = \nabla_{𝜃} 𝐸_{𝑡, 𝑍 \sim 𝑞, 𝑋_{𝑡} \sim 𝑝_{𝑡 | 𝑍} (\cdot | 𝑍)} [𝐷 (𝑢_{𝑡} (𝑋_{𝑡} | 𝑍), 𝑢_{𝑡}^{𝜃} (𝑋_{𝑡}))] \\ = \nabla_{𝜃} 𝐿^{CFM} (𝜃) \end{matrix}$

🔒 Access Restricted

Access Control