Introduction
The Kantorovich dual problem is a common reformulation of the Kantorovich problem. The case is an important special case for several reasons. First off, the minimum cost is the 2-Wasserstein distance. From the point of view of mathematical analysis, the quadratic cost is special because the -transform for is closely related to convex conjugates, and many tools from convex analysis can be used. Moreover, in the quadratic case the Monge optimal transport map is of the form for a convex function (as discussed in the section on Brenier’s theorem).
Definitions
Definition of the convex conjugate
Suppose . The convex conjugate of is defined by Here, is the dot product of and . The map is called the Legendre transform.
The Relationship between c-Concavity and Convex Conjugates
The following is Proposition 1.21 in (Santambrogio 2015). The proof below mirrors the proof there.
Theorem 1 (Relationship between the c-transform and Legendre transform) Given a proper function , define by . Then . Moreover, a proper function is c-concave if and only if is convex and lower semi-continuous.
Proof. Let , and let . We compute:
This proves .
It remains to show that is c-concave if and only if is convex and lower semi-continuous.
Let . Assume is c-concave. Then, for some . Now, . By Fenchel Moreau this implies is convex and lower semicontinous.
Now, assume is convex and lower semicontinuous. Then, by Fenchel Moreau, for some function . Define (so ). Then, , which implies . Thus, is -concave.
The Dual problem
Let and be Polish spaces. Let and be probability measures on and respectively. Let be lower semicontinuous. The dual problem of the Kantorovich problem is where the supremum is take over all such that .
There is one thing here that isn’t standard: sometimes you take instead , instead of . That is, you maximize over the smaller set of continuous and bounded functions. Whenever and are Polish spaces is dense in so the supremum is the same. However, working in is can be convenient because sometimes maximizers of Equation 1 are non-continuous functions.
Solvability of the Dual Problem with Quadratic Cost
Theorem 2 Let and be (Borel) probability measures on . Assume Then, there is a maximizer to the dual Kantorovich problem. Moreover, and are both convex and .
This is Theorem 1.40 in (Santambrogio 2015) (p.32), a proof can be found there. For more general , the dual problem still has a solution (and the maximum of the dual problem is equal to the infimum in the primal problem) as is discussed here. However, much of the proof in the non-quadratic case relies on theory of the -transform instead of convex analysis. The book (Santambrogio 2015) is also a good resource to see the proof in the general case.
References
Santambrogio, Filippo. 2015. “Optimal Transport for Applied Mathematicians.” Birkäuser, NY 55 (58-63): 94.