On the Duality Between Uniform Convexity and Uniform Smoothness
The duality result between strong convexity and strong smoothness is a well known result, see e.g. [1]. A generalization of this duality is a duality result between more general concepts uniform convexity and uniform smoothness, its proof can be found e.g. in the book of Zalinescu [3, Prop. 3.5.3]. Although the proof idea of Zalinescu is correct, there are some errors and typos in the definitions, theorem statement and even the proof. In this note, we revisit and rigorously prove this result of Zalinescu.
More importantly, we show that uniform convexity on domain⟺uniform smoothness on entire space.
As a consequence strong convexity on domain⟺strong smoothness on entire space.
1. Basic concepts
Let (X,∣∣⋅∣∣) be a normed space and (X∗,∣∣⋅∣∣∗) be its continuous dual space. Let AX be class of non-negative vector-variable functions vanishing at 0, i.e. AX={φ:X→[0,+∞] so that φ(0)=0}. Here, note that we do not assume that the functions in AX are symmetric.
For f:X→[−∞,+∞], its convex conjugatef∗:X∗→[−∞,+∞] is defined as follows
f∗(x∗)=x∈Xsup⟨x∗,x⟩−f(x)
It is clear that if φ∈AX, then φ∗∈AX∗.
For ρ∈AX, f is said to be ρ-uniformly convex on its domain domf, if
f((1−λ)x+λy)+(1−λ)λρ(x−y)≤(1−λ)f(x)+λf(y)
for all x,y∈domf and λ∈[0,1]. By the non-negativity of ρ, uniform convexity implies convexity. When ρ(x)=2α∣∣x∣∣X2, f is said to be α-strongly convex.
Similarly, for σ∈AX, f is said to be σ-uniformly smooth if it is convex and satisfies
f((1−λ)x+λy)+(1−λ)λσ(x−y)≥(1−λ)f(x)+λf(y)
for all x,y∈domf, λ∈[0,1]. When σ(x)=2β∣∣x∣∣X2, f is said to be β-strongly smooth. In particular, we say that f is globally σ-uniformly smooth, if it is σ-uniformly smooth and domf=X.
In the following, we will exploit the following transformation:
{x′=(1−λ)x+λyy′=−x+y⟺{x=x′−λy′y=x′+(1−λ)y′
By this transformation, σ-uniform smoothness has an equivalent definition
f(x′)+(1−λ)λσ(−y′)≥(1−λ)f(x′−λy′)+λf(x′+(1−λ)y′),
for all x′,y′∈X and λ∈[0,1] such that x′−λy′ and x′+(1−λ)y′∈domf.
Remark. Note that if f is uniformly smooth, then the Frechet derivative of f exists and is uniformly continuous, see Theorem 3.5.6. page (xi) 207-208 [3]. The existence of Frechet derivative explains why we call it smoothness. Furthermore, if f is strongly smooth, then the Frechet derivative is even Lipschitz continuous, see Corollary 3.5.7. page 212-213 (vi) [2].
2. Duality Result
Theorem (Revise [3, Prop. 3.5.3])
Assume that f is a closed proper convex function.
If f is ρ-uniformly convex on domf, then f∗ is ρ∗-uniformly smooth on X∗.
If f is σ-uniformly smooth on X, then f∗ is σ∗-uniformly convex on domf∗.
Proof.
(s.convex ⇒ s. smooth) Assume that f is ρ-uniformly convex.
Note that ∂f∗(x∗)=x∈Xargmax⟨x∗,x⟩−f(x),∀x∗∈X∗.
Since f is closed, proper and uniformly convex, the argmax is well-defined and singleton. So is ∂f∗. Thus, f∗ is differentiable everywhere on X∗ and domf∗=X∗.
Let x0∗,x1∗∈X∗ be arbitrary and λ∈[0,1].
To prove that f∗ is globally ρ∗-uniformly smooth, we show that
(1−λ)f∗(x0∗)+λf∗(x1∗)≤f∗(x∗)+λ(1−λ)ρ∗(−y∗).
where x∗:=(1−λ)x0∗+λx1∗ and y∗:=−x0∗+x1∗. Conversely, we have x0∗:=x∗−λy∗ and x1∗:=x∗+(1−λ)y∗.
Let γ0<f∗(x0∗) and γ1<f∗(x1∗). By definition of conjugate, there exist x0,x1∈domf such that γ0<⟨x0,x0∗⟩−f(x0) and γ1<⟨x1,x1∗⟩−f(x1). Multiplying the first inequality by 1−λ and the second by λ and then adding both side to Fenchel’s inequality, 0≤f(xλ)+f∗(x∗)−⟨xλ,x∗⟩, where xλ=(1−λ)x0+λx1, we can end up with
Here, we implicitly apply the fact that f is ρ-uniformly convex at the second inequality. Taking γi→f∗(xi∗) for i=1,2, the ρ∗-uniform smoothness of f∗ is proved.
(s.smooth ⇒ s.convex) Suppose that f is globally σ-uniformly smooth. Let any x0∗,x1∗∈domf∗ and λ∈[0,1]. Denote xλ∗:=(1−λ)x0∗+λx1∗. For any x,y∈X, we have
Remark. If we only assume that f is uniformly smooth on its domain, then it is not sufficient to guarantee the uniform convexity of f∗. A counter example in Rn is f(x)=21∣∣x∣∣22+I(∣∣x∣∣2≤1), then
In this case, f is 1-strongly smooth on its domain, but f∗ contains linear part, thus not strictly/uniformly/strongly convex.
3. References
[1] Kakade, Sham, Shai Shalev-Shwartz, and Ambuj Tewari. “On the duality of strong convexity and strong smoothness: Learning applications and matrix regularization.” (2009): 35.
[2] Zalinescu, C. “On uniformly convex functions.” Journal of Mathematical Analysis and Applications 95.2 (1983): 344-374.
[3] Zalinescu, Constantin. “Convex analysis in general vector spaces”. World scientific, 2002.