Convex cones #
THIS FILE IS SYNCHRONIZED WITH MATHLIB4. Any changes to this file require a corresponding PR to mathlib4.
In a 𝕜
-module E
, we define a convex cone as a set s
such that a • x + b • y ∈ s
whenever
x, y ∈ s
and a, b > 0
. We prove that convex cones form a complete_lattice
, and define their
images (convex_cone.map
) and preimages (convex_cone.comap
) under linear maps.
We define pointed, blunt, flat and salient cones, and prove the correspondence between convex cones and ordered modules.
We define convex.to_cone
to be the minimal cone that includes a given convex set.
Main statements #
We prove two extension theorems:
riesz_extension
: M. Riesz extension theorem says that ifs
is a convex cone in a real vector spaceE
,p
is a submodule ofE
such thatp + s = E
, andf
is a linear functionp → ℝ
which is nonnegative onp ∩ s
, then there exists a globally defined linear functiong : E → ℝ
that agrees withf
onp
, and is nonnegative ons
.exists_extension_of_le_sublinear
: Hahn-Banach theorem: ifN : E → ℝ
is a sublinear map,f
is a linear map defined on a subspace ofE
, andf x ≤ N x
for allx
in the domain off
, thenf
can be extended to the whole space to a linear mapg
such thatg x ≤ N x
for allx
We prove the following theorems:
convex_cone.hyperplane_separation_of_nonempty_of_is_closed_of_nmem
: This variant of the hyperplane separation theorem states that given a nonempty, closed, convex coneK
in a complete, real inner product spaceH
and a pointb
disjoint from it, there is a vectory
which separatesb
fromK
in the sense that for all pointsx
inK
,0 ≤ ⟪x, y⟫_ℝ
and⟪y, b⟫_ℝ < 0
. This is also a geometric interpretation of the Farkas lemma.convex_cone.inner_dual_cone_of_inner_dual_cone_eq_self
:
Implementation notes #
While convex 𝕜
is a predicate on sets, convex_cone 𝕜 E
is a bundled convex cone.
References #
Definition of convex_cone
and basic properties #
- carrier : set E
- smul_mem' : ∀ ⦃c : 𝕜⦄, 0 < c → ∀ ⦃x : E⦄, x ∈ self.carrier → c • x ∈ self.carrier
- add_mem' : ∀ ⦃x : E⦄, x ∈ self.carrier → ∀ ⦃y : E⦄, y ∈ self.carrier → x + y ∈ self.carrier
A convex cone is a subset s
of a 𝕜
-module such that a • x + b • y ∈ s
whenever a, b > 0
and x, y ∈ s
.
Instances for convex_cone
- convex_cone.has_sizeof_inst
- convex_cone.set_like
- convex_cone.add_mem_class
- convex_cone.has_inf
- convex_cone.has_Inf
- convex_cone.has_bot
- convex_cone.has_top
- convex_cone.complete_lattice
- convex_cone.inhabited
- convex_cone.has_zero
- convex_cone.has_add
- convex_cone.add_zero_class
- convex_cone.add_comm_semigroup
Equations
- convex_cone.set_like = {coe := convex_cone.carrier _inst_3, coe_injective' := _}
Two convex_cone
s are equal if they have the same elements.
Equations
- convex_cone.has_Inf = {Inf := λ (S : set (convex_cone 𝕜 E)), {carrier := ⋂ (s : convex_cone 𝕜 E) (H : s ∈ S), ↑s, smul_mem' := _, add_mem' := _}}
Equations
- convex_cone.complete_lattice 𝕜 = {sup := λ (a b : convex_cone 𝕜 E), has_Inf.Inf {x : convex_cone 𝕜 E | a ≤ x ∧ b ≤ x}, le := has_le.le (preorder.to_has_le (convex_cone 𝕜 E)), lt := has_lt.lt (preorder.to_has_lt (convex_cone 𝕜 E)), le_refl := _, le_trans := _, lt_iff_le_not_le := _, le_antisymm := _, le_sup_left := _, le_sup_right := _, sup_le := _, inf := has_inf.inf convex_cone.has_inf, inf_le_left := _, inf_le_right := _, le_inf := _, Sup := λ (s : set (convex_cone 𝕜 E)), has_Inf.Inf {T : convex_cone 𝕜 E | ∀ (S : convex_cone 𝕜 E), S ∈ s → S ≤ T}, le_Sup := _, Sup_le := _, Inf := has_Inf.Inf convex_cone.has_Inf, Inf_le := _, le_Inf := _, top := ⊤, bot := ⊥, le_top := _, bot_le := _}
Equations
- convex_cone.inhabited 𝕜 = {default := ⊥}
The image of a convex cone under a 𝕜
-linear map is a convex cone.
The preimage of a convex cone under a 𝕜
-linear map is a convex cone.
Constructs an ordered module given an ordered_add_comm_group
, a cone, and a proof that
the order relation is the one defined by the cone.
Convex cones with extra properties #
A convex cone is pointed if it includes 0
.
A convex cone is blunt if it doesn't include 0
.
A convex cone is flat if it contains some nonzero vector x
and its opposite -x
.
A convex cone is salient if it doesn't include x
and -x
for any nonzero x
.
A flat cone is always pointed (contains 0
).
A blunt cone (one not containing 0
) is always salient.
A pointed convex cone defines a preorder.
A pointed and salient cone defines a partial order.
Equations
- S.to_partial_order h₁ h₂ = {le := preorder.le (S.to_preorder h₁), lt := preorder.lt (S.to_preorder h₁), le_refl := _, le_trans := _, lt_iff_le_not_le := _, le_antisymm := _}
A pointed and salient cone defines an ordered_add_comm_group
.
Equations
- S.to_ordered_add_comm_group h₁ h₂ = {add := add_comm_group.add (show add_comm_group E, from _inst_2), add_assoc := _, zero := add_comm_group.zero (show add_comm_group E, from _inst_2), zero_add := _, add_zero := _, nsmul := add_comm_group.nsmul (show add_comm_group E, from _inst_2), nsmul_zero' := _, nsmul_succ' := _, neg := add_comm_group.neg (show add_comm_group E, from _inst_2), sub := add_comm_group.sub (show add_comm_group E, from _inst_2), sub_eq_add_neg := _, zsmul := add_comm_group.zsmul (show add_comm_group E, from _inst_2), zsmul_zero' := _, zsmul_succ' := _, zsmul_neg' := _, add_left_neg := _, add_comm := _, le := partial_order.le (S.to_partial_order h₁ h₂), lt := partial_order.lt (S.to_partial_order h₁ h₂), le_refl := _, le_trans := _, lt_iff_le_not_le := _, le_antisymm := _, add_le_add_left := _}
Equations
- convex_cone.add_zero_class = {zero := 0, add := has_add.add convex_cone.has_add, zero_add := _, add_zero := _}
Equations
- convex_cone.add_comm_semigroup = {add := has_add.add convex_cone.has_add, add_assoc := _, add_comm := _}
Submodules are cones #
Every submodule is trivially a convex cone.
Positive cone of an ordered module #
The positive cone is the convex cone formed by the set of nonnegative elements in an ordered module.
The positive cone of an ordered module is always salient.
The positive cone of an ordered module is always pointed.
The cone of strictly positive elements.
Note that this naming diverges from the mathlib convention of pos
and nonneg
due to "positive
cone" (convex_cone.positive
) being established terminology for the non-negative elements.
The strictly positive cone of an ordered module is always salient.
The strictly positive cone of an ordered module is always blunt.
Cone over a convex set #
The set of vectors proportional to those in a convex set forms a convex cone.
hs.to_cone s
is the least cone that includes s
.
M. Riesz extension theorem #
Given a convex cone s
in a vector space E
, a submodule p
, and a linear f : p → ℝ
, assume
that f
is nonnegative on p ∩ s
and p + s = E
. Then there exists a globally defined linear
function g : E → ℝ
that agrees with f
on p
, and is nonnegative on s
.
We prove this theorem using Zorn's lemma. riesz_extension.step
is the main part of the proof.
It says that if the domain p
of f
is not the whole space, then f
can be extended to a larger
subspace p ⊔ span ℝ {y}
without breaking the non-negativity condition.
In riesz_extension.exists_top
we use Zorn's lemma to prove that we can extend f
to a linear map g
on ⊤ : submodule E
. Mathematically this is the same as a linear map on E
but in Lean ⊤ : submodule E
is isomorphic but is not equal to E
. In riesz_extension
we use this isomorphism to prove the theorem.
Induction step in M. Riesz extension theorem. Given a convex cone s
in a vector space E
,
a partially defined linear map f : f.domain → ℝ
, assume that f
is nonnegative on f.domain ∩ p
and p + s = E
. If f
is not defined on the whole E
, then we can extend it to a larger
submodule without breaking the non-negativity condition.
M. Riesz extension theorem: given a convex cone s
in a vector space E
, a submodule p
,
and a linear f : p → ℝ
, assume that f
is nonnegative on p ∩ s
and p + s = E
. Then
there exists a globally defined linear function g : E → ℝ
that agrees with f
on p
,
and is nonnegative on s
.
Hahn-Banach theorem: if N : E → ℝ
is a sublinear map, f
is a linear map
defined on a subspace of E
, and f x ≤ N x
for all x
in the domain of f
,
then f
can be extended to the whole space to a linear map g
such that g x ≤ N x
for all x
.