James' weak compactness theorem: an exposition

Warren B. Moors; Samuel White

arXiv:1705.06406·math.FA·May 19, 2017

James' weak compactness theorem: an exposition

Warren B. Moors, Samuel White

PDF

Open Access

TL;DR

This paper presents an accessible proof of James' weak compactness theorem suitable for first-year graduate students in functional analysis.

Contribution

It offers a simplified, pedagogically friendly exposition of James' theorem tailored for educational use.

Findings

01

Clear proof of James' weak compactness theorem

02

Educational approach suitable for graduate teaching

03

Simplifies understanding of weak compactness in Banach spaces

Abstract

The purpose of this paper is to provide a proof of James' weak compactness theorem that is able to be taught in a first year graduate class in functional analysis.

Equations374

V_{1} \cap V_{2} = f^{- 1} (U_{1}) \cap f^{- 1} (U_{2}) = f^{- 1} (U_{1} \cap U_{2}) .

V_{1} \cap V_{2} = f^{- 1} (U_{1}) \cap f^{- 1} (U_{2}) = f^{- 1} (U_{1} \cap U_{2}) .

\mbox{$\bigcup_{\alpha\in A}$}V_{\alpha}=\mbox{$\bigcup_{\alpha\in A}$}f^{-1}(U_{\alpha})=f^{-1}\left(\mbox{$\bigcup_{\alpha\in A}$}U_{\alpha}\right)\!.

\mbox{$\bigcup_{\alpha\in A}$}V_{\alpha}=\mbox{$\bigcup_{\alpha\in A}$}f^{-1}(U_{\alpha})=f^{-1}\left(\mbox{$\bigcup_{\alpha\in A}$}U_{\alpha}\right)\!.

\mathcal{B}:=\left\{\mbox{$\bigcap_{1\leq k\leq n}$}f_{k}^{-1}(U_{k}):n\in{\mathbb{N}},U_{k}\in\tau_{Y}\mbox{ and }f_{k}\in\mathcal{F}\right\}

\mathcal{B}:=\left\{\mbox{$\bigcap_{1\leq k\leq n}$}f_{k}^{-1}(U_{k}):n\in{\mathbb{N}},U_{k}\in\tau_{Y}\mbox{ and }f_{k}\in\mathcal{F}\right\}

V_{1}\cap V_{2}=\mbox{$\bigcap_{1\leq k\leq{n_{1}}}$}(f^{\prime}_{k})^{-1}(U^{\prime}_{k})\cap\mbox{$\bigcap_{1\leq k\leq{n_{2}}}$}(f^{\prime\prime}_{k})^{-1}(U^{\prime\prime}_{k})=\mbox{$\bigcap_{1\leq k\leq{n}}$}f_{k}^{-1}(U_{k})\in\mathcal{B}.

V_{1}\cap V_{2}=\mbox{$\bigcap_{1\leq k\leq{n_{1}}}$}(f^{\prime}_{k})^{-1}(U^{\prime}_{k})\cap\mbox{$\bigcap_{1\leq k\leq{n_{2}}}$}(f^{\prime\prime}_{k})^{-1}(U^{\prime\prime}_{k})=\mbox{$\bigcap_{1\leq k\leq{n}}$}f_{k}^{-1}(U_{k})\in\mathcal{B}.

V_{1}\cap V_{2}=(\mbox{$\bigcup_{i\in I_{1}}$}B_{i})\cap(\mbox{$\bigcup_{i\in I_{2}}$}B_{i})=\mbox{$\bigcup_{(i,j)\in I}$}B_{i}\cap B_{j}\in\tau_{X}\mbox{\quad since, $B_{i}\cap B_{j}\in\mathcal{B}$}

V_{1}\cap V_{2}=(\mbox{$\bigcup_{i\in I_{1}}$}B_{i})\cap(\mbox{$\bigcup_{i\in I_{2}}$}B_{i})=\mbox{$\bigcup_{(i,j)\in I}$}B_{i}\cap B_{j}\in\tau_{X}\mbox{\quad since, $B_{i}\cap B_{j}\in\mathcal{B}$}

N_{X}(x_{0},F,\varepsilon):=\mbox{$\bigcap_{f\in F}$}\{x\in X:|f(x)-f(x_{0})|<\varepsilon\}.

N_{X}(x_{0},F,\varepsilon):=\mbox{$\bigcap_{f\in F}$}\{x\in X:|f(x)-f(x_{0})|<\varepsilon\}.

x_{0} \in N (x_{0}, F, ε) \subseteq N (x_{0}, F_{U}, ε_{U}) \cap N (x_{0}, F_{V}, ε_{V}) \subseteq U \cap V .

x_{0} \in N (x_{0}, F, ε) \subseteq N (x_{0}, F_{U}, ε_{U}) \cap N (x_{0}, F_{V}, ε_{V}) \subseteq U \cap V .

N(x,F,\varepsilon)=\mbox{$\bigcap_{f\in F}$}\{x\in X:|f(x)-f(x_{0})|<\varepsilon\}=\mbox{$\bigcap_{1\leq k\leq n}$}f_{k}^{-1}(U_{k}).\mbox{\quad\quad$(*)$}

N(x,F,\varepsilon)=\mbox{$\bigcap_{f\in F}$}\{x\in X:|f(x)-f(x_{0})|<\varepsilon\}=\mbox{$\bigcap_{1\leq k\leq n}$}f_{k}^{-1}(U_{k}).\mbox{\quad\quad$(*)$}

f (N (x_{0}, {f}, ε)) \subseteq (f (x_{0}) - ε, f (x_{0}) + ε) .

f (N (x_{0}, {f}, ε)) \subseteq (f (x_{0}) - ε, f (x_{0}) + ε) .

∣ f (S (x^{'}, y^{'})) - f (S (x, y)) ∣

∣ f (S (x^{'}, y^{'})) - f (S (x, y)) ∣

ε_{1} := \frac{ε}{2 ( ∣ f ( x ) ∣ + 1 )} \mbox an d ε_{2} := \frac{ε}{2 ( ∣ r ∣ + 1 )} .

ε_{1} := \frac{ε}{2 ( ∣ f ( x ) ∣ + 1 )} \mbox an d ε_{2} := \frac{ε}{2 ( ∣ r ∣ + 1 )} .

∣ f (M (r^{'}, x^{'})) - f (M (r, x)) ∣

∣ f (M (r^{'}, x^{'})) - f (M (r, x)) ∣

T (x) := (f_{1} (x), \dots, f_{n} (x)) for all x \in V .

T (x) := (f_{1} (x), \dots, f_{n} (x)) for all x \in V .

x_{k}\in\mbox{$\bigcap$}\{\ker(f_{i}):1\leq i\leq n,\ i\neq k\}\setminus\mbox{$\bigcap_{i=1}^{n}$}\ker(f_{i}).

x_{k}\in\mbox{$\bigcap$}\{\ker(f_{i}):1\leq i\leq n,\ i\neq k\}\setminus\mbox{$\bigcap_{i=1}^{n}$}\ker(f_{i}).

g^{*} (x) := g (z) for any z \in T^{- 1} (x) .

g^{*} (x) := g (z) for any z \in T^{- 1} (x) .

g = g^{*} \circ T = (i = 1 \sum n c_{i} e_{i}^{*}) \circ T = i = 1 \sum n c_{i} (e_{i}^{*} \circ T) = i = 1 \sum n c_{i} f_{i}

g = g^{*} \circ T = (i = 1 \sum n c_{i} e_{i}^{*}) \circ T = i = 1 \sum n c_{i} (e_{i}^{*} \circ T) = i = 1 \sum n c_{i} f_{i}

∣ y_{k}^{*} (y) - y_{k}^{*} (T (x_{0})) ∣

∣ y_{k}^{*} (y) - y_{k}^{*} (T (x_{0})) ∣

x_{0} \in N_{X} (x_{0}, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \subseteq T^{- 1} (W) .

x_{0} \in N_{X} (x_{0}, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \subseteq T^{- 1} (W) .

f (λ_{1}^{- 1} m_{1} + λ_{2}^{- 1} m_{2}) \leq p (λ_{1}^{- 1} m_{1} + λ_{2}^{- 1} m_{2}) \leq p (λ_{1}^{- 1} m_{1} - x_{0}) + p (λ_{2}^{- 1} m_{2} + x_{0}) .

f (λ_{1}^{- 1} m_{1} + λ_{2}^{- 1} m_{2}) \leq p (λ_{1}^{- 1} m_{1} + λ_{2}^{- 1} m_{2}) \leq p (λ_{1}^{- 1} m_{1} - x_{0}) + p (λ_{2}^{- 1} m_{2} + x_{0}) .

f (λ_{1}^{- 1} m_{1}) - p (λ_{1}^{- 1} m_{1} - x_{0}) \leq p (λ_{2}^{- 1} m_{2} + x_{0}) - f (λ_{2}^{- 1} m_{2})

f (λ_{1}^{- 1} m_{1}) - p (λ_{1}^{- 1} m_{1} - x_{0}) \leq p (λ_{2}^{- 1} m_{2} + x_{0}) - f (λ_{2}^{- 1} m_{2})

0 < λ < \infty m \in M sup (f (λ^{- 1} m) - p (λ^{- 1} m - x_{0})) \leq p (λ_{2}^{- 1} m_{2} + x_{0}) - f (λ_{2}^{- 1} m_{2}) .

0 < λ < \infty m \in M sup (f (λ^{- 1} m) - p (λ^{- 1} m - x_{0})) \leq p (λ_{2}^{- 1} m_{2} + x_{0}) - f (λ_{2}^{- 1} m_{2}) .

a := 0 < λ < \infty m \in M sup (f (λ^{- 1} m) - p (λ^{- 1} m - x_{0})) \leq 0 < λ < \infty m \in M in f (p (λ^{- 1} m + x_{0}) - f (λ^{- 1} m)) =: b .

a := 0 < λ < \infty m \in M sup (f (λ^{- 1} m) - p (λ^{- 1} m - x_{0})) \leq 0 < λ < \infty m \in M in f (p (λ^{- 1} m + x_{0}) - f (λ^{- 1} m)) =: b .

f(m)+(-\lambda)\alpha^{*}\leq p(m+(-\lambda)x_{0})\mbox{\quad for all $m\in M$ and $0<\lambda<\infty$.}

f(m)+(-\lambda)\alpha^{*}\leq p(m+(-\lambda)x_{0})\mbox{\quad for all $m\in M$ and $0<\lambda<\infty$.}

f(m)+\lambda\alpha^{*}\leq p(m+\lambda x_{0})\mbox{\quad for all $m\in M$ and $0<\lambda<\infty$.}

f(m)+\lambda\alpha^{*}\leq p(m+\lambda x_{0})\mbox{\quad for all $m\in M$ and $0<\lambda<\infty$.}

F^{*}(x):=F_{\alpha^{*}}(x)\leq p(x)\mbox{\quad for all $x\in M^{*}$.}

F^{*}(x):=F_{\alpha^{*}}(x)\leq p(x)\mbox{\quad for all $x\in M^{*}$.}

N_{Y} (y, {y_{1}^{*}, y_{2}^{*}, \dots, y_{n}^{*}}, ε) = N_{X} (y, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \cap Y \subseteq U .

N_{Y} (y, {y_{1}^{*}, y_{2}^{*}, \dots, y_{n}^{*}}, ε) = N_{X} (y, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \cap Y \subseteq U .

N_{X} (y, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \cap Y = N_{Y} (y, {y_{1}^{*}, y_{2}^{*}, \dots, y_{n}^{*}}, ε) \subseteq U .

N_{X} (y, {x_{1}^{*}, x_{2}^{*}, \dots, x_{n}^{*}}, ε) \cap Y = N_{Y} (y, {y_{1}^{*}, y_{2}^{*}, \dots, y_{n}^{*}}, ε) \subseteq U .

μ_{C} (x) := in f {λ > 0 : x \in λ C}

μ_{C} (x) := in f {λ > 0 : x \in λ C}

{x \in V : μ_{C} (x) < 1} \subseteq C \subseteq {x \in V : μ_{C} (x) \leq 1} .

{x \in V : μ_{C} (x) < 1} \subseteq C \subseteq {x \in V : μ_{C} (x) \leq 1} .

s_{0} c = \frac{s _{0}}{s} (sc) + (1 - \frac{s _{0}}{s}) 0 \in s C .

s_{0} c = \frac{s _{0}}{s} (sc) + (1 - \frac{s _{0}}{s}) 0 \in s C .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Topology and Set Theory · Functional Equations Stability Results · Advanced Banach Space Theory

Full text

James’ weak compactness theorem: an exposition

Warren B. Moors111Corresponding author. See back page for details and Samuel J. White

Abstract. The purpose of this paper is to provide a proof of James’ weak compactness theorem that is able to be taught in a first year graduate class in functional analysis.

AMS (2010) subject classification: Primary 46B20; Secondary 46B10, 46B50.

Keywords: James theorem, weak compactness, Banach space, Linear topology.

1 Introduction

The purpose of this paper is to provide a proof of James’ weak compactness theorem that is able to be taught in a first year graduate class in functional analysis. Usually when one teaches a first course in functional analysis one teaches the basic finite dimensional material, Hilbert space material, the open mapping theorem, the closed graph theorem, the uniform boundedness theorem and the Hahn-Banach theorem, plus applications. Then one might consider the spectral theory of compact normal operators, or even an introduction to $C^{*}$ -algebras. However, what is often neglected is the study of linear topology, which then makes it difficult to even start to contemplate how one might prove James’ theorem on weak compactness. So, what we propose here is a way of presenting James’ theorem on weak compactness to an audience unfamiliar with linear topology, or anything other than, the most basic facts concerning normed linear spaces.

For the authors, James’ theorem on weak compactness is one of the true delights of functional analysis. Its proof is a beautiful synthesis of linear algebra and topology. The one down-side of this theorem is that its proof has an unfortunate reputation of being very difficult. We hope, among other things, to dispel this myth.

We shall start with a brief history of this problem. Back in 1933 (see, [29]) S. Mazur conjectured that a Banach space $(X,\|\cdot\|)$ , over the real numbers, is reflexive if, and only if, every continuous linear functional defined on $X$ attains its maximum value on the closed unit ball of $X$ . In 1957 (see, [18]), R. James confirmed this conjecture for separable Banach spaces, i.e., those spaces that contain a countable dense subset. Later, in 1963 (see, [19]), R. James completely confirmed the conjecture for arbitrary Banach spaces. One year after this, in [20], R. James extended this result to show that a closed and bounded convex subset $C$ of a Banach space $X$ is weakly compact if, and only if, every continuous linear functional defined on $X$ attains its maximum value over $C$ . The fact that this result does not extend to non-complete normed linear spaces was established in [16], again by James. Almost immediately, even in 1965 (see, [40]), there was a search for a simpler proof of James’ weak compactness theorem. The proof in [40] is indeed very clear and easy to read, and is in fact the basis of a lot of the work in this paper. However, [40] still contains a series of seven technical lemmas. In 1972, R. James (see, [17]) provided a simpler proof of his own weak compactness theorem, and in [43], S. Simons, using an inequality that now bears his name, proved the weak compactness theorem for separable Banach spaces. Since these early results there have been many attempts at providing a simple proof of James’ theorem. Most of these require additional assumptions on the space. One approach which is quite appealing is that of $(I)$ -generation. This first appeared in [10] and then again in [11], but it has since been shown (see, [23]) that this approach is essentially equivalent to the approach of S. Simons from 1972. In addition to the already mentioned papers the interested reader may also want to see the papers [3, 14, 35, 24, 33, 34], where several “simple” proofs of James’ theorem are given. The paper [3] also has some interesting applications and historical facts.

We now return to the mathematics. In this paper all vector spaces and all normed linear spaces will be over the field of real numbers. The key concept, which runs throughout this paper, is the notion of a convex set. A subset $C$ of a vector space $(V,+,\cdot)$ , over the real numbers, is called convex if, for every pair of points $x,y\in C$ and $0<\lambda<1$ , we have $\lambda x+(1-\lambda)y\in C$ . We encourage the reader to follow the role that this concept plays throughout the rest of this paper.

The structure of the reminder of this paper is as follows: Section 2 contains the necessary background material. In particular, it contains Subsections; 2.1 on Weak topologies, 2.2 on Linear topology, 2.3 on the Hahn-Banach Theorem, 2.4 on the Weak∗ topology. Readers with a background in linear topology may wish to skip this section. Section 3 contains three proofs of James’ theorem given in three subsections; 3.1 on James’ theorem for separable Banach spaces; 3.2 on James’ theorem for spaces with a weak∗ sequentially compact dual ball, 3.3 the general version of James’ theorem. In Subsection 3.4 some applications of James’ theorem are given. In Section 4 a generalisation of James’ theorem is given. To achieve this, this section contains Subsection 4.1 which gives the necessary background in convex analysis, then in Subsection 4.2 the necessary set-valued analysis is given. In Subsection 4.3 the generalisation of James’ theorem is presented. Finally, in Section 5, we give a variational principle that is based upon the generalised version of James’ theorem. The paper ends with an index of notation and assumed knowledge and a bibliography.

2 Preliminaries

In this section of the paper we will present the necessary background material that is required in order to prove James’ theorem on weak compactness of closed and bounded convex subsets of a given Banach space.

2.1 Weak topologies on sets

An important part of general topology concerns the generation of topologies on a given set. In this subsection we will show how to construct topologies that make a given function (set of functions) continuous.

Proposition 2.1.

Let $f:X\to Y$ be a function between sets $X$ and $Y$ . If $\tau_{Y}$ is a topology on $Y$ then $\tau_{X}:=\{f^{-1}(U):U\in\tau_{Y}\}$ is a topology on $X$ and $f:(X,\tau_{X})\to(Y,\tau_{Y})$ is continuous. Furthermore, if $\tau$ is any topology on $X$ such that $f:(X,\tau)\to(X,\tau_{Y})$ is continuous then $\tau_{X}\subseteq\tau$ . This is, $\tau_{X}$ is the weakest topology on $X$ that makes $f$ continuous (when $Y$ is endowed with the topology $\tau_{Y}$ ).

Proof.

First we will show that $\tau_{X}$ is a topology on $X$ . Now, $\varnothing\in\tau_{X}$ since $\varnothing=f^{-1}(\varnothing)$ and $\varnothing\in\tau_{Y}$ . Similarly, $X\in\tau_{X}$ since $X=f^{-1}(Y)$ and $Y\in\tau_{Y}$ . Next, suppose that $V_{1}\in\tau_{X}$ and $V_{2}\in\tau_{2}$ . Then, by the definition of $\tau_{X}$ , there exists $U_{1}\in\tau_{Y}$ and $U_{2}\in\tau_{Y}$ such that $V_{1}=f^{-1}(U_{1})$ and $V_{2}=f^{-1}(U_{2})$ . Therefore,

[TABLE]

Since $\tau_{Y}$ is a topology on $Y$ , $U_{1}\cap U_{2}\in\tau_{Y}$ . Hence, $V_{1}\cap V_{2}\in\tau_{X}$ . Finally, to show that $\tau_{X}$ is a topology on $X$ , suppose that $\{V_{\alpha}:\alpha\in A\}\subseteq\tau_{X}$ . Then, by the definition of $\tau_{X}$ , there exist $\{U_{\alpha}:\alpha\in A\}\subseteq\tau_{Y}$ such that $V_{\alpha}=f^{-1}(U_{\alpha})$ for each $\alpha\in A$ . Therefore,

[TABLE]

Since $\tau_{Y}$ is a topology on $Y$ , $\bigcup_{\alpha\in A}U_{\alpha}\in\tau_{Y}$ . Hence, $\bigcup_{\alpha\in A}V_{\alpha}\in\tau_{X}$ . Thus, $\tau_{X}$ is a topology on $X$ . To show that $f:(X,\tau_{X})\to(Y,\tau_{Y})$ is continuous we consider the following. Let $U\in\tau_{Y}$ . Then $f^{-1}(U)\in\tau_{X}$ , by the definition of $\tau_{X}$ . Therefore, by the definition of continuity, $f:(X,\tau_{X})\to(Y,\tau_{Y})$ is continuous. For our last step of the proof, we will show that $\tau_{X}$ is the weakest topology on $X$ that makes $f$ continuous. To this end, let $\tau$ be any topology on $X$ such that $f:(X,\tau)\to(Y,\tau_{Y})$ is continuous. Let $V\in\tau_{X}$ . Then, by the definition of $\tau_{X}$ , there exists an $U\in\tau_{Y}$ such that $V=f^{-1}(U)$ . Since we are assuming that $f:(X,\tau)\to(Y,\tau_{Y})$ is continuous, $V=f^{-1}(U)\in\tau$ . Thus, $\tau_{X}\subseteq\tau$ . This completes the proof. $\Box$

The topology $\tau_{X}$ in Proposition 2.1 is called the weak topology on $X$ generated by $f$ and $\tau_{Y}$ , or more briefly, when the context is clear, the weak topology on $X$ .

When we have more than one function we still have the following result.

Proposition 2.2.

Let $X$ and $Y$ be sets and let $\tau_{Y}$ be a topology on $Y$ . If $\mathcal{F}$ is a nonempty family of functions from $X$ into $Y$ then

[TABLE]

is a base for a topology $\tau_{X}$ on $X$ . Furthermore, the topology $\tau_{X}$ is the weakest topology on $X$ that make each $f\in\mathcal{F}$ continuous, when $Y$ is endowed with the topology $\tau_{Y}$ .

Proof.

Firstly, it is easy to see that $\varnothing$ and $X$ are members of $\mathcal{B}$ . Indeed, since $\mathcal{F}\not=\varnothing$ we may take a function $f\in\mathcal{F}$ . Then $\varnothing=f^{-1}(\varnothing)$ and so $\varnothing\in\mathcal{B}$ since $\varnothing\in\tau_{Y}$ . Similarly, $X=f^{-1}(Y)$ and so $X\in\mathcal{B}$ since $Y\in\tau_{Y}$ . Next, let us observe that $\mathcal{B}$ is closed under taking finite intersections. Suppose $V_{1}\in\mathcal{B}$ and $V_{2}\in\mathcal{B}$ . Then there exists $n_{1}\in{\mathbb{N}}$ , $f^{\prime}_{k}\in\mathcal{F}$ and $U_{k}^{\prime}\in\tau_{Y}$ for each $1\leq k\leq n_{1}$ such that $V_{1}=\bigcap_{1\leq k\leq{n_{1}}}(f^{\prime}_{k})^{-1}(U^{\prime}_{k})$ . Similarly, there exists $n_{2}\in{\mathbb{N}}$ , $f^{\prime\prime}_{k}\in\mathcal{F}$ and $U_{k}^{\prime\prime}\in\tau_{Y}$ for each $1\leq k\leq n_{2}$ such that $V_{2}=\bigcap_{1\leq k\leq{n_{2}}}(f^{\prime\prime}_{k})^{-1}(U^{\prime\prime}_{k})$ . Let $n:=n_{1}+n_{2}$ and for each $1\leq k\leq n_{1}$ let $U_{k}:=U^{\prime}_{k}$ and for each $1\leq k\leq n_{2}$ let $U_{{n_{1}}+k}:=U^{\prime\prime}_{k}$ . For each $1\leq k\leq n_{1}$ let $f_{k}:=f^{\prime}_{k}$ and for each $1\leq k\leq n_{2}$ let $f_{{n_{1}}+k}:=f^{\prime\prime}_{k}$ . Then,

[TABLE]

We now define $\tau_{X}$ to be the set of all subsets of $X$ that can be expressed as a union of members of $\mathcal{B}$ . From above we see that $\varnothing$ and $X$ are members of $\tau_{X}$ , and $\tau_{X}$ is closed under taking finite intersections. For the details of this last claim consider the following. Let $V_{1}\in\tau_{X}$ and $V_{2}\in\tau_{X}$ . Then there exist disjoint sets $I_{1}$ and $I_{2}$ such that $V_{1}=\bigcup_{i\in I_{1}}B_{i}$ for some $B_{i}\in\mathcal{B}$ and $V_{2}=\bigcup_{i\in I_{2}}B_{i}$ for some $B_{i}\in\mathcal{B}$ . Let $I:=I_{1}\times I_{2}$ then

[TABLE]

So it remains to show that $\tau_{X}$ is closed under arbitrary unions. Suppose that $\{U_{i}:i\in I\}\subseteq\tau_{X}$ . Then for each $i\in I$ , there exist disjoint sets $J_{i}$ such that $U_{i}=\bigcup_{j\in J_{i}}B_{j}$ , where $B_{j}\in\mathcal{B}$ . Let $J:=\bigcup_{i\in I}J_{i}$ . Then $\bigcup_{I\in I}U_{i}=\bigcup_{j\in J}B_{j}\in\tau_{X}$ . We now show that $\tau_{X}$ is the weakest topology on $X$ the makes each function in $\mathcal{F}$ continuous. So suppose that $\tau$ is a topology on $X$ that makes each function in $\mathcal{F}$ continuous. Then clearly $\mathcal{B}\subseteq\tau$ since $f^{-1}(U)\in\tau$ for each $f\in\mathcal{F}$ and each $U\in\tau_{Y}$ . Since $\tau_{X}$ is the smallest topology on $X$ that contains $\mathcal{B}$ we must have that $\tau_{X}\subseteq\tau$ . $\Box$

The topology $\tau_{X}$ in Proposition 2.2 is call the weak topology on $X$ generated by $\mathcal{F}$ and $\tau_{Y}$ , or more briefly, when the context is clear, the weak topology on $X$ . For further information on general topology see [9, 25].

2.2 Linear topologies

Let $(V,+,\cdot)$ be a vector space over the field of real numbers and let $\tau$ be a topology on $V$ . Then $(V,+,\cdot,\tau)$ is called a linear topological space or a topological vector space if vector addition from $V\times V$ into $V$ is continuous, when $V\times V$ is considered with the product topology and scalar multiplication from ${\mathbb{R}}\times V$ into $V$ is continuous, again when we consider ${\mathbb{R}}\times V$ with the product topology and ${\mathbb{R}}$ with the usual topology.

An important feature of linear topological spaces is that they are always regular. That is, if $(X,+,\cdot,\tau)$ is linear topological space, $C$ is a closed subset of $X$ and $x\in X\setminus C$ then there exist disjoint open sets $U$ and $V$ such that $x\in U$ and $C\subseteq V$ . To see this, suppose that $x=x+0\in X\setminus C$ ; which is open. Therefore, from the continuity of addition, there exist open neighbourhoods $U$ of $x$ and $W$ of [math] such that $U+W\subseteq X\setminus C$ , i.e,. $(U+W)\cap C=\varnothing$ . Therefore, $U\cap(C+(-W))=\varnothing$ . Let $V:=C+(-W)=\bigcup_{c\in C}c-W$ . Then $V$ is an open set containing the set $C$ and $U\cap V=\varnothing$ . Thus, $(X,\tau)$ is a regular topological space.

Let $(V,+,\cdot,\tau)$ be a linear topological space over ${\mathbb{R}}$ . We shall say that $(V,+,\cdot,\tau)$ is a locally convex space if for each open set $U$ in $V$ , containing [math], there exists an open convex set $W$ such that $0\in W\subseteq U$ , or, equivalently, $(V,\tau)$ has a local base consisting of open convex sets.

If $(X,\|\cdot\|)$ is a normed linear space and $\varnothing\not=\mathcal{F}\subseteq X^{*}$ - the set of all continuous linear functionals on $X$ , then $\sigma(\mathcal{F},X)$ denotes the weak topology on $X$ generated by $\mathcal{F}$ . We shall simply call $\sigma(X^{*},X)$ the weak topology on $X$ and write $(X,\mathrm{weak})$ for $(X,\sigma(X^{*},X))$ .

Sometimes it is convenient to work with a more concrete representation of the $\sigma(\mathcal{F},X)$ -topology. Fortunately such a representation exists and furthermore, the representation is very similar to the way in which open sets are defined in metric spaces. Let $(X,\|\cdot\|)$ be a normed linear space, let $x_{0}\in X$ , $\varepsilon>0$ and let $F$ be a nonempty finite subset of $X^{*}$ . Then

[TABLE]

Note: sometimes it is also convenient to write $N_{X}(x_{0},f_{1},f_{2},\ldots,f_{n},\varepsilon)$ when the finite set $F$ is enumerated as $F:=\{f_{1},f_{2},\ldots,f_{n}\}$ . When the context is clear we simply write, $N(x_{0},F,\varepsilon)$ or $N(x_{0},f_{1},f_{2},\ldots,f_{n},\varepsilon)$ .

Given a nonempty subset $\mathcal{F}$ of $X^{*}$ we shall say that a subset $U$ of $X$ is $\mathcal{F}$ -open if for every $x_{0}\in U$ there exists a nonempty finite subset $F$ of $\mathcal{F}$ and an $\varepsilon>0$ such that $N(x_{0},F,\varepsilon)\subseteq U$ .

Proposition 2.3.

If $(X,\|\cdot\|)$ is a normed linear space and $\varnothing\not=\mathcal{F}\subseteq X^{*}$ then the set of all $\mathcal{F}$ -open sets forms a topology on $X$ . Furthermore, the set of all $\mathcal{F}$ -open sets coincides with the $\sigma(\mathcal{F},X)$ -topology on $X$ .

Proof.

First we will show that the set of all $\mathcal{F}$ -open sets forms a topology on $X$ . It is easy to see that vacuously, $\varnothing$ is $\mathcal{F}$ -open. To see that $X$ is $\mathcal{F}$ -open, consider any element $x_{0}\in X$ . Now, let $x^{*}$ be any element in $\mathcal{F}$ . Then $N(x_{0},\{x^{*}\},1)\subseteq X$ . Therefore, $X$ is $\mathcal{F}$ -open. Next, suppose that $U$ and $V$ are both $\mathcal{F}$ -open subsets of $X$ . We will show that $U\cap V$ is also $\mathcal{F}$ -open. To this end, let $x_{0}\in U\cap V$ . Then, since $x_{0}\in U$ , there exists a finite subset $F_{U}$ of $\mathcal{F}$ and an $\varepsilon_{U}>0$ such that $N(x_{0},F_{U},\varepsilon_{U})\subseteq U$ . Similarly, there exists a finite subset $F_{V}$ of $\mathcal{F}$ and an $\varepsilon_{V}>0$ such that $N(x_{0},F_{V},\varepsilon_{V})\subseteq V$ . Let $F:=F_{U}\cup F_{V}$ and $\varepsilon:=\min\{\varepsilon_{U},\varepsilon_{V}\}$ . Then,

[TABLE]

So it remains to show that an arbitrary union of $\mathcal{F}$ -open sets is again $\mathcal{F}$ -open. Let $\{U_{\alpha}:\alpha\in A\}$ be a family of $\mathcal{F}$ -open sets. Let $x_{0}$ be any element of $\bigcup_{\alpha\in A}U_{\alpha}$ . Then there exists an $\alpha_{0}\in A$ such that $x_{0}\in U_{\alpha_{0}}$ . Since $U_{\alpha_{0}}$ is $\mathcal{F}$ -open there exists a finite subset $F$ of $\mathcal{F}$ and an $\varepsilon>0$ such that $N(x_{0},F,\varepsilon)\subseteq U_{\alpha_{0}}$ . Now, $U_{\alpha_{0}}\subseteq\bigcup_{\alpha\in A}U_{\alpha}$ and so $N(x_{0},F,\varepsilon)\subseteq\bigcup_{\alpha\in A}U_{\alpha}$ . Therefore, $\bigcup_{\alpha\in A}U_{\alpha}$ is $\mathcal{F}$ -open. We will now show that the two topologies coincide. Suppose that $U$ is an $\mathcal{F}$ -open set. Then, for each $x\in U$ , there exists a finite subset $F_{x}$ of $\mathcal{F}$ and an $\varepsilon_{x}>0$ such that $x\in N(x,F_{x},\varepsilon_{x})\subseteq U$ . Therefore, $\bigcup_{x\in U}N(x,F_{x},\varepsilon_{x})=U$ . Thus, to show that $U$ is $\sigma(\mathcal{F},X)$ -open it is sufficient to show that every set of the form: $N(x,F,\varepsilon)$ is $\sigma(\mathcal{F},X)$ -open, where $x\in X$ , $F$ is a finite subset of $\mathcal{F}$ and $\varepsilon>0$ . So suppose that $x_{0}\in X$ , $F=\{f_{1},f_{2},\ldots,f_{n}\}\subseteq\mathcal{F}$ and $\varepsilon>0$ . Let $U_{k}:=(f_{k}(x_{0})-\varepsilon,f_{k}(x_{0})+\varepsilon)$ for each $1\leq k\leq n$ . Then,

[TABLE]

Therefore, by the definition of the $\sigma(\mathcal{F},X)$ -topology, $N(x,F,\varepsilon)$ is $\sigma(\mathcal{F},X)$ -open. To show that every $\sigma(\mathcal{F},X)$ -open set is $\mathcal{F}$ -open it is sufficient to show that each member of $\mathcal{F}$ is continuous with respect to the topology generated by the $\mathcal{F}$ -open sets. However, this is obvious from the definition of the $\mathcal{F}$ -open sets. If you want to see the details, then let $f\in\mathcal{F}$ , $x_{0}\in X$ and $\varepsilon>0$ . Then

[TABLE]

This completes the proof. $\Box$

Remark 2.4.

It follows from Proposition 2.3 and equation $(*)$ that for each $x\in X$ , finite set $\varnothing\not=F\subseteq\mathcal{F}$ and $\varepsilon>0$ , the set $N(x,F,\varepsilon)$ is $\sigma(\mathcal{F},X)$ -open in $X$ .

By using Proposition 2.3 and Remark 2.4, one can easily deduce the following result.

Proposition 2.5.

Let $Y$ be a subspace of a normed linear space $(X,\|\cdot\|)$ and let $\varnothing\not=\mathcal{F}\subseteq X^{*}$ . Then a subset $U$ of $Y$ is open in the relative $\sigma(\mathcal{F},X)$ -topology on $Y$ if, and only if, for each $y\in U$ there exists a finite subset $F$ of $\mathcal{F}$ and an $\varepsilon>0$ such that $N_{X}(y,F,\varepsilon)\cap Y\subseteq U$ .

Proposition 2.6.

If $(X,\|\cdot\|)$ is a normed linear space and $\mathcal{F}\subseteq X^{*}$ , then $(X,\sigma(\mathcal{F},X))$ is a locally convex topological space.

Proof.

Let us first show that $(X,\sigma(\mathcal{F},X))$ is a linear topology. Let $S:X\times X\to X$ be defined by, $S(x,y):=x+y$ . We need to show that $S$ is continuous. To this end, let $W$ be a $\sigma(\mathcal{F},X)$ -open subset of $X$ and let $(x,y)\in S^{-1}(W)$ , i.e., $S(x,y)\in W$ . By Proposition 2.3 there exists a finite subset $F$ of $\mathcal{F}$ and an $\varepsilon>0$ such that $N(S(x,y),F,\varepsilon)\subseteq W$ . We claim that $S(N(x,F,\varepsilon/2)\times N(y,F,\varepsilon/2))\subseteq N(S(x,y),F,\varepsilon)\subseteq W$ . To see this, let $(x^{\prime},y^{\prime})\in N(x,F,\varepsilon)\times N(y,F,\varepsilon)$ and let $f\in F$ . Then, $|f(x^{\prime})-f(x)|<\varepsilon/2$ and $|f(y^{\prime})-f(y)|<\varepsilon/2$ , and so

[TABLE]

Therefore, $S(x^{\prime},y^{\prime})\in N(S(x,y),F,\varepsilon)$ ; which proves the claim. Now since both $N(x,F,\varepsilon/2)$ and $N(y,F,\varepsilon/2)$ are $\sigma(\mathcal{F},X)$ -open we see that $S^{-1}(W)$ is open in $X\times X$ , with the product topology and so $S$ is continuous. Let $M:{\mathbb{R}}\times X\to X$ be defined by, $M(r,x):=rx$ . We need to show that $M$ is continuous. To this end, let $W$ be a $\sigma(\mathcal{F},X)$ -open subset of $X$ and let $(r,x)\in M^{-1}(W)$ , i.e., $M(r,x)\in W$ . By Proposition 2.3 there exists a finite subset $F$ of $\mathcal{F}$ and an $1>\varepsilon>0$ such that $N(M(r,x),F,\varepsilon)\subseteq W$ . Set

[TABLE]

We claim that $M((r-\varepsilon_{1},r+\varepsilon_{1})\times N(x,F,\varepsilon_{2}))\subseteq N(M(r,x),F,\varepsilon)\subseteq W$ . To see this is true, let

$(r^{\prime},x^{\prime})\in(r-\varepsilon_{1},r+\varepsilon_{1})\times N(x,F,\varepsilon_{2})$ and let $f\in F$ . Then, $|r^{\prime}-r|<\varepsilon_{1}$ and $|f(x^{\prime})-f(x)|<\varepsilon_{2}$ , and so

[TABLE]

Therefore, $M(r^{\prime},x^{\prime})\in N(M(r,x),F,\varepsilon)$ ; which proves the claim. Now since $(r-\varepsilon_{1},r+\varepsilon_{1})$ is open in ${\mathbb{R}}$ and $N(x,F,\varepsilon_{2})$ is $\sigma(\mathcal{F},X)$ -open we see that $M^{-1}(W)$ is open in ${\mathbb{R}}\times X$ , with the product topology and so $M$ is continuous. This shows that $(X,\sigma(\mathcal{F},X))$ is a linear topological space. To see that $(X,\sigma(\mathcal{F},X))$ is locally convex we merely appeal to Proposition 2.3 and the fact that for each finite subset $F$ of $\mathcal{F}$ and $\varepsilon>0$ , $N(0,F,\varepsilon)$ is a convex open neighbourhood of [math]. $\Box$

The beauty of linear topology lies in the interplay between linear algebra and topology. This is highlighted in Proposition 2.8, which is based upon the following result from linear algebra.

Lemma 2.7.

Let $V$ be a vector space over $\mathbb{R}$ and suppose that $(f_{i})_{i=1}^{n}$ are linear functionals on $V$ . If $g$ is a linear functional on $V$ such that $\bigcap_{i=1}^{n}\ker(f_{i})\subseteq\ker(g)$ , then $g\in\emph{span}\{f_{1},\dots,f_{n}\}$ .

Proof.

Define $T:V\rightarrow\mathbb{R}^{n}$ by

[TABLE]

Observe that $T$ is clearly linear and that $\ker(T)=\bigcap_{i=1}^{n}\ker(f_{i})$ . We may assume that $(f_{i})_{i=1}^{n}$ is a minimal (in terms of cardinality) family of functions such that $\bigcap_{i=1}^{n}\ker(f_{i})\subseteq\ker(g)$ , and from this we claim that $T$ is also surjective.

Fix $1\leq k\leq n$ . Then, by the minimality assumption on $n$ , we have that $\bigcap\{\ker(f_{i}):1\leq i\leq n,\ i\neq k\}\not\subseteq\ker(g)$ , and so in particular $\bigcap\{\ker(f_{i}):1\leq i\leq n,\ i\neq k\}\not\subseteq\bigcap_{i=1}^{n}\ker(f_{i})$ . Now we may choose

[TABLE]

Then, after scaling $x_{k}$ if necessary, we have that $f_{k}(x_{k})=1$ and $f_{i}(x_{k})=0$ for $i\neq k$ . Therefore, $T(x_{k})=e_{k}$ , where $e_{k}$ is the $k^{\textrm{th}}$ standard basis vector of $\mathbb{R}^{n}$ . Then, since $1\leq k\leq n$ was arbitrary, we have that $\mathbb{R}^{n}=\text{span}\{e_{1},\dots,e_{n}\}\subseteq T(V)$ and so $T$ is surjective as claimed.

Now define $g^{*}:\mathbb{R}^{n}\rightarrow\mathbb{R}$ by

[TABLE]

Then $g^{*}$ is well-defined. Indeed, let $x\in\mathbb{R}^{n}$ . Since $T$ is onto, we have that $T^{-1}(x)\neq\varnothing$ . So, suppose $z_{1},z_{2}\in T^{-1}(x)$ . Then $T(z_{1})=x=T(z_{2})$ and so $z_{1}-z_{2}\in\ker(T)\subseteq\ker(g)$ . Thus $g(z_{1})=g(z_{2})$ as required. Moreover, a routine calculation shows that $g^{*}$ is linear, so $g^{*}\in(\mathbb{R}^{n})^{*}$ . Since $(\mathbb{R}^{n})^{*}=\text{span}\{e_{1}^{*},\dots,e_{n}^{*}\}$ , there exist $(c_{i})_{i=1}^{n}$ such that $g^{*}=\sum_{i=1}^{n}c_{i}e_{i}^{*}$ , where here $e_{i}^{*}(e_{j})=\delta_{ij}$ , the $ij$ -Kroeneker delta.

Finally, note that $x\in T^{-1}(T(x))$ and so $g^{*}(T(x))=g(x)$ for all $x\in V$ . Therefore,

[TABLE]

since $e^{*}_{i}\circ T=f_{i}$ for all $1\leq i\leq n$ , and thus $g\in\text{span}\{f_{1},\dots,f_{n}\}$ . $\Box$

Proposition 2.8.

If $(X,\|\cdot\|)$ is a normed linear space and $\mathcal{F}\subseteq X^{*}$ , then the following are equivalent:

(i)

$x^{*}\in X^{*}$ * is $\sigma(\mathcal{F},X)$ -continuous;* 2. (ii)

$x^{*}\in X^{*}$ * is bounded on a $\sigma(\mathcal{F},X)$ neighbourhood of [math];* 3. (iii)

$x^{*}\in\mbox{span}(\mathcal{F})$ .

Proof.

$(i)\Longrightarrow(ii)$ . Suppose that $x^{*}\in X^{*}$ is $\sigma(\mathcal{F},X)$ -continuous on $X$ . Then, in particular, $x^{*}$ is continuous at $0\in X$ . Therefore, there exists a $\sigma(\mathcal{F},X)$ -open neighbourhood $N$ of [math] such that $|x^{*}(x)|=|x^{*}(x)-x^{*}(0)|<1$ for all $x\in N$ . Hence $x^{*}$ is bounded on $N$ . $(ii)\Longrightarrow(iii)$ . Suppose that $x^{*}$ is bounded on a $\sigma(\mathcal{F},X)$ -open neighbourhood $N$ of [math]. Then there exists a finite subset $G$ of $\mathcal{F}$ and an $0<\varepsilon$ such that $N(0,G,\varepsilon)\subseteq N$ . Let $S:=\bigcap_{y^{*}\in G}\mathrm{ker}(y^{*})$ . Then $S$ is a subspace of $X$ and furthermore, $S\subseteq N(0,G,\varepsilon)\subseteq N$ . Hence, $x^{*}|_{S}$ is bounded on $S$ . Thus, $x^{*}|_{S}\equiv 0$ and so $\bigcap_{y^{*}\in G}\mathrm{ker}(y^{*})=S\subseteq\mathrm{ker}(x^{*})$ . The result now follows from Lemma 2.7. $(iii)\Longrightarrow(i)$ . Suppose that $x^{*}=\sum_{k=1}^{n}\lambda_{k}y^{*}_{k}$ , where $\lambda_{k}\in{\mathbb{R}}$ and $y^{*}_{k}\in\mathcal{F}$ for all $1\leq k\leq n$ . Define $f:{\mathbb{R}}^{n}\to{\mathbb{R}}$ by, $f(x_{1},x_{2},\ldots,x_{n}):=\sum_{k=1}^{n}\lambda_{k}x_{k}$ and $F:X\to{\mathbb{R}}^{n}$ by, $F(x):=(y^{*}_{1}(x),y^{*}_{2}(x),\ldots,y^{*}_{n}(x))$ . Then $f$ is a continuous function on ${\mathbb{R}}^{n}$ and $F$ is a $\sigma(\mathcal{F},X)$ -continuous function on $X$ . Therefore, $x^{*}=f\circ F$ , is also a $\sigma(\mathcal{F},X)$ -continuous function on $X$ . $\Box$

Remark 2.9.

It follows from Proposition 2.8 that for any normed linear space $(X,\|\cdot\|)$ and any $\mathcal{F}\subseteq X^{*}$ , $\sigma(\mathcal{F},X)=\sigma(\mbox{span}(\mathcal{F}),X)$ . To see this, first note the general fact that if $\mathcal{F}\subseteq\mathcal{F}^{\prime}$ then $\sigma(\mathcal{F},X)\subseteq\sigma(\mathcal{F}^{\prime},X)$ (i.e., to make more functions continuous you need more open sets) and then the equivalence of (i) and (iii) above.

Proposition 2.10.

If $T:X\to Y$ is a continuous linear operator acting between normed linear spaces $(X\|\cdot\|_{X})$ and $(Y,\|\cdot\|_{Y})$ then $T:(X,\mathrm{weak})\to(Y,\mathrm{weak})$ is also continuous.

Proof.

Let $W$ be a weak open subset of $Y$ . We will show that $T^{-1}(W)$ is open in the weak topology on $X$ . To this end, let $x_{0}\in T^{-1}(W)$ . Then $T(x_{0})\in W$ and so by Proposition 2.3 there exist $\{y_{1}^{*},y_{2}^{*},\ldots,y_{n}^{*}\}\subseteq Y^{*}$ and $\varepsilon>0$ such that $N_{Y}(T(x_{0}),\{y_{1}^{*},y_{2}^{*},\ldots,y_{n}^{*}\},\varepsilon)\subseteq W$ . For each $1\leq k\leq n$ let $x_{k}^{*}:=y^{*}_{k}\circ T$ . Then $\{x_{1}^{*},x_{2}^{*},\ldots,x_{n}^{*}\}\subseteq X^{*}$ . We claim that $T(N_{X}(x_{0},\{x_{1}^{*},x_{2}^{*},\ldots,x_{n}^{*}\},\varepsilon))\subseteq N_{Y}(T(x_{0}),\{y_{1}^{*},y_{2}^{*},\ldots,y_{n}^{*}\},\varepsilon)\subseteq W$ . To see this, let $y\in T(N_{X}(x_{0},\{x_{1}^{*},x_{2}^{*},\ldots,x_{n}^{*}\},\varepsilon))$ . Then there exists a $x\in N_{X}(x_{0},\{x_{1}^{*},x_{2}^{*},\ldots,x_{n}^{*}\},\varepsilon)$ such that $y=T(x)$ . Fix $1\leq k\leq n$ . Then,

[TABLE]

Therefore, $y\in N_{Y}(T(x_{0}),\{y_{1}^{*},y_{2}^{*},\ldots,y_{n}^{*}\},\varepsilon)\subseteq W$ . This completes the proof of the claim. Hence

[TABLE]

Thus, by Proposition 2.3, $T^{-1}(W)$ is open in the weak topology on $X$ . $\Box$

2.3 Hahn-Banach Theorem

A real-valued function $p$ defined on a vector space $V$ is called sublinear if for every $x,y\in V$ and $0\leq\lambda<\infty$ , $p(\lambda x)=\lambda p(x)$ and $p(x+y)\leq p(x)+p(y)$ .

Although it is easy, using linear algebra, to construct linear functionals on a vector space, it is not so easy to construct continuous linear functions on a linear topological space. The key to constructing continuous linear functionals on locally convex spaces is given next.

Theorem 2.11 (Hahn-Banach Theorem [8]).

Let $Y$ be a subspace of a vector space $V$ (over $\mathbb{R}$ ) and let $p:V\rightarrow\mathbb{R}$ be a sublinear functional on $V$ . If $f$ is a linear functional on $Y$ and $f(y)\leq p(y)$ for all $y\in Y$ then there exists a linear functional $F:V\rightarrow\mathbb{R}$ such that $F|_{Y}=f$ and $F(x)\leq p(x)$ for all $x\in V$ .

Proof.

Let $\mathscr{P}$ be the collection of all ordered pairs $(M^{\prime},f^{\prime})$ , where $M^{\prime}$ is a subspace of $V$ containing $Y$ and $f^{\prime}:M^{\prime}\rightarrow\mathbb{R}$ is a linear functional defined on $M^{\prime}$ such that $f^{\prime}|_{Y}=f$ and satisfies $f^{\prime}(x)\leq p(x)$ for all $x\in M^{\prime}$ . $\mathscr{P}$ is nonempty because $(Y,f)\in\mathscr{P}$ . We partially order $\mathscr{P}$ by, $(M^{\prime},f^{\prime})\leq(M^{\prime\prime},f^{\prime\prime})$ if $M^{\prime}\subseteq M^{\prime\prime}$ and $f^{\prime\prime}|_{M^{\prime}}=f^{\prime}$ . If $\{(M_{\alpha},f_{\alpha}):\alpha\in A\}$ is a nonempty totally ordered sub-family of $\mathscr{P}$ , then set $M^{\prime}:=\bigcup\{M_{\alpha}:\alpha\in A\}$ and define the linear functional $f^{\prime}:M^{\prime}\rightarrow\mathbb{R}$ by, $f^{\prime}(x):=f_{\alpha}(x)$ if $x\in M_{\alpha}$ . Then $(M^{\prime},f^{\prime})\in\mathscr{P}$ and $(M_{\alpha},f_{\alpha})\leq(M^{\prime},f^{\prime})$ for all $\alpha\in A$ . Therefore, by Zorn’s lemma, $\mathscr{P}$ has a maximal element $(M,F)$ . We must show that $M=V$ . So suppose, in order to obtain a contradiction, that $M\not=V$ and pick $x_{0}\in V\setminus M$ and put $M^{*}:=\mbox{span}\{M,x_{0}\}$ . We will define $F^{*}:M^{*}\rightarrow\mathbb{R}$ so that $(M^{*},F^{*})\in\mathscr{P}$ and $(M,F)<(M^{*},F^{*})$ ; which will be our desired contradiction. For each $\alpha\in\mathbb{R}$ we define $F_{\alpha}$ on $M^{*}$ by, $F_{\alpha}(m+\lambda x_{0}):=f(m)+\lambda\alpha$ . It is easy to check that $F_{\alpha}$ is well defined and linear on $M^{*}$ . Moreover, $F_{\alpha}|_{M}=f$ . So it remains to show that $F_{\alpha}(x)\leq p(x)$ for all $x\in M^{*}$ . To achieve this, we need to select the right value of $\alpha\in\mathbb{R}$ .

Selection of $\alpha$ : For any $m_{1},m_{2}\in M$ and $0<\lambda_{1}<\infty$ and $0<\lambda_{2}<\infty$ we have:

[TABLE]

Therefore,

[TABLE]

for all $m_{1},m_{2}\in M$ and $0<\lambda_{1}<\infty$ , $0<\lambda_{2}<\infty$ . Hold $m_{2}$ and $\lambda_{2}$ fixed and take the supremum over $m_{1}\in M$ and $0<\lambda_{1}<\infty$ . Then for each $m_{2}\in M$ and $0<\lambda_{2}<\infty$ we have that:

[TABLE]

Now we take the infimum over $m_{2}\in M$ and $0<\lambda_{2}<\infty$ . Then,

[TABLE]

Choose $\alpha^{*}\in[a,b]$ . Then from the left-hand side of the equation we get that:

[TABLE]

From the right-hand side of the equation we get that:

[TABLE]

From these two equations we see that:

[TABLE]

That is, $(M,F)<(M^{*},F^{*})\in\mathscr{P}$ . $\Box$

We now give some applications of this famous theorem.

Corollary 2.12.

Let $Y$ be a subspace of a normed linear space $(X,\|\cdot\|)$ (over $\mathbb{R}$ ). If $f\in Y^{*}$ then there exists an $F\in X^{*}$ such that $F|_{Y}=f$ and $\|F\|=\|f\|$ .

Proof.

Consider the sublinear functional $p:X\to{\mathbb{R}}$ defined by, $p(x):=\|f\|\|x\|$ . Then $f(y)\leq p(y)$ for all $y\in Y$ . By the Hahn-Banach Theorem, (Theorem 2.11) there exists a linear functional $F:X\to{\mathbb{R}}$ such that $F|_{Y}=f$ and $F(x)\leq p(x)$ for all $x\in X$ . Therefore, $-F(x)=F(-x)\leq p(-x)=p(x)$ for all $x\in X$ too. Thus, $|F(x)|\leq p(x)$ for all $x\in X$ . This in turn implies that $\|F\|\leq\|f\|$ . On the other hand, since $F$ is an extension of $f$ , we must also have that $\|f\|\leq\|F\|$ . $\Box$

Corollary 2.13.

Let $(X,\|\cdot\|)$ be a normed linear space. For every $x\in X\setminus\{0\}$ there exists an $f\in S_{X^{*}}$ such that $f(x)=\|x\|$ .

Proof.

Let $Y:=\mbox{span}\{x\}$ and define $f\in Y^{*}$ by, $f(\lambda x):=\lambda\|x\|$ . Clearly, $\|f\|=1$ and $f(x)=\|x\|$ . By Corollary 2.12 there exists an $F\in X^{*}$ such that $\|F\|=\|f\|=1$ and $F|_{Y}=f$ . Therefore, in particular we have that $F(x)=f(x)=\|x\|$ . $\Box$

Proposition 2.14.

Let $Y$ be a subspace of a normed linear space $(X,\|\cdot\|)$ . Then the topology $\sigma(Y^{*},Y)$ on $Y$ coincides with the relative $\sigma(X^{*},X)$ topology on $Y$ .

Proof.

Let us first show that every relatively $\sigma(X^{*},X)$ -open set in $Y$ is $\sigma(Y^{*},Y)$ open. To this end, let $U$ be a relatively $\sigma(X^{*},X)$ -open set in $Y$ . Let $y\in U$ . By Proposition 2.5, there exists a finite set $\{x^{*}_{1},x^{*}_{2},\ldots,x_{n}^{*}\}\subseteq X^{*}$ and an $\varepsilon>0$ such that $N_{X}(y,\{x^{*}_{1},x^{*}_{2},\ldots,x_{n}^{*}\},\varepsilon)\cap Y\subseteq U$ . For each $1\leq k\leq n$ let $y_{k}^{*}:=x_{k}^{*}|_{Y}$ . Then

[TABLE]

Thus, by Proposition 2.3, $U$ is $\sigma(Y^{*},Y)$ -open. Now, suppose that $U$ is a $\sigma(Y^{*},Y)$ -open subset of $Y$ . Then, by Proposition 2.3, there exists a finite set $\{y^{*}_{1},y^{*}_{2},\ldots,y_{n}^{*}\}\subseteq Y^{*}$ and an $\varepsilon>0$ such that $N_{Y}(y,\{y^{*}_{1},y^{*}_{2},\ldots,y_{n}^{*}\},\varepsilon)\subseteq U$ . By Corollary 2.12, for each $1\leq k\leq n$ , there exists an $x^{*}_{k}\in X^{*}$ such that $x^{*}_{k}|_{Y}=x^{*}_{k}$ . Then

[TABLE]

Therefore, by Proposition 2.5, $U$ is open in the relative $\sigma(X^{*},X)$ -topology on $Y$ . $\Box$

Next we will show how to use the Hahn-Banach Theorem to obtain some geometric properties of locally convex spaces.

Let $S$ be a nonempty subset of a vector space $V$ . We shall say that a point $x\in S$ is a core point of $S$ if for every $v\in V$ there exists a $0<\delta<\infty$ such that $x+\lambda v\in S$ for all $0\leq\lambda<\delta$ . The set of all core points of $S$ is called the core of $S$ and is denoted by $\mathrm{Cor}(S)$ .

Let $C$ be a convex set in a vector space $V$ with $0\in\mathrm{Cor}(C)$ . Then the functional $\mu_{C}:V\rightarrow\mathbb{R}$ defined by,

[TABLE]

is called the Minkowski functional generated by the set $C$ .

Theorem 2.15.

Let $C$ be a convex subset of a vector space $V$ with [math] in the core of $C$ . Then $\mu_{C}:V\rightarrow\mathbb{R}$ is a sublinear functional. Moreover,

[TABLE]

Proof.

Given $\alpha>0$ and $\lambda>0$ , clearly $x\in\lambda C$ if, and only if, $\alpha x\in\lambda\alpha C$ . Therefore, $\mu_{C}(\alpha x)=\alpha\mu_{C}(x)$ and thus $\mu_{C}$ is positively homogeneous. We claim that $\mu_{C}$ is subadditive; that is, $\mu_{C}(x+y)\leq\mu_{C}(x)+\mu_{C}(y)$ . Fix any $s>\mu_{C}(x)$ and $t>\mu_{C}(y)$ . We have that there is some $s_{0}<s$ such that $x\in s_{0}C$ . Note that $s_{0}C\subseteq sC$ . Indeed, $0\in sC$ and if $c\in C$ , then by the convexity of $sC$ ,

[TABLE]

We see that $x\in sC$ and similarly $y\in tC$ . Then $x+y\in sC+tC$ and thus by the convexity of $C$ ,

[TABLE]

Therefore, $\mu_{C}(x+y)\leq s+t$ and so by the choice of $s$ and $t$ we have that $\mu_{C}(x+y)\leq\mu_{C}(x)+\mu_{C}(y)$ .

If $\mu_{C}(x)<1$ then $x\in\lambda C$ for some $0<\lambda<1$ and so $(1/\lambda)x\in C$ . Since $0\in C$ and $C$ is convex,

[TABLE]

If $x\in C$ then $\mu_{C}(x)\leq 1$ by the definition of the Minkowski functional. $\Box$

Remark 2.16.

If the set $C$ in Theorem 2.15 is a closed and convex subset of a topological vector space $(V,\tau)$ , with $0\in\mathrm{Cor}(C)$ and $x_{0}\not\in C$ then it is an easy exercise to show that $1<\mu_{C}(x_{0})$ .

We now give the geometric version of the Hahn-Banach Theorem.

Theorem 2.17 (Separation Theorem).

Suppose that $(X,\tau)$ is a locally convex space over ${\mathbb{R}}$ and $C$ is a nonempty closed convex subset of $X$ . If $x_{0}\not\in C$ then there exists a continuous linear functional $x^{*}$ on $X$ such that

[TABLE]

Proof.

We may assume, without loss of generality, that $0\in C$ ; because otherwise we would consider $C-x$ and $x_{0}-x$ for some $x\in C$ . Since vector addition is continuous and $x_{0}+0\not\in C$ there exist convex open neighbourhoods $U$ of $x_{0}$ and $V$ of [math] such that $(U+V)\cap C=\varnothing$ . Thus, $U\cap[C+(-V)]=\varnothing$ . Now, $-V$ is also a convex open neighbourhood of [math] and so $C+(-V)$ is a convex open set containing the set $C$ and disjoint from $U$ . Let $D:=\overline{C+(-V)}$ , then $D$ is a closed and convex set with $0\in\mathrm{int}(D)$ and $x_{0}\not\in D$ . Let $\mu_{D}$ be the Minkowski functional for $D$ . Since $D$ is closed and $x_{0}\not\in D$ we have $\mu_{D}(x_{0})>1$ (see Remark 2.16). Define a linear functional on $\mbox{span}\{x_{0}\}$ by, $f(\lambda x_{0}):=\lambda\mu_{D}(x_{0})$ . Then on $\mbox{span}\{x_{0}\}$ we have that $f(\lambda x_{0})\leq\mu_{D}(\lambda x_{0})$ . Indeed, for $0\leq\lambda$ it is clear from the definition of $f$ ; whereas for $\lambda<0$ we have $f(\lambda x_{0})=\lambda\mu_{D}(x_{0})<0$ while $\mu_{D}(\lambda x_{0})\geq 0$ . By using the Hahn-Banach Theorem we may extend $f$ onto $X$ so that $f(x)\leq\mu_{D}(x)$ for all $x\in X$ . If $x\in D$ then $\mu_{D}(x)\leq 1$ and thus, $f(x)\leq\mu_{D}(x)\leq 1$ . Since $D$ contains a neighbourhood of the origin we have that $f$ is a bounded on a neighbourhood of [math] and so by Proposition 2.8, $f\in X^{*}$ . Since $f(x_{0})=\mu_{D}(x_{0})>1$ we get that $\sup\{f(x):x\in C\}\leq\sup\{f(x):x\in D\}\leq 1<f(x_{0})$ . $\Box$

An immediate consequence of the Separation Theorem is the following result, which is sometimes known as Mazur’s Theorem.

Proposition 2.18.

Let $C$ be a closed convex subset of a normed linear space $(X,\|\cdot\|)$ . Then $C$ is also closed with respect to the weak topology on $X$ .

Proof.

If $C$ is empty or the whole space, then $C$ is weakly closed, so let us suppose otherwise. Let $x_{0}\in X\setminus C$ . Since $C$ is closed and convex, we have, by the Separation Theorem (Theorem 2.17), the existence of an $f_{x_{0}}\in X^{*}$ such that $f_{x_{0}}(x_{0})>\sup_{x\in C}f_{x_{0}}(x)$ . Thus, $x_{0}\in f_{x_{0}}^{-1}\big{(}\!\left(\sup_{x\in C}f_{x_{0}}(x),\infty\right)\!\big{)}$ , which, being the inverse image of an open set, is weakly open. It is then straightforward to check that $X\setminus C=\bigcup_{x_{0}\in X\setminus C}f_{x_{0}}^{-1}\big{(}(\sup_{x\in C}f_{x_{0}}(x),\infty)\big{)}$ . Hence, $X\setminus C$ , being the union of weakly open sets, is weakly open. Thus, $C$ is weakly closed. $\Box$

2.4 Weak∗ topology

Let $(X,\|\cdot\|)$ be a normed linear space. For each $x\in X$ we define, $\widehat{x}\in X^{**}:=(X^{*})^{*}$ by, $\widehat{x}(x^{*}):=x^{*}(x)$ for all $x^{*}\in X^{*}$ . To show that $\widehat{x}$ is really in $X^{**}$ we must first check that it is linear and then check that it is continuous. So suppose that $x^{*}$ and $y^{*}$ are in $X^{*}$ , then

[TABLE]

Also, for any $\lambda\in\mathbb{R}$ and $x^{*}\in X^{*}$ we have that

[TABLE]

Now, $|\widehat{x}(x^{*})|=|(x^{*})(x)|\leq\|x^{*}\|\cdot\|x\|$ . Therefore, $\|\widehat{x}\|\leq\|x\|$ and so $\widehat{x}\in X^{**}$ .

Proposition 2.19.

Let $(X,\|\cdot\|)$ be a normed linear space. Then the mapping $x\mapsto\widehat{x}$ is a linear isometry from $X$ into $X^{**}$ .

Proof.

The mapping $x\mapsto\widehat{x}$ from $X$ into $X^{**}$ is linear, since for all $x^{*}\in X^{*}$

[TABLE]

Therefore, $\widehat{x+y}=\widehat{x}+\widehat{y}$ . Also, for any $\lambda\in\mathbb{R}$ and $x^{*}\in X^{*}$ ,

[TABLE]

Therefore, $\widehat{(\lambda x)}=\lambda\widehat{x}$ . Next we show that $x\mapsto\widehat{x}$ is an isometry. For each $x\in X$ , we have by Corollary 2.13, a linear function $x^{*}\in S_{X^{*}}$ such that $x^{*}(x)=\|x\|$ . Therefore, $\|\widehat{x}\|\geq\frac{|\widehat{x}(x^{*})|}{\|x^{*}\|}=|\widehat{x}(x^{*})|=|x^{*}(x)|=\|x\|$ . $\Box$

If $(X,\|\cdot\|)$ is a Banach space then $\widehat{X}$ is a closed subspace of $X^{**}$ where $\widehat{X}$ is defined as $\{\widehat{x}:x\in X\}$ . We call $\widehat{X}$ the natural embedding of $X$ into $X^{**}$ and we call $x\mapsto\widehat{x}$ from $X$ into $X^{**}$ the natural embedding mapping.

An important topology for our concerns is the weak∗ topology. Suppose that $(X,\|\cdot\|)$ is a normed linear space. Then we call the topology $\sigma(\widehat{X},X^{*})$ on $X^{*}$ , the weak∗ topology on $X^{*}$ and we write $(X^{*},\mathrm{weak}^{*})$ for $(X^{*},\sigma(\widehat{X},X^{*}))$ . It follows form Proposition 2.8 that $F\in X^{**}$ is weak∗ continuous if, and only if, $F\in\widehat{X}$ .

Let $(X,\|\cdot\|)$ be a normed linear space and let $A\subseteq X$ . We define the (upper) polar of $A$ to be the subset $A^{\circ}$ of $X^{*}$ defined by

[TABLE]

Proposition 2.20.

Let $(X,\|\cdot\|)$ be a normed linear space and let $A\subseteq X$ . Then $A^{\circ}$ is convex, weak-closed, and contains [math].*

Proof.

The fact that $0\in A^{\circ}$ is trivial. To see that $A^{\circ}$ is weak*-closed and convex, note that $A^{\circ}=\bigcap_{a\in A}\widehat{a}^{-1}(-\infty,1]$ is the intersection of weak*-closed and convex sets, and so is itself, weak*-closed and convex. $\Box$

There are many interesting properties of polars that can be easily verified. For example, (i) if $A\subseteq B$ then $B^{\circ}\subseteq A^{\circ}$ , (ii) $(B_{X})^{\circ}=B_{X^{*}}$ , (iii) for any $r>0$ , $(rA)^{\circ}=r^{-1}A^{\circ}$ . By combining these we see that if $0\in\mathrm{int}(A)$ then $A^{\circ}$ is bounded and if $A$ is bounded then $0\in\mathrm{int}(A^{\circ})$ .

There is also a dual version of (upper) polars. Let $(X,\|\cdot\|)$ be a normed linear space and let $A\subseteq X^{*}$ . We define the (lower) polar of $A$ to be the subset $A_{\circ}$ of $X$ defined by

[TABLE]

There are many interesting relationships between these polars. For example, for any $\varnothing\not=A\subseteq X$ , $(A^{\circ})_{\circ}=\overline{\mathrm{co}}(A\cup\{0\})$ and for any $\varnothing\not=B\subseteq X^{*}$ , $(B_{\circ})^{\circ}=\overline{\mathrm{co}}^{w^{*}}(B\cup\{0\})$ .

Perhaps the most famous theorem concerning polars is the following theorem.

Theorem 2.21 (Bipolar Theorem).

Let $C$ be a closed, convex subset of a normed linear space $(X,\|\cdot\|)$ with $0\in C$ . Then $C^{\circ\circ}:=(C^{\circ})^{\circ}=\overline{\widehat{C}}^{w^{*}}$ .

Proof.

It follows directly from the definition of $(C^{\circ})^{\circ}$ that $\widehat{C}\subseteq C^{\circ\circ}$ . Moreover, by Proposition 2.20 we know that $C^{\circ\circ}$ is weak∗-closed and so $\overline{\widehat{C}}^{w^{*}}\subseteq C^{\circ\circ}$ . Now suppose, in order to obtain a contradiction, that $\overline{\widehat{C}}^{w^{*}}\subsetneq C^{\circ\circ}.$ Then there exists an $F_{0}\in C^{\circ\circ}\setminus\overline{\widehat{C}}^{w^{*}}$ . By Theorem 2.17, applied in $(X^{**},\mathrm{weak}^{*})$ , there exists an $x^{*}\in X^{*}$ such that

[TABLE]

If necessary, we may replace $x^{*}$ by $\lambda x^{*}$ , (for some $0<\lambda$ and relabelling), so that

[TABLE]

Therefore, $x^{*}\in C^{\circ}$ . However, this implies that $F_{0}(x^{*})\leq 1$ since $F_{0}\in C^{\circ\circ}$ , which contradicts the earlier inequality: $1<F_{0}(x^{*})$ . $\Box$

An important application of the Bipolar Theorem is given next.

Corollary 2.22 (Goldstine’s Theorem).

Let $(X,\|\cdot\|)$ be a normed linear space then $B_{\widehat{X}}$ is weak∗ dense in $B_{X^{**}}$ .

Proof.

We apply twice, the general fact (observed before) that if $B_{Y}$ is the closed unit ball of a normed linear space $Y$ then $B_{Y^{*}}=(B_{Y})^{\circ}$ to obtain

[TABLE]

and then apply the Bipolar Theorem. $\Box$

Perhaps the main reason for the interest in the weak∗ topology is contained in the next theorem. It says that, although it is too much to ask that the dual ball be compact with respect to the norm topology (unless the space is finite dimensional), it is possible that it is compact with respect to a weaker topology.

Theorem 2.23 (Banach-Alaoglu Theorem [1]).

Let $(X,\|\cdot\|)$ be a normed linear space. Then $(B_{X^{*}},\mbox{weak$ {}^{} $})$ is compact.*

Proof.

For each $x\in X$ , let $I_{x}:=[-x,x]$ and let $Y:=\prod_{x\in X}I_{x}$ be endowed with the product topology. By Tychonoff’s Theorem, $Y$ is compact. It follows from the definition of the product topology and Proposition 2.3 that $\pi:B_{X^{*}}\to Y$ , defined by, $\pi(x^{*})(x):=x^{*}(x)$ for all $x\in X$ , is a homeomorphic embedding of $B_{X^{*}}$ into $Y$ . So to show that $(B_{X^{*}},\mathrm{weak^{*}})$ is compact it is sufficient to show that $\pi(B_{X^{*}})$ is a closed subset of $Y$ , that is, it is sufficient to show that $\overline{\pi(B_{X^{*}})}\subseteq\pi(B_{X^{*}})$ . To this end, let $g\in\overline{\pi(B_{X^{*}})}$ . We will show that $g$ is “linear”. Let $x,y\in X$ and $\varepsilon>0$ . Then there exists $x^{*}\in B_{X^{*}}$ such that $|g(x)-\pi(x^{*})(x)|<\varepsilon/3$ , $|g(y)-\pi(x^{*})(y)|<\varepsilon/3$ and $|g(x+y)-\pi(x^{*})(x+y)|<\varepsilon/3$ . Then, since $x^{*}$ is linear,

[TABLE]

Since $\varepsilon>0$ was arbitrary, $g(x+y)=g(x)+g(y)$ . Next, let $x\in X$ , $\lambda\in{\mathbb{R}}$ and $\varepsilon>0$ . Then there exists $x^{*}\in B_{X^{*}}$ such that $|g(\lambda x)-\pi(x^{*})(\lambda x)|<\varepsilon/2$ and $|g(x)-\pi(x^{*})(x)|<\varepsilon/2(|\lambda|+1)$ . Then, since $x^{*}$ is linear,

[TABLE]

Since $\varepsilon>0$ was arbitrary, $g(\lambda x)=\lambda g(x)$ . Thus, if we define $y^{*}:X\to{\mathbb{R}}$ by, $y^{*}(x):=g(x)$ for all $x\in X$ , then $y^{*}\in B_{X^{*}}$ and $g=\pi(y^{*})\subseteq\pi(B_{X^{*}})$ . $\Box$

Proposition 2.24.

Let $(X,\|\cdot\|)$ be a normed linear space. Then the relative weak topology and the relative weak∗ topology coincide on the subspace $\widehat{X}$ of $X^{**}$ .

Proof.

It follows immediately from the definitions that each relatively weak∗ open subset of $\widehat{X}$ is open in the relative weak topology on $\widehat{X}$ . So we need only consider the converse statement. Suppose that $U$ is a relatively weak open subset of $\widehat{X}$ . Let $\widehat{x}$ be any element of $U$ . Then by, Proposition 2.5 there exists a finite subset $\{\mathscr{F}_{1},\mathscr{F}_{2},\ldots,\mathscr{F}_{n}\}$ of $X^{***}$ and an $\varepsilon>0$ such that $N(\widehat{x},\mathscr{F}_{1},\mathscr{F}_{2},\ldots,\mathscr{F}_{N},\varepsilon)\cap\widehat{X}\subseteq U$ . For each $1\leq k\leq n$ let $f_{k}:X\to{\mathbb{R}}$ be defined by, $f_{k}(x):=\mathscr{F}_{k}(\widehat{x})$ . Then $f_{k}\in X^{*}$ , in fact $\|f_{k}\|\leq\|\mathscr{F}_{k}\|$ for each $1\leq k\leq n$ . We claim that

[TABLE]

To see this, let $F\in N(\widehat{x},\widehat{f_{1}},\widehat{f_{2}},\ldots,\widehat{f_{n}},\varepsilon)\cap\widehat{X}$ . Then $F=\widehat{y}$ for some $y\in X$ and $\widehat{y}\in N(\widehat{x},\widehat{f_{1}},\widehat{f_{2}},\ldots,\widehat{f_{n}},\varepsilon)\cap\widehat{X}$ . Fix $1\leq k\leq n$ . Then

[TABLE]

Therefore, $F\in N(\widehat{x},\mathscr{F}_{1},\mathscr{F}_{2},\ldots,\mathscr{F}_{N},\varepsilon)\cap\widehat{X}$ ; which completes the proof of the claim. The result now follows from Proposition 2.5. $\Box$

In what follows we will often use (without saying) the fact that if $(Z,\tau)$ is a topological space and $A\subseteq Y\subseteq Z$ , then $A$ is compact in $Z$ if, and only if, $A$ is compact in $Y$ , with respect to the relative topology on $Y$ .

Remark 2.25.

Together, Theorem 2.23 and Proposition 2.24 are essential for our future endeavours, as they provide a method for showing that a closed and bounded convex subset $C$ of a normed linear space $(X,\|\cdot\|)$ is weakly compact. Namely, to show that $C$ is weakly compact it is sufficient (and necessary) to show that $\overline{\widehat{C}}^{w^{*}}\subseteq\widehat{X}$ . The reason for this is as follows: $\overline{\widehat{C}}^{w^{*}}$ is a weak∗ compact subset of $(X^{**},\mathrm{weak}^{*})$ , by Theorem 2.23, and hence compact with respect to the relative weak∗ topology on $\widehat{X}$ . Therefore, by Proposition 2.24, $\overline{\widehat{C}}^{w^{*}}$ is compact with respect to the relative weak topology on $\widehat{X}$ . Further, by Proposition 2.14, $\overline{\widehat{C}}^{w^{*}}$ is compact with respect to the $\sigma((\widehat{X})^{*},\widehat{X})$ -topology on $\widehat{X}$ (i.e., the weak topology on $\widehat{X}$ ). Let $j:X\to\widehat{X}$ be the linear isometry defined by, $j(x):=\widehat{x}$ for all $x\in X$ . Then $j^{-1}:\widehat{X}\to X$ is also a linear isometry. Thus, by Proposition 2.10, $C\subseteq j^{-1}(\overline{\widehat{C}}^{w^{*}})$ is weakly compact. Since $C$ is closed and bounded it is closed in the weak topology on $X$ (see, Proposition 2.18). Hence $C$ is compact with respect to the weak topology on $X$ .

As an example of this approach, we will give our first characterisation of reflexivity in terms of the weak compactness of the unit ball.

Theorem 2.26 ([8]).

Let $(X,\|\cdot\|)$ be a normed linear space. Then $X$ is reflexive (i.e., $X^{**}=\widehat{X}$ ) if, and only if, $B_{X}$ is compact with respect to the weak topology on $X$ .

Proof.

Suppose that $B_{X}$ is compact with respect to the weak topology on $X$ . Then, by Proposition 2.10, $B_{\widehat{X}}$ is compact with respect to the weak topology on $X^{**}$ as: (i) the mapping, $x\mapsto\widehat{x}$ , is a bounded linear operator from $X$ into $X^{**}$ and (ii) the general fact that the continuous image of a compact set is compact. Now, since the weak∗ topology on $X^{**}$ is weaker (and certainly no stronger) than the weak topology on $X^{**}$ , $B_{\widehat{X}}$ is compact with respect to the weak∗ topology on $X^{**}$ . Furthermore, since the weak∗ topology is Hausdorff, $B_{\widehat{X}}$ is closed with respect to the weak∗ topology on $X^{**}$ . Thus, by Goldstine’s Theorem, (Theorem 2.22)

[TABLE]

So, $X^{**}=\bigcup_{n\in{\mathbb{N}}}nB_{X^{**}}=\bigcup_{n\in{\mathbb{N}}}nB_{\widehat{X}}=\widehat{X}$ .

Conversely, suppose that $X^{**}=\widehat{X}$ . Then $B_{X^{**}}=B_{\widehat{X}}$ and so by Theorem 2.23, $(B_{\widehat{X}},\mathrm{weak}^{*})$ is compact. Since $B_{\widehat{X}}\subseteq\widehat{X}$ we have by Proposition 2.24 that $(B_{\widehat{X}},\mathrm{weak})$ is compact. Finally, since $x\mapsto\widehat{x}$ is a linear isometry from $X$ onto $X^{**}$ (since we are assuming that $X^{**}=\widehat{X}$ ), its inverse is a continuous linear operator (in fact an isometry as well) and so by Proposition 2.10, $(B_{X},\mathrm{weak})$ is compact too, as the continuous image of a compact set is compact. $\Box$

3 James’ Theorem on weak compactness

In this section we will provide three proofs of James’ Theorem on weak compactness, listing them in order of increasing generality. First we provide a proof that is valid in all separable Banach spaces, then we give a proof that is valid in any Banach space whose dual ball is weak∗ sequentially compact, and then finally, we will present a proof that holds in all Banach spaces. It is our hope that this incremental approach to the full James’ Theorem will make the final proof more accessible and less intimidating to the reader.

3.1 James’ Theorem on weak compactness: the separable case

Convexity is the key to all our proofs of James’ Theorem.

Let $A$ be a nonempty convex subset of a vector space $V$ and let $\varphi:A\rightarrow{\mathbb{R}}$ be a function. We say that $\varphi$ is convex if

[TABLE]

for all $x,y\in A$ and all $0\leq\lambda\leq 1$ .

Lemma 3.1 ( [34]).

Let $0<\beta$ , $0<\beta^{\prime}$ and suppose that $\varphi:[0,\beta+\beta^{\prime}]\rightarrow\mathbb{R}$ is a convex function. Then

[TABLE]

Proof.

The inequality given in the statement of the lemma follows by rearranging the inequality

$\displaystyle\varphi(\beta)\leq\frac{\beta}{\beta+\beta^{\prime}}\varphi(\beta+\beta^{\prime})+\frac{\beta^{\prime}}{\beta+\beta^{\prime}}\varphi(0).$ $\Box$

Our first application of convexity is given next. It plays an important role in all three proofs of James’ theorem.

Lemma 3.2 ([34]).

Let $V$ be a vector space (over $\mathbb{R}$ ) and let $\varphi:A\rightarrow\mathbb{R}$ be a convex function defined on a convex set $A$ with $0\in A$ . If $(A_{n}:n\in\mathbb{N})$ is a decreasing sequence of nonempty, convex subsets of $V$ , $(\beta_{n}:n\in\mathbb{N})$ is any sequence of strictly positive numbers such that $(\sum_{n=1}^{\infty}\beta_{n})A_{1}\subseteq A$ , $r\in\mathbb{R}$ and

[TABLE]

then there exists a sequence $(a_{n}:n\in\mathbb{N})$ in $V$ such that, for all $n\in\mathbb{N}$ :

(i)

$a_{n}\in A_{n}$ * and* 2. (ii)

$\displaystyle\varphi($ $\sum_{i=1}^{n}\beta_{i}a_{i}$ $)+\beta_{n+1}r<\varphi($ $\sum_{i=1}^{n+1}\beta_{i}a_{i})$ **.

Proof.

We proceed in two parts. Firstly we prove that if $\beta_{n}r+\varphi(u)\!<\!\inf_{a\in A_{n}}\!\varphi(u+\beta_{n}a)$ for some $n\in\mathbb{N}$ and some $u\in(\sum_{i=0}^{n-1}\beta_{1})A_{1}$ , where $\beta_{0}:=0$ , then there exists an $a_{n}\in A_{n}$ , such that

[TABLE]

To see this, suppose that $u\in(\sum_{i=0}^{n-1}\beta_{1})A_{1}$ and that $\beta_{n}r+\varphi(u)<\inf_{a\in A_{n}}\varphi(u+\beta_{n}a)$ . Then there exists an $\varepsilon>0$ such that

[TABLE]

So, choose $a_{n}\in A_{n}$ such that $\varphi(u+\beta_{n}a_{n})<\inf_{a\in A_{n}}\varphi(u+\beta_{n}a)+\beta_{n+1}\varepsilon$ . Let $a\in A_{n}$ . Then

$v:=(\beta_{n}a_{n}+\beta_{n+1}a)/(\beta_{n}+\beta_{n+1})\in A_{n}$ (since $A_{n}$ is convex) and so,

[TABLE]

Rearranging gives

[TABLE]

for all $a\in A_{n}$ . Since $\varphi(u+\beta_{n}a_{n})<[\varphi(u+\beta_{n}v)+\beta_{n+1}\varepsilon]$ , the desired inequality follows.

From this, we may inductively construct a sequence $(a_{n}:n\in\mathbb{N})$ with the requisite properties (i) and (ii). For the first step, we set $u:=0$ and then, by hypothesis, we have that

[TABLE]

So, by the first result, there exists an $a_{1}\in A_{1}$ , such that $\displaystyle\beta_{2}r+\varphi(\beta_{1}a_{1})<\inf_{a\in A_{1}}\varphi(\beta_{1}a_{1}+\beta_{2}a)$ .

For the $n^{\rm{th}}$ step, set $u:=\sum_{i=1}^{n-1}\beta_{i}a_{i}$ . Since $A_{n}\subseteq A_{n-1}$ , and by the way $a_{n-1}$ was constructed, we have that

[TABLE]

So, by the first result again, there exists $a_{n}\in A_{n}$ , such that $\beta_{n+1}r+\varphi\left(\mbox{$ \sum_{i=1}^{n}\beta_{i}a_{i} $}\right)<\inf_{a\in A_{n}}\varphi\left(\mbox{$ \sum_{i=1}^{n}\beta_{i}a_{i}+\beta_{n+1}a $}\right)$ which completes the induction. The sequence $(a_{n}:n\in\mathbb{N})$ has the properties claimed above. $\Box$

For the proof of James’ theorem we will only require the following special case of this lemma.

Lemma 3.3.

Let $V$ be a vector space (over $\mathbb{R}$ ) and let $\varphi:V\rightarrow\mathbb{R}$ be a sub-linear function. If $(A_{n}:n\in\mathbb{N})$ is a decreasing sequence of nonempty, convex subsets of $V$ , $(\beta_{n}:n\in\mathbb{N})$ is any sequence of strictly positive numbers, $r>0$ and

[TABLE]

then there exists a sequence $(a_{n}:n\in\mathbb{N})$ in $V$ such that, for all $n\in\mathbb{N}$ :

(i)

$a_{n}\in A_{n}$ * and* 2. (ii)

$\displaystyle\varphi($ $\sum_{i=1}^{n}\beta_{i}a_{i}$ $)+\beta_{n+1}r<\varphi($ $\sum_{i=1}^{n+1}\beta_{i}a_{i})$ **.

In order to formulate our first version of James’ theorem on weak compactness we need to introduce the following notions.

Let $K$ be a weak∗ compact convex subset of the dual of a Banach space $(X,\|\cdot\|)$ . A subset $B$ of $K$ is called a boundary of $K$ if for every $\widehat{x}\in\widehat{X}$ there exists a $b^{*}\in B$ such that $\widehat{x}(b^{*})=\sup\{\widehat{x}(y^{*}):y^{*}\in K\}$ . We shall say $B$ , $(I)$ -generates $K$ , if for every countable cover $(C_{n}:n\in{\mathbb{N}})$ of $B$ by weak∗ compact convex subsets of $K$ , the convex hull of $\bigcup_{n\in{\mathbb{N}}}C_{n}$ is norm dense in $K$ . The following proof is found in [34].

Theorem 3.4 ([10, 11]).

Let $K$ be a weak∗ compact convex subset of the dual of a Banach space $(X,\|\cdot\|)$ and let $B$ be a boundary of $K$ . Then $B$ , $(I)$ -generates $K$ .

Proof.

After possibly translating $K$ , we may assume that $0\in B$ . Let $\{C_{n}:n\in{\mathbb{N}}\}$ be weak* compact, convex subsets of $K$ such that $B\subseteq\bigcup_{n\in{\mathbb{N}}}C_{n}$ and suppose, for a contradiction, that co $[\bigcup_{n\in{\mathbb{N}}}C_{n}]$ is not norm dense in $K$ . Then there must exist an $0<\varepsilon$ and $y^{*}\in K$ such that

[TABLE]

Since, for all $n\in{\mathbb{N}}$ , co $[\bigcup_{j=1}^{n}C_{j}]$ is weak* compact and convex, there exist $(\widehat{x}_{n}:n\in{\mathbb{N}})$ in $\widehat{X}$ such that for every $n\in{\mathbb{N}}$ , $\|\widehat{x}_{n}\|=1$ and

[TABLE]

Now, $(\widehat{x}_{n}(y^{*}):n\in{\mathbb{N}})$ is a bounded sequence of real numbers and thus has a convergent subsequence $(\widehat{x}_{n_{k}}(y^{*}):k\in{\mathbb{N}})$ . Let $\displaystyle s:=\lim_{k\rightarrow\infty}\widehat{x}_{n_{k}}(y^{*})$ . Then, $\varepsilon\leq s$ and, after relabelling the sequence $(\widehat{x}_{n}:n\in{\mathbb{N}})$ if necessary, we may assume that $|\widehat{x}_{n}(y^{*})-s|<\varepsilon/3$ for all $n\in{\mathbb{N}}$ . Note that this relabelling does not disturb the inequality in ( $**$ ).

We define $A_{n}:=\text{co}\{\widehat{x}_{k}:n\leq k\}$ for all $n\in{\mathbb{N}}$ and note that: (i) $(A_{n}:n\in{\mathbb{N}})$ is a decreasing sequence of nonempty convex subsets of $\widehat{X}$ and (ii) if $N<n$ and $b^{*}\in C_{N}$ then

[TABLE]

since, $\{\widehat{x}_{k}:n\leq k\}\subseteq\{\widehat{x}\in\widehat{X}:\widehat{x}(b^{*}-y^{*})<-\varepsilon\}$ ; which is convex. Next, we define $p:\widehat{X}\rightarrow{\mathbb{R}}$ by,

[TABLE]

Then $p$ defines a sublinear functional on $\widehat{X}$ . Moreover, for all $g\in A_{1}$ , we have $(s-\varepsilon/3)<g(y^{*})\leq p(g)$ since $\{\widehat{x}_{n}:n\in{\mathbb{N}}\}\subseteq\{\widehat{x}\in\widehat{X}:(s-\varepsilon/3)<\widehat{x}(y^{*})\}$ ; which is convex and $y^{*}\in K$ .

Let $(\beta_{n}:n\in{\mathbb{N}})$ be any sequence of positive numbers such that $\displaystyle\lim_{n\rightarrow\infty}\left(\mbox{$ \sum_{i=n+1}^{\infty} $}\beta_{i}\right)/\beta_{n}=0$ . Now,

[TABLE]

Therefore, by Lemma 3.3, there exists a sequence $(g_{n}:n\in{\mathbb{N}})$ in $\widehat{X}$ such that $g_{n}\in A_{n}$ and

[TABLE]

Since $\|g_{n}\|\leq 1$ for all $n\in{\mathbb{N}}$ , we have that $\sum_{i=1}^{\infty}\|\beta_{i}g_{i}\|\leq\sum_{i=1}^{\infty}\beta_{i}<\infty$ . As $X$ is a Banach space, this implies that $g:=\sum_{i=1}^{\infty}\beta_{i}g_{i}\in\widehat{X}$ . Because $p$ is continuous, this implies that $(p(\sum_{i=1}^{n}\beta_{i}g_{i}):n\in\mathbb{N})$ is a convergent - and hence bounded - sequence in $\mathbb{R}$ . Moreover, Lemma 3.3 gives that $(p(\sum_{i=1}^{n}\beta_{i}g_{i}):n\in\mathbb{N})$ is an increasing sequence. Therefore, by the Convergence Theorem, $(p(\sum_{i=1}^{n}\beta_{i}g_{i}):n\in\mathbb{N})$ converges to its supremum. That is,

[TABLE]

Since $g\in\widehat{X}$ , and since $B$ is a boundary for $K$ , there must exist a $b^{*}\in B$ such that

[TABLE]

Then,

[TABLE]

Since $B\subseteq\bigcup_{n\in{\mathbb{N}}}C_{n}$ , $b^{*}\in C_{N}$ for some $N\in{\mathbb{N}}$ . Thus, if $N<n$ , then

[TABLE]

since $g_{n}\in A_{n}$ . By taking the limit as $n$ tends to infinity we get that $(s-\varepsilon/2)\leq(s-\varepsilon)$ ; which is impossible. Therefore, $B$ , ( $I$ )-generates $K$ . $\Box$

Remark 3.5.

If $\displaystyle\beta_{n}:=\frac{1}{n!}$ for all $n\in{\mathbb{N}}$ or, $\displaystyle\beta_{n}:=\frac{1}{2^{n^{2}}}$ for all $n\in{\mathbb{N}}$ , then $\displaystyle\lim_{n\rightarrow\infty}\frac{\mbox{$ \sum_{i=n+1}^{\infty} $}\beta_{i}}{\beta_{n}}=0$ .

Theorem 3.6 (James’ Theorem: version 1, [18]).

Let $C$ be a closed and bounded convex subset of a Banach space $(X,\|\cdot\|)$ . If $C$ is separable and every continuous linear functional on $X$ attains its supremum over $C$ , then $C$ is weakly compact.

Proof.

Let $K:=\overline{\widehat{C}}^{w^{*}}$ . To show that $C$ is weakly compact it is sufficient to show $K\subseteq\widehat{X}$ , (see Remark 2.25). In fact, since $X$ is a Banach space and $x\mapsto\widehat{x}$ is a linear isometry, we have that $\widehat{X}$ is a Banach subspace of $X^{**}$ and so a closed subspace of $X^{**}$ . Therefore, it is sufficient to show that for every $0<\varepsilon$ , $K\subseteq\widehat{X}+2\varepsilon B_{X^{**}}$ . To this end, fix $0<\varepsilon$ and let $\{x_{n}:n\in{\mathbb{N}}\}$ be a dense subset of $C$ . For each $n\in{\mathbb{N}}$ , let $K^{\varepsilon}_{n}:=K\cap[\widehat{x_{n}}+\varepsilon B_{X^{**}}]$ . Then $(K^{\varepsilon}_{n}:n\in{\mathbb{N}})$ is a cover of $\widehat{C}$ by weak∗ closed convex subsets of $K$ . Since $\widehat{C}$ is a boundary of $K$ , we have that $K\subseteq\overline{\mbox{co}}\bigcup_{n\in{\mathbb{N}}}K^{\varepsilon}_{n}\subseteq\widehat{X}+2\varepsilon B_{X^{**}}$ ; which completes the proof. $\Box$

By working a bit harder, we could extend this approach to proving James’ theorem, via ( $I$ )-generation, to spaces whose dual ball is weak∗ sequentially compact. Indeed, this is done in the paper [33]. However, in this paper we will take another tack. We will prove James’ theorem, in the case when the dual ball is weak∗ sequentially compact, in a way that naturally extends to the general case, albeit requiring several extra technical results regarding the extraction of subsequences with “small” sets of cluster points.

One of the strengths of Theorem 3.6 is that it essentially only relies upon a separation argument (Theorem 2.17) and Lemma 3.3. In this way we see that this proof is very elementary.

3.2 James’ Theorem on weak compactness: the weak∗ sequentially compact case

We shall shall start this subsection with two simple preliminary results.

Proposition 3.7.

Let $(X,\|\cdot\|)$ be a normed linear space. Then every finite-dimensional subspace of $X^{*}$ is weak∗-closed.

Proof.

Suppose that $Y:=\text{span}\{x^{*}_{1},\dots,x^{*}_{n}\}$ is a finite-dimensional subspace of $X^{*}$ and let $x_{0}^{*}\notin Y$ . Then, by Lemma 2.7, we have that $\bigcap_{i=1}^{n}\ker(x^{*}_{i})\not\subseteq\ker(x_{0}^{*})$ . So, let

[TABLE]

Then, taking $-x$ if need be, we may assume that $x_{0}^{*}(x)>0$ , while $x^{*}_{i}(x)=0$ for all $1\leq i\leq n$ . Observe that $Y=\text{span}\{x^{*}_{1},\dots,x^{*}_{n}\}\subseteq\ker(\widehat{x})$ , since $\ker(\widehat{x})$ is a subspace and $x^{*}_{i}\in\ker(\widehat{x})$ for all $1\leq i\leq n$ . So $y^{*}(x)=0$ for all $y^{*}\in Y$ . Thus,

[TABLE]

is a weak∗-open neighbourhood of $x_{0}^{*}$ , which is disjoint from $Y$ . Since $x_{0}^{*}$ was arbitrary, we have that $Y$ is weak∗-closed. $\Box$

Lemma 3.8.

Let $(X,\|\cdot\|)$ be a normed linear space, let $Y$ be a finite-dimensional subspace of $X^{*}$ , and let $\varepsilon>0$ . If $x^{*}\in X^{*}$ and $\emph{dist}(x^{*},Y)>\varepsilon$ , then there exists an $x\in S_{X}$ such that $x^{*}(x)>\varepsilon$ and $y^{*}(x)=0$ for all $y^{*}\in Y$ .

Proof.

Let $Y$ be a finite-dimensional subspace of $X^{*}$ such that $\text{dist}(x^{*},Y)>\varepsilon>0$ . Then we have that $x^{*}\notin Y+\varepsilon B_{X^{*}}$ . Since $Y$ is weak∗-closed (Proposition 3.7) and convex, and $B_{X^{*}}$ is weak∗-compact (Theorem 2.23) and convex, we have that $Y+\varepsilon B_{X^{*}}$ is also weak∗-closed and convex. Therefore, by Theorem 2.17, there exists an $x\in S_{X}$ such that

[TABLE]

Finally observe that for this $x$ , we have that $\widehat{x}(Y)$ is bounded above, and since $Y$ is a subspace, the only way this is possible is if $\widehat{x}(y^{*})=y^{*}(x)=0$ for all $y^{*}\in Y$ . $\Box$

Theorem 3.9 (James’ Theorem: version 2).

Let $C$ be a closed, bounded, convex subset of a Banach space $X$ . If $(B_{X^{*}},\mbox{weak}^{*})$ is sequentially compact, and every $x^{*}\in X^{*}$ attains its supremum over $C$ , then $C$ is weakly compact.

Proof.

To show that $C$ is weakly compact, it is sufficient to show that $K:=\overline{\widehat{C}}^{w*}\subseteq\widehat{X}$ (see, Remark 2.25). Suppose, for a contradiction, that this is not the case. Then, there exists an $F\in K\backslash\widehat{X}$ . Since $X$ is a Banach space, $\widehat{X}$ is a closed subspace of $X^{**}$ , and so there must exist an $0<\varepsilon<\text{dist}(F,\widehat{X})$ . Let $(\beta_{n}:n\in\mathbb{N})$ be a sequence of strictly positive numbers such that $\lim_{n\rightarrow\infty}\frac{1}{\beta_{n}}\sum_{i=n+1}^{\infty}\beta_{i}=0$ .

Part I: Let $f_{0}:=0$ . We inductively create sequences $(f_{n}:n\in\mathbb{N})$ in $S_{X^{*}}$ and $(\widehat{x}_{n}:n\in\mathbb{N})$ in $\widehat{C}$ , such that the statements

•

$(A_{n}):-$ $|(F-\widehat{x}_{n})(f_{j})|<\varepsilon/2$ for all $0\leq j<n.$

•

$(B_{n}):-$ $F(f_{n})>\varepsilon$ and $\widehat{x}_{j}(f_{n})=0$ for all $1\leq j\leq n.$

are true for all $n\in\mathbb{N}$ . For the first step, choose any $\widehat{x}_{1}\in\widehat{C}$ . Then it is clear that $|(F-\widehat{x}_{1})(f_{0})|=0<\varepsilon/2$ . Now note that

[TABLE]

And so, by Lemma 3.8, there exists $f_{1}\in S_{X}$ such that $F(f_{1})>\varepsilon$ and $\widehat{x}_{1}(f_{1})=0$ . So the statements $(A_{1})$ and $(B_{1})$ hold.

Now fix $k\in\mathbb{N}$ . Suppose that we have created $\{\widehat{x}_{1},\dots,\widehat{x}_{k}\}$ and $\{f_{1},\dots,f_{k}\}$ such that the statements $(A_{k})$ and $(B_{k})$ hold true. Then consider the set

[TABLE]

Since $W$ is a weak∗-open neighbourhood of $F$ , and $F\in\overline{\widehat{C}}^{w^{*}}$ , we can choose $\widehat{x}_{k+1}\in\widehat{C}$ such that $\widehat{x}_{k+1}\in W$ i.e., such that the statement $(A_{k+1})$ holds. Next, observe that

[TABLE]

So, by Lemma 3.8, there exists $f_{k+1}\in S_{X}$ such that $F(f_{k+1})>\varepsilon$ and $\widehat{x}_{j}(f_{k+1})=0$ for all $1\leq j\leq k+1$ . Therefore the statement $(B_{k+1})$ also holds. This completes the induction.

Part II: Now let $(n_{k}:k\in\mathbb{N})$ be a strictly increasing sequence of natural numbers. Then for all $k\in\mathbb{N}$ , define $f^{\prime}_{k}:=f_{n_{k}}$ and $x^{\prime}_{k}:=x_{n_{k}}$ . Also define $f^{\prime}_{0}:=0$ . Then the sequences $(\widehat{x}^{\prime}_{n}:n\in\mathbb{N})$ and $(f^{\prime}_{n}:n\in\mathbb{N})$ still satisfy $(A_{n})$ and $(B_{n})$ for all $n\in\mathbb{N}$ . Therefore, passing to a subsequence does not disturb the statements $(A_{n})$ and $(B_{n})$ .

Now, as $(B_{X^{*}},\mbox{weak}^{*})$ is sequentially compact, and $(f_{n}:n\in\mathbb{N})$ is a sequence in $B_{X^{*}}$ , we have that $(f_{n}:n\in\mathbb{N})$ has a weak∗-convergent subsequence. So, by passing to subsequences and relabelling if necessary, we may assume that $(f_{n}:n\in\mathbb{N})$ is weak∗-convergent to some $f_{\infty}\in B_{X^{*}}$ . By the above, we know that the statements $(A_{n})$ and $(B_{n})$ remain true for all $n\in\mathbb{N}$ .

Part III: Let $k\in\mathbb{N}$ . For any $n\geq k$ , we have that that $\widehat{x}_{k}(f_{n})=0$ by the statement $(B_{n})$ . Therefore, it follows that $\widehat{x}_{k}(f_{\infty})=0$ . Since $k$ was arbitrary, this is true for all $k\in\mathbb{N}$ .

On the other hand, let $k\in\mathbb{N}$ and let $n>k$ . Then, by the statement $(A_{n})$ , we have that $|(F-\widehat{x}_{n})(f_{k})|<\varepsilon/2$ . Moreover, from $(B_{k})$ , we know that $F(f_{k})>\varepsilon$ . Combining these, we get that

[TABLE]

for all $n>k$ . Therefore $\widehat{x}_{n}(f_{k}-f_{\infty})>\varepsilon/2,$ for all $n>k$ .

Part IV: For each $n\in\mathbb{N}$ , define $C_{n}:=\text{co}\{f_{k}:k\geq n\}-f_{\infty}$ and note that $(C_{n}:n\in\mathbb{N})$ is a decreasing sequence of nonempty, convex subsets of $X^{*}$ . Define $p:X^{*}\rightarrow\mathbb{R}$ to be $p(x^{*})=\sup\{x^{*}(c):c\in C\}$ for all $x^{*}\in X^{*}$ . Then $p$ is a sublinear function and $\inf_{f\in C_{1}}p(f)>\varepsilon/4.$

To see this, let $f\in C_{1}$ . Then $f=\sum_{i=1}^{k}\lambda_{i}f_{n_{i}}-f_{\infty}$ where $\lambda_{i}\geq 0$ for all $1\leq i\leq k$ and $\sum_{i=1}^{k}\lambda_{i}=1$ . Let $m>\max\{n_{1},\dots,n_{k}\}$ . Then

[TABLE]

Therefore, since $f\in C_{1}$ was arbitrary, we have that $\inf_{f\in C_{1}}p(f)>\varepsilon/4$ as claimed. So, by Lemma 3.3, there exists a sequence $(g_{n}:n\in\mathbb{N})$ such that for all $n\in\mathbb{N}$ :

(i)

$g_{n}\in\text{co}\{f_{k}:k\geq n\}$ and 2. (ii)

$\displaystyle p($$\sum_{i=1}^{n}\beta_{i}(g_{i}-f_{\infty})$$)+\beta_{n+1}\varepsilon/4<p($$\sum_{i=1}^{n+1}\beta_{i}(g_{i}-f_{\infty}))$ . $(*)$

Part V: Now, since $(X^{*},\mathrm{weak}^{*})$ is a locally convex space $(g_{n}:n\in{\mathbb{N}})$ also converges to $g_{\infty}:=f_{\infty}$ . Indeed, if $W$ is any convex weak∗ open neighbourhood of $f_{\infty}$ then there exists an $N\in{\mathbb{N}}$ such that $f_{n}\in W$ for all $n\geq N$ . Therefore, $\mathrm{co}\{f_{k}:k\geq N\}\subseteq W$ . Since $g_{k}\in\mathrm{co}\{f_{i}:i\geq N\}$ for all $k\geq N$ we have that $g_{k}\in W$ for all $k\geq N$ . This shows that $(g_{k}:k\in{\mathbb{N}})$ converges to $g_{\infty}=f_{\infty}$ . Set $g:=\sum_{i=1}^{\infty}\beta_{i}(g_{i}-f_{\infty})$ . Since $\|g_{i}-f_{\infty}\|\leq 2$ for all $i\in\mathbb{N}$ , we have that

[TABLE]

Therefore, $g\in X^{*}$ since $X^{*}$ is a Banach space. As $p$ is continuous, it is clear that $(p(\sum_{i=1}^{n}\beta_{i}(g_{i}-f_{\infty})):n\in\mathbb{N})$ is a convergent - and in particular bounded - sequence in $\mathbb{R}$ . Moreover, the statement $(*)$ above gives that this is also an increasing sequence. Therefore, by the Monotone Convergence Theorem, $(p(\sum_{i=1}^{n}\beta_{i}(g_{i}-f_{\infty})):n\in\mathbb{N})$ converges to its supremum. That is,

[TABLE]

Part VI: Since $g\in X^{*}$ , there exists a $c\in C$ such that $\widehat{c}(g)=g(c)=\sup\{g(x):x\in C\}=p(g).$ Then, for any $n>1$ ,

[TABLE]

Rearranging gives that

[TABLE]

Taking $n\rightarrow\infty$ we get that

[TABLE]

which contradicts the fact that $\displaystyle f_{\infty}(c)=\lim_{n\rightarrow\infty}g_{n}(c)$ . Therefore, $K\subseteq\widehat{X}$ and so $C$ is weakly compact. $\Box$

The power of this result stems from the fact that the class of all Banach spaces whose dual ball is weak∗ sequentially compact is very large. Indeed, in addition to all the separable Banach spaces (whose dual ball is weak∗ metrisable), it contains all Asplund spaces, [28] (i.e., spaces in which every separable subspace has a separable dual space) and all spaces that admit an equivalent smooth norm, [15] (which includes all WCG spaces, [7]). In fact, it contains all Gateaux differentiability spaces, [28].

3.3 James’ Theorem on weak compactness: the general case

The short-coming of the previous subsection is that the dual ball of a Banach space need not be weak∗ sequentially compact. For example, the dual ball of $(C(\beta{\mathbb{N}}),\|\cdot\|_{\infty})$ is not weak∗ sequentially compact, as it contains a copy of $\beta{\mathbb{N}}$ - the Stone-Cech compactification of the natural numbers, endowed with the discrete topology and this space is known to have no nontrivial (i.e., not eventually constant) convergent sequences, [9].

So the method of passing to a subsequence which is weak∗ convergent must be abandoned. However we can, by passing to a suitable subsequence, insist that $K:=\bigcap_{n\in{\mathbb{N}}}\overline{\{f_{k}:k\geq n\}}^{w^{*}}$ is “small” in the sense that for countably many weak∗ lower semicontinuous real-valued functions $(p_{n}:n\in{\mathbb{N}})$ , the sets $p_{n}(K)$ are singletons. In this way, the set $K$ of all weak∗ cluster points of the sequence $(f_{n}:n\in{\mathbb{N}})$ “acts” like a singleton set in Part V and Part VI of the proof of Theorem 3.9.

So next we will show how to extract “nice” subsequences from a given sequence. The approach we adopt is very general and will provide much more than needed, but these technical results may possibly be of some independent interest.

We shall start with the precise definition of lower semicontinuity. Let $(X,\tau)$ be a topological space. We say a function $f:X\rightarrow\mathbb{R}\cup\{\infty\}$ is lower semicontinuous if for every $\alpha\in\mathbb{R}$ , $\{x\in X:f(x)\leq\alpha\}$ is a closed set.

Since we will be working extensively with subsequences we will introduce some concise notation for a subsequence of a given sequence. Let $\widetilde{x}:\mathbb{N}\rightarrow X$ be the sequence $(x_{n}:n\in\mathbb{N})$ and let $J$ be an infinite subset of $\mathbb{N}$ , i.e., $J=\{n_{k}:k\in\mathbb{N}\}$ with $n_{k}<n_{k+1}$ for all $k\in\mathbb{N}$ . Then the subsequence $(x_{n_{k}}:k\in\mathbb{N})$ will be denoted by $\widetilde{x}|_{J}$ . We will also be working with the set of all cluster points of a given sequence and so it is worth our while to introduce some notation for the set of all cluster points (and another related set as well). Let $(X,\tau)$ be a linear topological space and let $\widetilde{x}:\mathbb{N}\rightarrow X$ be the sequence $(x_{n}:n\in\mathbb{N})$ . We define

[TABLE]

That is, $cl_{\tau}(\widetilde{x})$ is the set of all $\tau$ -cluster points of $\widetilde{x}$ . Further, define $K_{\tau}(\widetilde{x}):=\overline{\mathrm{co}}^{\tau}(cl(\widetilde{x}))$ . When there is no ambiguity concerning the topology, we will simply write $cl(\widetilde{x})$ and $K(\widetilde{x})$ .

Lemma 3.10.

If $\varphi:A\rightarrow\mathbb{R}$ is a convex lower-semicontinuous function defined on a nonempty closed and convex subset $A$ of a Hausdorff locally convex space $(X,+,\cdot,\tau)$ , then for every sequence $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ in $A$ , there is a subsequence $\widetilde{x}|_{J}$ of $\widetilde{x}$ such that $\varphi(K(\widetilde{x}|_{J}))$ is either empty, or bounded.

Proof.

Let $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ be a sequence in $A$ . Suppose that $\widetilde{x}$ has no subsequence, $\widetilde{x}|_{J}$ , such that $\varphi(K(\widetilde{x}|_{J}))$ is empty. First, we construct an infinite subset $J^{\prime}$ of ${\mathbb{N}}$ such that $\varphi(K(\widetilde{x}|_{J^{\prime}}))$ is bounded below.

Let $x\in cl(\widetilde{x})$ . Then $x\notin\varphi^{-1}(-\infty,\varphi(x)-1]$ , which is closed and convex. Therefore, there exists a closed and convex neighbourhood, $N$ of $x$ such that $N\cap\varphi^{-1}(-\infty,\varphi(x)-1]=\varnothing$ i.e., $\varphi(N)\subseteq(\varphi(x)-1,\infty)$ . Since $x$ is a cluster point of $\widetilde{x}$ , we may choose an infinite set $J^{\prime}\subseteq\mathbb{N}$ such that $x_{j}\in N$ for all $j\in J^{\prime}$ . Then, because $N$ is closed and convex, $K(\widetilde{x}|_{J^{\prime}})\subseteq N$ and so $\varphi(K(\widetilde{x}|_{J^{\prime}}))\subseteq\varphi(N)\subseteq(\varphi(x)-1,\infty)$ . Hence $\varphi(K(\widetilde{x}|_{J^{\prime}}))$ is bounded below.

We now claim that $J^{\prime}$ possesses an infinite subset $J$ such that $\varphi(K(\widetilde{x}|_{J}))$ is bounded above. Indeed, suppose in order to obtain a contradiction, that this is not the case. Then we inductively proceed as follows. First, there must be $x\in cl(\widetilde{x}|_{J^{\prime}})$ with $\varphi(x)>1$ , otherwise $\varphi(K(\widetilde{x}|_{J^{\prime}}))\subseteq(-\infty,1]$ and we would be done. So, we may choose a closed, convex neighbourhood, $N$ of $x$ such that $N\cap\varphi^{-1}(-\infty,1]=\varnothing$ . Then, since $x$ is a cluster point of $\widetilde{x}|_{J^{\prime}}$ , we can choose an infinite subset $J_{1}\subseteq J^{\prime}$ such that $x_{j}\in N$ for all $j\in J_{1}$ . Because $N$ is closed and convex, we have that $K(\widetilde{x}|_{J_{1}})\subseteq N$ and so $K(\widetilde{x}|_{J_{1}})\cap\varphi^{-1}(-\infty,1]=\varnothing$ .

In general, suppose that we have chosen infinite subsets $J_{n}\subseteq\dots\subseteq J_{1}\subseteq J^{\prime}$ such that for all $1\leq i\leq n$ : $K(\widetilde{x}|_{J_{i}})\cap\varphi^{-1}(-\infty,i]=\varnothing$ .

For the $(n+1)^{\textrm{th}}$ step, we suppose that $cl(\widetilde{x}|_{J_{n}}))\not\subseteq\varphi^{-1}(-\infty,n+1]$ , otherwise $\varphi(K(\widetilde{x}|_{J_{n}}))\subseteq(-\infty,n+1]$ is bounded above and we are done. Therefore, we can choose $x\in cl(\widetilde{x}|_{J_{n}})$ such that $\varphi(x)>n+1$ , and a closed, convex neighbourhood, $N$ of $x$ such that $N\cap\varphi^{-1}(-\infty,n+1]=\varnothing$ . Then, since $x$ is a cluster point of $\widetilde{x}|_{J_{n}}$ , we can choose an infinite subset $J_{n+1}\subseteq J_{n}$ such that $x_{j}\in N$ for all $j\in J_{n+1}$ . Because $N$ is closed and convex, we have that $K(\widetilde{x}|_{J_{n+1}})\subseteq N$ and so $K(\widetilde{x}|_{J_{n+1}})\cap\varphi^{-1}(-\infty,n+1]=\varnothing$ . This completes the induction.

Lastly, we apply the so-called diagonalisation argument. Define $J^{{}^{\prime\prime}}:=\{n_{k}:k\in\mathbb{N}\}\subseteq\mathbb{N}$ such that $n_{k}<n_{k+1}$ and $n_{k}\in J_{k}$ for all $k\in{\mathbb{N}}$ .

Consider the subsequence of $\widetilde{x}$ given by $\widetilde{x}|_{J^{{}^{\prime\prime}}}=(x_{n_{k}}:k\in\mathbb{N})$ . Then, since $J_{n+1}\subseteq J_{n}$ for all $n\in\mathbb{N}$ , we have that $n_{k}\in J_{m}$ for all $k\geq m$ . Let $m\in\mathbb{N}$ . Then

[TABLE]

and so $K(\widetilde{x}|_{J^{{}^{\prime\prime}}})\cap\varphi^{-1}(-\infty,m]=\varnothing$ . Since $m$ was arbitrary, this holds for all $m\in\mathbb{N}$ and so we have that $\varphi(K(\widetilde{x}|_{J^{{}^{\prime\prime}}}))=\varnothing$ , which contradicts our original assumption. Thus, there exists a subsequence $\widetilde{x}|_{J}$ of $\widetilde{x}$ such that $\varphi(K(\widetilde{x}|_{J}))$ is bounded. $\Box$

We can further refine Lemma 3.10 as follows.

Lemma 3.11.

If $\varphi:A\rightarrow\mathbb{R}$ is a convex lower-semicontinuous function defined on a nonempty closed and convex subset $A$ of a Hausdorff locally convex space $(X,+,\cdot,\tau)$ , then for every sequence $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ in $A$ , there is a subsequence $\widetilde{x}|_{J}$ of $\widetilde{x}$ such that $\varphi(K(\widetilde{x}|_{J}))$ is at most a singleton.

Proof.

Suppose that $\widetilde{x}$ has no subsequence, $\widetilde{x}|_{J}$ , such that $\varphi(K(\widetilde{x}|_{J}))$ is empty. Then, by Lemma 3.10, and by passing to a subsequence if necessary, we may assume that $\varphi(K(\widetilde{x}))$ is bounded. Let $\alpha_{1},\beta_{1}\in\mathbb{R}$ denote $\inf\varphi(K(\widetilde{x}))$ and $\sup\varphi(K(\widetilde{x}))$ respectively and let $J_{0}:=\mathbb{N}$ . Of course if $\alpha_{1}=\beta_{1}$ , then $\varphi(K(\widetilde{x}))$ is a singleton and we are done. If not, we inductively construct a decreasing sequence of infinite subsets $(J_{n}:n\in{\mathbb{N}})$ of ${\mathbb{N}}$ such that $\text{diam}(\varphi(K(\widetilde{x}|_{J_{n}}))\leq(\beta_{1}-\alpha_{1})/2^{n}$ for all $n\in{\mathbb{N}}$ .

We begin as follows. Set $\delta_{1}:=(\alpha_{1}+\beta_{1})/2$ . Since $\varphi$ is convex and lower-semicontinuous, we have that $\varphi^{-1}(-\infty,\delta_{1}]$ is a closed, convex set. Then, we can pick $x\in cl(\widetilde{x})$ such that $\delta_{1}<\varphi(x)\leq\beta_{1}$ . Indeed, if not, then $\varphi(cl(\widetilde{x}))\subseteq(-\infty,\delta_{1}]$ and so $\varphi(K(\widetilde{x}))\subseteq(-\infty,\delta_{1}]$ also. However, this contradicts the fact that $\beta_{1}=\sup\varphi(K(\widetilde{x}))$ .

Therefore $x\notin\varphi^{-1}(-\infty,\delta_{1}]$ , and so there exists a closed, convex neighbourhood, $N$ of $x$ such that $N\cap\varphi^{-1}(-\infty,\delta_{1}]=\varnothing$ . As $x\in cl(\widetilde{x})$ , there is an infinite set $J_{1}\subseteq\mathbb{N}$ such that $x_{j}\in N$ for all $j\in J_{1}$ . In particular, $K(\widetilde{x}|_{J_{1}})\subseteq N$ , since $N$ is closed and convex, and so $\inf\varphi(K(\widetilde{x}|_{J_{1}}))\geq\delta_{1}$ . Also, because $\widetilde{x}|_{J_{1}}$ is a subsequence of $\widetilde{x}$ , we have that $\sup\varphi(K(\widetilde{x}|_{J_{1}}))\leq\beta_{1}$ . Therefore, $\text{diam}(\varphi(K(\widetilde{x}|_{J_{1}}))\leq\beta_{1}-\delta_{1}=(\beta_{1}-\alpha_{1})/2$ .

Suppose now that we have created the infinite subsets $J_{n}\subseteq J_{n-1}\subseteq\cdots\subseteq J_{0}$ such that

[TABLE]

Set $\alpha_{n}:=\inf\varphi(K(\widetilde{x}|_{J_{n}}))$ , $\beta_{n}:=\sup\varphi(K(\widetilde{x}|_{J_{n}}))$ and $\delta_{n}:=(\alpha_{n}+\beta_{n})/2$ . Then,

[TABLE]

by construction. If $\alpha_{n}=\beta_{n}$ then let $J_{n+1}:=J_{n}$ and we are done. Otherwise, we can choose (as above) $x\in cl(\widetilde{x}|_{J_{n}})$ such that $x\notin\varphi^{-1}(-\infty,\delta_{n}]$ , because if not, $\varphi(K(\widetilde{x}|_{J_{n}}))\subseteq(-\infty,\delta_{n}]$ , which contradicts the fact that $\beta_{n}=\sup\varphi(K(\widetilde{x}|_{J_{n}}))$ . Therefore, there exists a closed, convex neighbourhood, $N$ of $x$ such that $N\cap\varphi^{-1}(-\infty,\delta_{n}]=\varnothing.$ Since $x\in cl(\widetilde{x}|_{J_{n}})$ , there is an infinite set $J_{n+1}\subseteq J_{n}$ such that $x_{j}\in N$ for all $j\in J_{n+1}$ . In particular, since $N$ is closed and convex, $K(\widetilde{x}|_{J_{n+1}})\subseteq N$ and so $\inf\varphi(K(\widetilde{x}|_{J_{n+1}}))\geq\delta_{n}$ . Therefore,

[TABLE]

Thus, by induction, we have created a decreasing sequence of infinite subsets $(J_{n}:n\in{\mathbb{N}})$ of ${\mathbb{N}}$ such that

[TABLE]

Lastly, define $J:=\{n_{k}:k\in\mathbb{N}\}$ such that $n_{k}<n_{k+1}$ and $n_{k}\in J_{k}$ for all $k\in{\mathbb{N}}$ . Consider the subsequence of $\widetilde{x}$ given by $\widetilde{x}|_{J}=(x_{n_{k}}:k\in\mathbb{N})$ . Then, since $J_{n+1}\subseteq J_{n}$ for all $n\in\mathbb{N}$ , we have that $n_{k}\in J_{m}$ for all $k\geq m$ . Let $m\in\mathbb{N}$ . Then,

[TABLE]

which gives that $\text{diam}(\varphi(K(\widetilde{x}|_{J})))\leq\text{diam}(\varphi(K(\widetilde{x}|_{J_{m}})))\leq(\beta_{1}-\alpha_{1})/2^{m}$ . Since $m\in\mathbb{N}$ was arbitrary, we conclude that $\varphi(K(\widetilde{x}|_{J}))$ is a singleton as required. $\Box$

For our next result we need to recall the definition of the topology of pointwise convergence. If $X$ is a nonempty set and $A$ is a nonempty subset of $X$ then we may put a topology on the vector space ${\mathbb{R}}^{X}$ of all real-valued functions defined on $X$ endowed with pointwise addition and pointwise scalar multiplication. We will call the weak topology on ${\mathbb{R}}^{X}$ generated by $\{\delta_{a}:a\in A\}$ the topology of pointwise convergence on $A$ , where for each $a\in A$ , $\delta_{a}:{\mathbb{R}}^{X}\to{\mathbb{R}}$ is defined by, $\delta_{a}(f):=f(a)$ . We shall denote the topology of pointwise convergence on $A$ by $\tau_{p}(A)$ .

Corollary 3.12.

For each $n\in{\mathbb{N}}$ , let $\varphi_{n}:A\rightarrow\mathbb{R}$ be a convex lower-semicontinuous function defined on a nonempty closed and convex subset $A$ of a Hausdorff locally convex space $(X,+,\cdot,\tau)$ , then for every sequence $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ in $A$ , there exists a subsequence, $\widetilde{x}|_{J}$ , of $\widetilde{x}$ such that $\varphi(K(\widetilde{x}|_{J}))$ is at most a singleton for all $\varphi\in\overline{\{\varphi_{n}:n\in\mathbb{N}\}}^{\tau_{p}(A)}$ .

Proof.

Let $J_{0}:=\mathbb{N}$ . We inductively construct a decreasing sequence of infinite subsets $(J_{n}:n\in{\mathbb{N}})$ of ${\mathbb{N}}$ such that $\varphi_{n}(K(\widetilde{x}|_{J_{n}}))$ is at most a singleton for each $n\in{\mathbb{N}}$ .

We begin as follows. Since $\varphi_{1}$ is convex and lower-semicontinuous, there exists, by Lemma 3.11, an infinite subset $J_{1}$ of ${\mathbb{N}}$ such that $\varphi_{1}(K(\widetilde{x}|_{J_{1}}))$ is at most a singleton.

Now, suppose that we have created infinite subsets $J_{n}\subseteq J_{n-1}\subseteq\cdots\subseteq J_{1}\subseteq{\mathbb{N}}$ such that $\varphi_{i}(K(\widetilde{x}|_{J_{i}}))$ is at most a singleton for all $1\leq i\leq n$ .

Then, for the $(n+1)^{\rm{th}}$ step choose, using Lemma 3.11, an infinite subset $J_{n+1}$ of $J_{n}$ such that $\varphi_{n+1}(K(\widetilde{x}|_{J_{n+1}}))$ is at most a singleton.

Now, define $J:=\{n_{k}:k\in\mathbb{N}\}\subseteq\mathbb{N}$ such that $n_{k}<n_{k+1}$ and $n_{k}\in J_{k}$ for all $k\in{\mathbb{N}}$ . Consider the subsequence of $\widetilde{x}$ given by $\widetilde{x}|_{J}=(x_{n_{k}}:k\in\mathbb{N})$ . Then, since $J_{n+1}\subseteq J_{n}$ for all $n\in\mathbb{N}$ , we have that $n_{k}\in J_{m}$ for all $k\geq m$ . Let $m\in\mathbb{N}$ . Then,

[TABLE]

and so $\big{|}\varphi_{m}(K(\widetilde{x}|_{J}))\big{|}\leq\big{|}\varphi_{m}(K(\widetilde{x}|_{J_{m}}))\big{|}\leq 1$ . Since $m$ was arbitrary, this gives that $\varphi_{m}(K(\widetilde{x}|_{J}))$ is at most a singleton for all $m\in\mathbb{N}$ .

Now, let $\varphi\in\overline{\{\varphi_{n}:n\in\mathbb{N}\}}^{\tau_{p}(A)}$ and let $x,y\in K(\widetilde{x}|_{J})$ . Suppose, for a contradiction, that $\varphi(x)>\varphi(y)$ . Then

[TABLE]

is a $\tau_{p}(A)$ -neighbourhood of $\varphi$ . Since $\varphi\in\overline{\{\varphi_{n}:n\in\mathbb{N}\}}^{\tau_{p}(A)}$ there must exist $k\in\mathbb{N}$ such that $\varphi_{k}\in N$ . However, this is impossible as $\varphi_{k}(x)=\varphi_{k}(y)$ for all $k\in\mathbb{N}$ , and so $\varphi(K(\widetilde{x}|_{J}))$ is at most a singleton. $\Box$

By applying Corollary 3.12 we obtain the following technical result that is needed (i.e., provides the required subsequence) in the proof of the general version of James’ weak compactness theorem.

Corollary 3.13.

Let $\varphi:X\to{\mathbb{R}}$ be a $\tau$ -continuous convex function defined on a locally convex space $(X,+,\cdot,\tau)$ . If $\tau^{\prime}$ is a Hausdorff locally convex topology on $X$ such that (i) $\tau^{\prime}\subseteq\tau$ and (ii) $\varphi$ is $\tau^{\prime}$ -lower semicontinuous then, for every sequence $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ in $X$ , there exists a subsequence, $\widetilde{x}|_{J}$ , of $\widetilde{x}$ such that $\varphi(y-aK_{\tau^{\prime}}(\widetilde{x}|_{J}))$ is at most a singleton for all $y\in\emph{span}\{x_{n}:n\in\mathbb{N}\}$ and all $a\in\mathbb{R}$ .

Proof.

Observe that $Y:=\text{span}\{x_{n}:n\in\mathbb{N}\}$ is separable, so let $\{y_{n}:n\in\mathbb{N}\}$ be a countable, dense subset of $Y$ . Moreover, let $\{q_{n}:n\in\mathbb{N}\}$ be an enumeration of $\mathbb{Q}\backslash\{0\}$ . Now for all $m,n\in\mathbb{N}$ , define $\varphi_{n}^{m}:X\rightarrow\mathbb{R}$ by

[TABLE]

Since $x\mapsto(y_{n}-q_{m}x)$ is a continuous affine function and $\varphi$ is convex and $\tau^{\prime}$ -lower semicontinuous, we have that $\varphi_{n}^{m}$ is $\tau^{\prime}$ -lower-semicontinuous and convex for all $m,n\in\mathbb{N}$ . Then, by Corollary 3.12, there exists a subsequence, $\widetilde{x}|_{J}$ , of $\widetilde{x}$ such that $\psi(K_{\tau^{\prime}}(\widetilde{x}|_{J}))$ is at most a singleton for all $\psi$ in the $\tau_{p}(X)$ -closure of $\{\varphi_{n}^{m}:m,n\in\mathbb{N}\}$ .

Now observe that, for all $a\in\mathbb{R}$ and all $y\in Y$ , the function $\varphi^{a}_{y}:X\rightarrow\mathbb{R}$ given by $\varphi^{a}_{y}(x):=\varphi(y-ax)$ is in the $\tau_{p}(X)$ -closure of $\{\varphi_{n}^{m}:m,n\in\mathbb{N}\}$ . Therefore, $\varphi_{y}^{a}(K_{\tau^{\prime}}(\widetilde{x}|_{J}))=\varphi(y-aK_{\tau^{\prime}}(\widetilde{x}|_{J}))$ is at most a singleton for all $y\in\text{span}\{x_{n}:n\in\mathbb{N}\}$ and all $a\in\mathbb{R}$ , as required. $\Box$

The last result we need before we can prove the full version of James’ theorem concerns the convergence of the subsequences that we constructed in Part IV of the proof of Theorem 3.9.

Proposition 3.14.

*Let $(X,\tau)$ be a locally convex space and let $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ be a sequence in a

$\tau$ -compact convex subset $K$ of $X$ . If $\widetilde{y}:=(y_{n}:n\in\mathbb{N})$ is any sequence such that $y_{k}\in\emph{co}\{x_{n}:n\geq k\}$ for all $k\in{\mathbb{N}}$ , then $cl(\widetilde{y})\subseteq K(\widetilde{x})$ .*

Proof.

It is sufficient to show that for any open, convex neighbourhood, $W$ of [math], $cl(\widetilde{y})\subseteq K(\widetilde{x})+\overline{W}$ . To this end, let $W$ be an open, convex neighbourhood of [math]. Then note that for $k$ sufficiently large,

[TABLE]

Indeed if this is not the case, then we could construct a subsequence $(x_{n_{k}}:k\in{\mathbb{N}})$ of $(x_{n}:n\in{\mathbb{N}})$ such that $x_{n_{k}}\notin cl(\widetilde{x})+W$ for all $k\in{\mathbb{N}}$ , However, since $X\setminus[cl(\widetilde{x})+W]$ is a closed set containing $\{x_{n_{k}}:k\in{\mathbb{N}}\}$ we have that $\overline{\{x_{n_{k}}:k\in{\mathbb{N}}\}}\cap[cl(\widetilde{x})+W]=\varnothing$ , but this is impossible since $\overline{\{x_{n_{k}}:k\in\mathbb{N}\}}\cap cl(\widetilde{x})\neq\varnothing$ . Thus, we have a contradiction. Therefore, if $y\in cl(\widetilde{y})$ , then for $k$ sufficiently large, we have that

[TABLE]

Hence, $cl(\widetilde{y})\subseteq K(\widetilde{x})+\overline{W}$ as required. $\Box$

Theorem 3.15 (James’ Theorem: version 3, [20]).

Let $C$ be a closed, bounded, convex subset of a Banach space $X$ . If every $x^{*}\in X^{*}$ attains its supremum over $C$ , then $C$ is weakly compact.

Proof.

To show that $C$ is weakly compact, it suffices to show that $K:=\overline{\widehat{C}}^{w*}\subseteq\widehat{X}$ (see Remark 2.26). Suppose, for a contradiction, that this is not the case. Then there exists $F\in K\backslash\widehat{X}$ . Since $\widehat{X}$ is a closed subspace of $X^{**}$ , this means there must exist $0<\varepsilon<\text{dist}(F,\widehat{X})$ . Let $(\beta_{n}:n\in\mathbb{N})$ be a sequence of strictly positive numbers such that $\lim_{n\rightarrow\infty}\frac{1}{\beta_{n}}\sum_{i=n+1}^{\infty}\beta_{i}=0$ .

Part I: We inductively create the two sequences $(\widehat{x}_{n}:n\in\mathbb{N})$ in $\widehat{C}$ , and $(f_{n}:n\in\mathbb{N})$ in $S_{X^{*}}$ , which satisfy the statements $(A_{n})$ and $(B_{n})$ , exactly as in Part I of the proof of Theorem 3.9.

Part II: Define $p:X^{*}\rightarrow\mathbb{R}$ to be $p(x^{*})=\sup\{x^{*}(c):c\in C\}$ for all $x^{*}\in X^{*}$ . Then $p$ is norm-continuous, weak∗-lower-semicontinuous and convex. Just as in Part II of the proof of Theorem 3.9, passing to a subsequence does not disturb the statements $(A_{n})$ and $(B_{n})$ .

So, by passing to a subsequence and relabelling if necessary, by Corollary 3.13 we may assume that for all $f\in\text{span}\{f_{n}:n\in\mathbb{N}\}$ and all $a\in\mathbb{R}$ , the set $p(f-aK_{w^{*}}(f_{n}:n\in\mathbb{N}))$ is at most a singleton. Since $(f_{n}:n\in\mathbb{N})$ is a sequence in $B_{X^{*}}$ (which is weak∗-compact), it must a have a weak∗-cluster point, call it $f_{\infty}$ .

Part III: This step is exactly the same as Part III of the proof of Theorem 3.9 - we deduce that $\widehat{x}_{n}(f_{k}-f_{\infty})>\varepsilon/2$ for all $n>k$ .

Part IV: As in the proof of Theorem 3.9, we use Lemma 3.3 to construct a sequence $(g_{n}:n\in\mathbb{N})$ such that for all $n\in\mathbb{N}$ :

(i)

$g_{n}\in\text{co}\{f_{k}:k\geq n\}$ and 2. (ii)

$\displaystyle p($$\sum_{i=1}^{n}\beta_{i}(g_{i}-f_{\infty})$$)+\beta_{n+1}\varepsilon/4<p($$\sum_{i=1}^{n+1}\beta_{i}(g_{i}-f_{\infty}))$ .

Part V: Since $(g_{n}:n\in\mathbb{N})$ is a sequence in $B_{X^{*}}$ (which is weak∗-compact), it must a have a weak∗-cluster point, call it $g_{\infty}$ . Then, by Proposition 3.14, we have that $g_{\infty}\in K_{w^{*}}(f_{n}:n\in\mathbb{N})$ . While it may no longer be the case that $f_{\infty}=g_{\infty}$ as in Theorem 3.9, we do have that, for all $n\in\mathbb{N}$ ,

[TABLE]

since $g_{\infty}\in K_{w^{*}}(f_{n}:n\in\mathbb{N})$ and for all $f\in\text{span}\{f_{n}:n\in\mathbb{N}\}$ and all $a\in\mathbb{R}$ , the set $p(f-aK_{w^{*}}(f_{n}:n\in\mathbb{N}))$ is a singleton. As in Part V of the proof of Theorem 3.9, we set $g:=\sum_{i=1}^{\infty}\beta_{i}(g_{i}-g_{\infty})$ and deduce that $g\in X^{*}.$

Part VI: This final step is almost the same as Part VI of the proof of Theorem 3.9, with two small changes that we note here. We may replace $f_{\infty}$ with $g_{\infty}$ throughout the inequalities, not because $f_{\infty}=g_{\infty}$ but because of statement $(**)$ above. Lastly, the final contradiction is not because $\lim_{n\rightarrow\infty}g_{n}(c)=g_{\infty}(c)$ necessarily, but because $\liminf_{n\rightarrow\infty}g_{n}(c)\leq g_{\infty}(c)$ . This still gives a contradiction. $\Box$

3.4 James’ Theorem: applications

Theorem 3.16 ([19]).

Let $(X,\|\cdot\|)$ be a Banach space. Then $X$ is reflexive if, and only if, every continuous linear functional $x^{*}$ on $X$ attains its norm (i.e., there exists an $x\in B_{X}$ such that $\|x^{*}\|=x^{*}(x)$ ).

Proof.

By Theorem 2.26, $X$ is reflexive if, and only if, $B_{X}$ is weakly compact. So the result now follows from Theorem 3.15 once one remembers that every continuous linear functional on $X$ is continuous with respect to the weak topology on $X$ . $\Box$

Note: if $X$ is reflexive then one can use the Hahn-Banach Theorem to directly show that every continuous linear functional on $X$ attains its norm. Indeed, suppose that $x^{*}$ is a nonzero continuous linear functional on $X$ . Then by Corollary 2.13 there exists an $x^{**}\in S_{X^{**}}$ such that $x^{**}(x^{*})=\|x^{*}\|$ . However, since $X$ is reflexive, $x^{**}=\widehat{x}$ for some $x\in S_{X}$ . Hence, $\|x^{*}\|=x^{**}(x^{*})=\widehat{x}(x^{*})=x^{*}(x)$ . This shows that $x^{*}$ attains its norm.

We now recall a geometric concept in Banach space theory. We say that a Banach space, $(X,\|\cdot\|)$ , is uniformly convex if, for any $\varepsilon>0$ , there exists $\delta_{\varepsilon}>0$ with the following property: if $x,y\in B_{X}$ and $\|x+y\|>2-\delta_{\varepsilon}$ , then $\|x-y\|<\varepsilon$ .

Theorem 3.17 ([38]).

Proof.

Let $x^{*}\in S_{X^{*}}$ and define $(x_{n}:n\in\mathbb{N})$ in $B_{X}$ so that $x^{*}(x_{n})>1-\frac{1}{n}$ . Let $\varepsilon>0$ and choose $\delta_{\varepsilon}>0$ such that if $x,y\in B_{X}$ and $\|x+y\|>2-\delta_{\varepsilon}$ , then $\|x-y\|<\varepsilon$ Then, for $n,m\in{\mathbb{N}}$ greater than $N_{0}:=2/\delta_{\varepsilon}$ , we have that $2\geq\|x_{n}+x_{m}\|\geq x^{*}(x_{n}+x_{m})>2-\delta_{\varepsilon}$ . By the uniform convexity of $X$ , this gives that for $n,m>N_{0}$ , we have $\|x_{n}-x_{m}\|\leq\varepsilon.$ So, $(x_{n}:n\in\mathbb{N})$ is a Cauchy sequence in $X$ . Therefore, $(x_{n}:n\in\mathbb{N})$ is convergent to some $x\in B_{X}$ . It is clear that for this $x$ , $x^{*}(x)=1=\|x^{*}\|$ . Since $x^{*}$ was arbitrary in $S_{X^{*}}$ , every $x^{*}$ in $X^{*}$ attains its norm, and so, by Theorem 3.16, $X$ is reflexive. $\Box$

Another interesting application of Theorem 3.15 is the Krein-Smulian theorem.

Corollary 3.18 (Krein-Smulian Theorem,[27]).

Let $C$ we a weakly compact subset of a Banach space $(X,\|\cdot\|)$ . Then $\overline{\mbox{co}}(C)$ is also weakly compact.

Proof.

Let $K:=\overline{\text{co}}(C)$ . Since $C$ is weakly compact, every $x^{*}\in X^{*}$ must attain its supremum over $C$ i.e. for every $x^{*}\in X^{*}$ , there exists $c\in C\subseteq K$ such that $x^{*}(c)=\sup_{x\in C}x^{*}(x)$ . However, for every $x^{*}\in X^{*}$ , it is a routine observation that

[TABLE]

And so, every $x^{*}\in X^{*}$ attains its supremum over $K$ too. Therefore, by James’ Theorem (Theorem 3.15), $K$ is weakly compact. $\Box$

Using Theorem 3.4 we can prove some well-known results of S. Simons, see [43]. For a detailed survey of Simons’ results and applications thereof, see [3].

Theorem 3.19 (Simons, [43]).

Let $K$ be a weak∗-compact, convex subset of the dual of a Banach space $(X,\|\cdot\|)$ , let $B$ be a boundary for $K$ , and let $f_{n}:K\rightarrow\mathbb{R}$ be a weak∗-lower-semicontinuous, convex function for all $n\in\mathbb{N}$ . If $(f_{n}:n\in\mathbb{N})$ is equicontinuous with respect to the norm, and $\displaystyle\limsup_{n\rightarrow\infty}f_{n}(b^{*})\leq 0$ for all $b^{*}\in B$ , then $\displaystyle\limsup_{n\rightarrow\infty}f_{n}(x^{*})\leq 0$ for all $x^{*}\in K$ .

Proof.

Let $\varepsilon>0$ . For each $n\in\mathbb{N}$ , define:

[TABLE]

Let $k\in\mathbb{N}$ . Since $f_{k}:K\rightarrow\mathbb{R}$ is weak∗-lower-semicontinuous and convex, the set $\{y^{*}\in K:f_{k}(y^{*})\leq\varepsilon/2\}$ is weak∗-closed and convex. It follows that for all $n\in\mathbb{N}$ , $C_{n}$ is the intersection of weak∗-closed and convex sets, and so is weak∗-closed and convex itself. Then, since $C_{n}\subseteq K$ for all $n\in\mathbb{N}$ , we have that $C_{n}$ is weak∗-compact and convex for all $n\in\mathbb{N}$ . Moreover, if $b^{*}\in B$ , then $\limsup_{n\rightarrow\infty}f_{n}(b^{*})\leq 0$ and so $b^{*}\in C_{N}$ for some $N\in\mathbb{N}$ . Hence, $(C_{n}:n\in\mathbb{N})$ is a countable cover of $B$ by weak∗-compact, convex subsets of $K$ .

Therefore, since $B$ is a boundary for $K$ , by Theorem 3.4 we have that $\text{co}[\bigcup_{n\in\mathbb{N}}C_{n}]=\bigcup_{n\in\mathbb{N}}C_{n}$ (since $C_{n}\subseteq C_{n+1}$ for all $n\in{\mathbb{N}}$ ) is norm-dense in $K$ . Let $x^{*}\in K$ . Since $(f_{n}:n\in\mathbb{N})$ is equicontinuous with respect to the norm, there exists a $\delta>0$ such that $f_{n}(x^{*})<f_{n}(y^{*})+\varepsilon/2$ for all $n\in\mathbb{N}$ and all $y^{*}\in B(x^{*},\delta)$ .

However since $\bigcup_{n\in\mathbb{N}}C_{n}$ is norm-dense in $K$ , there exists $N\in\mathbb{N}$ such that $B(x^{*},\delta)\cap C_{N}\neq\varnothing$ . Therefore, $f_{n}(x^{*})<\varepsilon$ for all $n>N$ and so $\limsup_{n\rightarrow\infty}f_{n}(x^{*})\leq\varepsilon$ . Since $\varepsilon>0$ and $x^{*}\in K$ were arbitrary, we have that $\limsup_{n\rightarrow\infty}f_{n}(x^{*})\leq 0$ for all $x^{*}\in K$ as claimed. $\Box$

Corollary 3.20.

Let $K$ be a weak∗-compact, convex subset of the dual of a Banach space $(X,\|\cdot\|)$ and let $B$ be a boundary for $K$ . Let $(x_{n}:n\in\mathbb{N})$ be a bounded sequence in $X$ and let $x\in X$ . If $\displaystyle\lim_{n\to\infty}b^{*}(x_{n})=b^{*}(x)$ for all $b^{*}\in B$ , then $\displaystyle\lim_{n\to\infty}x^{*}(x_{n})=x^{*}(x)$ for all $x^{*}\in K$ .

Proof.

For all $n\in\mathbb{N}$ , define $f_{n}:K\rightarrow\mathbb{R}$ to be given by

[TABLE]

Then $f_{n}:K\rightarrow\mathbb{R}$ is a weak∗-lower-semicontinuous and convex function for all $n\in\mathbb{N}$ , as $x^{*}\mapsto\widehat{(x_{n}-x)}(x^{*})$ is weak∗ continuous and linear (into ${\mathbb{R}}$ ) and $r\mapsto|r|$ is continuous and convex. Furthermore, $(f_{n}:n\in\mathbb{N})$ is equicontinuous with respect to the norm. Finally, $\limsup_{n\rightarrow\infty}f_{n}(b^{*})\leq 0$ for all $b^{*}\in B$ and so, by Theorem 3.19, $\limsup_{n\rightarrow\infty}f_{n}(x^{*})\leq 0$ for all $x^{*}\in K$ . From this it is clear that $\displaystyle\lim_{n\to\infty}x^{*}(x_{n})=x^{*}(x)$ for all $x^{*}\in K$ . $\Box$

Sometimes called the Rainwater-Simons Theorem, Corollary 3.20 is due to S. Simons (although he proved it differently). It generalises a famous result of J. Rainwater, originally from [41].

Corollary 3.21 (Simons).

Let $K$ be a weak∗-compact, convex subset of the dual of a Banach space $(X,\|\cdot\|)$ , let $B$ be a boundary for $K$ , and let $(x_{n}:n\in\mathbb{N})$ be a bounded sequence in $X$ . Then

[TABLE]

Proof.

Since $B\subseteq K$ , clearly

[TABLE]

So it only remains to show that

[TABLE]

To this end, let

[TABLE]

and for each $n\in\mathbb{N}$ , let $f_{n}:K\rightarrow\mathbb{R}$ be defined by $f_{n}(x^{*}):=\sup\{\widehat{x}_{k}(x^{*}):k\geq n\}-r$ for all $x^{*}\in K$ . Then, for all $n\in\mathbb{N}$ , $f_{n}$ is weak∗-lower-semicontinuous and convex, as the pointwise supremum of a family of convex functions is again convex and the pointwise supremum of a family of lower semi-continuous functions is again lower semi-continuous. Furthermore, $(f_{n}:n\in\mathbb{N})$ is equicontinuous with respect to the norm and moreover, $\lim_{n\rightarrow\infty}f_{n}(b^{*})\leq 0$ for all $b^{*}\in B$ . Therefore, by Theorem 3.19, $\lim_{n\rightarrow\infty}f_{n}(x^{*})\leq 0$ for all $x^{*}\in K$ . From this, the result is immediate. $\Box$

In the next part of this subsection we will show that in order to deduce that a closed and bounded convex subset $C$ of Banach space $(X,\|\cdot\|)$ is weakly compact it is not necessary to show that all the elements of $X^{*}$ attain their maximum value of $C$ , but only a “large” subset of $X^{*}$ . To achieve this goal we need some more definitions.

Let $K$ be a subset of the dual of a normed linear space $(X,\|\cdot\|)$ . A point $x^{*}\in K$ is called a weak∗ exposed point of $K$ if there exists a $x\in X\setminus\{0\}$ such that $\widehat{x}(x^{*})\geq\sup_{y^{*}\in K}\widehat{x}(y^{*})$ . There are some simple, but useful, facts that we can easily deduce about weak∗ exposed points.

Firstly, (i) if $x^{*}$ is a weak∗ exposed point of $K$ then $\lambda x^{*}$ is a weak∗ exposed point of $\lambda K$ for any $\lambda\in{\mathbb{R}}\setminus\{0\}$ ; (ii) if $x^{*}$ is a weak∗ exposed point of $K$ then $x^{*}+y^{*}$ is a weak∗ exposed point of $K+y^{*}$ for any $y^{*}\in X^{*}$ ; (iii) if $x^{*}\in A\subseteq K$ is a weak∗ exposed point of $K$ then $x^{*}$ is a weak∗ exposed point of $A$ .

The next result shows that weak∗ exposed points are directly related to weak compactness.

Proposition 3.22.

Let $K$ be a closed and convex subset of the dual of a Banach space $(X,\|\cdot\|)$ . If $0\in\mathrm{int}(K)$ and every point of $\mathrm{Bb}(K)$ is a weak∗ exposed point of $K$ , then $K_{\circ}$ is a weakly compact subset of $X$ .

Proof.

We shall appeal directly to Theorem 3.15. To this end, let $x^{*}\in X^{*}\setminus\{0\}$ . We consider two cases.

Case (I) Suppose that for every $0<\lambda$ , $\lambda x^{*}\in K$ . Let $k\in K_{\circ}$ and let $0<\lambda$ . Then

[TABLE]

Therefore, $x^{*}(k)\leq\lambda^{-1}$ . Since $0<\lambda$ was arbitrary, $x^{*}(k)\leq 0$ and so $x^{*}$ attains its maximum value over $K_{\circ}$ at $0\in K_{\circ}$ .

Case(II) Suppose that for some $0<\lambda$ , $(\lambda x^{*})\not\in K$ . Let $\Lambda:=\{r\in[0,\infty):rx^{*}\in K\}$ . Then $\Lambda$ is a closed and bounded interval of $[0,\infty)$ since $K$ is closed and convex and $\lambda\not\in\Lambda$ . Let $\lambda_{0}:=\max_{r\in\Lambda}r$ . Then $\lambda_{0}x^{*}\in\mathrm{Bd}(K)$ . Hence there exists a $x\in X\setminus\{0\}$ such that

[TABLE]

By replacing $x$ by $\mu x$ for some $\mu>0$ and relabelling if necessary, we can assume that

[TABLE]

Therefore $x\in K_{\circ}$ . On the other hand, since $\lambda_{0}x^{*}\in K\subseteq(K_{\circ})^{\circ}$ we have that

[TABLE]

Therefore $\lambda_{0}x^{*}$ attains its maximum value over $K_{\circ}$ at $x$ , and hence so does $x^{*}$ . Therefore, by Theorem 3.15, $K_{\circ}$ is weakly compact. $\Box$

Theorem 3.23 ([21]).

Let $(X,\|\cdot\|)$ be a Banach space. If there exists a weak∗ open subset $U$ of $X^{*}$ such that $\varnothing\not=S_{X^{*}}\cap U$ and every member of $S_{X^{*}}\cap U$ attains its norm on $X$ , then $X$ is reflexive.

Proof.

Suppose that $\{x_{1},x_{2},\ldots,x_{n}\}\subseteq X$ , $\varepsilon>0$ and $x_{0}^{*}\in X^{*}$ are chosen so that if

[TABLE]

then $\varnothing\not=S_{X^{*}}\cap W^{\prime}$ and every member of $S_{X^{*}}\cap W$ attains its norm on $X$ (i.e., every point of $S_{X^{*}}\cap W$ is a weak∗ exposed point of $B_{X^{*}}$ - just consider the $x\in B_{X}$ such that $\widehat{x}(x^{*})=\|x^{*}\|=1$ ). Let $K^{\prime}:=W\cap B_{X^{*}}$ . Then $K^{\prime}$ is closed and bounded and convex. Furthermore, $\mathrm{int}(K^{\prime})\not=\varnothing$ . Let us now recall some basic facts from general topology. If $A$ and $B$ are closed subsets of a topological space $(T,\tau)$ then

[TABLE]

Perhaps the easiest way to convince yourself of this is to first show that $\mathrm{Bd}(A\cap B)\subseteq\mathrm{Bd}(A)\cup\mathrm{Bd}(B)$ . Then

[TABLE]

So $\mathrm{Bd}(K^{\prime})\subseteq[\mathrm{Bd}(W)\cap B_{X^{*}}]\cup[S_{X^{*}}\cap W]$ . We claim that every point of $\mathrm{Bd}(K^{\prime})$ is a weak∗ exposed point. To see this, suppose that $x^{*}\in\mathrm{Bd}(W)\cap B_{X^{*}}\subseteq\mathrm{Bd}(W)$ . Then clearly, $x^{*}$ is a weak∗ exposed point of the set $W$ (exposed by $\widehat{x_{k}}$ for some $1\leq k\leq n$ ). Then by property (iii) above, $x^{*}$ is a weak∗ exposed point of $W\cap B_{X^{*}}$ . If $x^{*}\in S_{X^{*}}\cap W$ then by the way $W$ was chosen, $x^{*}$ is a weak∗ exposed point of $B_{X^{*}}$ and hence by property (iii) above, also a weak∗ exposed point of $B_{X^{*}}\cap W$ . Choose $x^{*}\in\mathrm{int}(K^{\prime})$ and let $K:=K^{\prime}-x^{*}$ . Then $0\in\mathrm{int}(K)$ and by property (ii) above, each point of $\mathrm{Bd}(K)$ is a weak∗ exposed point. Thus, by Proposition 3.22, $K_{\circ}$ is weakly compact. Now since $K$ is bounded, $0\in\mathrm{int}(K_{\circ})$ . Hence $X$ is reflexive. $\Box$

The proof of the next theorem can be found in [31], (see also [36, 37]).

Theorem 3.24 ([42]).

Let $(X,\|\cdot\|)$ be a Banach space and let $f:X\to\mathbb{R}\cup\{\infty\}$ be a proper function on $X$ . If $f-x^{*}$ attains minimum for every $x^{*}\in X^{*}$ then for each $a\in\mathbb{R}$ , $S(a):=\{(y,s)\in X\times\mathbb{R}:f(y)\leq s\leq a\}$ is relatively weakly compact.

Proof.

In this proof we will identify the dual of $X\times{\mathbb{R}}$ with $X^{*}\times{\mathbb{R}}$ . We will also consider $X\times{\mathbb{R}}$ endowed with the norm $\|(x,r)\|_{1}:=\|x\|+|r|$ and note that with this norm, $(X\times{\mathbb{R}},\|\cdot\|_{1})$ is a Banach space. We shall apply James’ theorem, (Theorem 3.15), in $X\times{\mathbb{R}}$ . Let $H:=\{(x,r)\in X\times{\mathbb{R}}:r=0\}$ and define $T:(X\times{\mathbb{R}})\setminus H\to(X\times{\mathbb{R}})\setminus H$ by, $T(x,r):=r^{-1}(x,-1)$ . Then $T$ is a bijection. In fact, $T$ is a homeomorphism when $(X\times{\mathbb{R}})\setminus H$ is considered with the relative weak topology. Note that since $f$ is bounded below we may assume, after possibly translating, that $1=\inf_{x\in X}f(x)$ . Our proof relies upon the Fenchel conjugate, $f^{*}:X^{*}\to{\mathbb{R}}$ of $f$ , which is defined by,

[TABLE]

It is routine to check that $f^{*}$ is convex on $X^{*}$ . We claim that $\overline{\mbox{co}}[T(\mbox{epi}(f))\cup\{(0,0)\}]$ is weakly compact. To show this, it is sufficient, because of James’ theorem, to show that every non-zero continuous linear functional attains its maximum value over $T(\mbox{epi}(f))\cup\{(0,0)\}$ . To this end, let $(x^{*},r)\in(X^{*}\times{\mathbb{R}})\setminus\{(0,0)\}$ . We consider two cases.

Case (I) Suppose that for every $0<\lambda$ , $f^{*}(\lambda x^{*})\leq\lambda r$ . Then $x^{*}(x)-\lambda^{-1}f(x)\leq r$ for all $x\in X$ and all $0<\lambda$ . Let $(y,s)\in\mbox{epi}(f)$ and let $0<\lambda$ . Then,

[TABLE]

since $f(y)\leq s$ . As $0<\lambda$ was arbitrary, $(x^{*},r)(T(y,s))\leq 0=(x^{*},r)(0,0)$ . Thus, $(x^{*},r)$ attains its maximum value over $T(\mbox{epi}(f))\cup\{(0,0)\}$ at $(0,0)$ .

Case(II) Suppose that for some $0<\lambda$ , $\lambda r<f^{*}(\lambda x^{*})$ . Then, since the mapping, $\lambda^{\prime}\mapsto f^{*}(\lambda^{\prime}x^{*})$ , is real-valued and convex, it is continuous. Furthermore, it follows, from the intermediate value theorem applied to the function $g:[0,\lambda]\to{\mathbb{R}}$ , defined by,

[TABLE]

that there exists a $0<\mu<\lambda$ such that $g(\mu)=0$ , i.e., $f^{*}(\mu x^{*})=\mu r$ , since $g(0)=-1<0<g(\lambda)$ . Thus, $\mu(x^{*},r)=(\mu x^{*},f^{*}(\mu x^{*}))$ . Choose $z\in X$ such that $f^{*}(\mu x^{*})=\mu x^{*}(z)-f(z)$ . We claim that $(x^{*},r)$ attains its maximum value over $T(\mbox{epi}(f))\cup\{(0,0)\}$ at $T(z,f(z))=f(z)^{-1}(z,-1)$ . Now,

[TABLE]

On the other hand, if $(y,s)\in\mbox{epi}(f)$ then

[TABLE]

since $f(y)\leq s$ . Note also that $(x^{*},r)(0,0)=0<\mu^{-1}=(x^{*},r)(T(z,f(z)))$ . Therefore, by James’ Theorem 3.15 , $\overline{\mbox{co}}[T(\mbox{epi}(f))\cup\{(0,0)\}]$ is weakly compact.

Let $1\leq a$ , then $T(S(a))\subseteq\overline{\mbox{co}}[T(\mbox{epi}(f))\cup\{(0,0)\}]\cap\{(x,r)\in X\times{\mathbb{R}}:r\leq-a^{-1}\}$ ; which is weakly compact. Therefore,

[TABLE]

which completes the proof. $\Box$

For each $a\in{\mathbb{R}}$ , let $L(a):=\{x\in X:f(x)\leq a\}$ . It follows from Theorem 3.24 that if $(X,\|\cdot\|)$ is a Banach space, $f:X\to{\mathbb{R}}\cup\{\infty\}$ is a proper function on $X$ and $f-x^{*}$ attains minimum for every $x^{*}\in X^{*}$ then, for each $a\in{\mathbb{R}}$ , $L(a)$ is relatively weakly compact, since $L(a)=\pi(S(a))$ , where $\pi:X\times{\mathbb{R}}\to X$ is defined by, $\pi(x,r):=x$ for all $(x,r)\in X\times{\mathbb{R}}$ and is weak-to-weak continuous, (see Proposition 2.10).

An interesting corollary of this result is the following.

Corollary 3.25 ([42]).

Let $\varphi:U\to{\mathbb{R}}$ be a continuous convex function defined on a nonempty open convex subset $U$ of a Banach space $(X,\|\cdot\|)$ . If $\varphi-x^{*}$ attains minimum for every $x^{*}\in X^{*}$ then $X$ is reflexive.

Proof.

For each $n\in{\mathbb{N}}$ , let $F_{n}:=\{x\in U:\varphi(x)\leq n\}$ . Then each set $F_{n}$ is closed and $U=\bigcup_{n\in{\mathbb{N}}}F_{n}$ . Since $X$ is a Banach space, $U$ is of the second Baire category. Thus, there exists an $n_{0}\in{\mathbb{N}}$ such that $\mathrm{int}(F_{n_{0}})\not=\varnothing$ . In particular, there exists an $x_{0}\in F_{n_{0}}$ and a $\delta_{0}>0$ such that $B[x_{0},\delta_{0}]\subseteq F_{n_{0}}$ . Therefore, by Theorem 3.24, $B[x_{0},\delta_{0}]=x_{0}+\delta_{0}B_{X}$ is compact with respect to the weak topology, and hence so is $B_{X}$ . The result now follows from Theorem 2.26. $\Box$

For any nonempty bounded subset $A$ of a Banach space $(X,\|\cdot\|)$ and any $x^{*}\in X^{*}$ we shall denote by, $\sup(x^{*},A):=\sup\{x^{*}(a):a\in A\}$ and by $\inf(x^{*},A):=\inf\{x^{*}(a):a\in A\}$ .

Lemma 3.26 ([30]).

Let $(Y,\|\cdot\|)$ be a Banach space and $C$ be a nonempty bounded subset of $Y\times{\mathbb{R}}$ , endowed with the norm $\|(y,r)\|_{1}:=\|y\|+|r|$ . If for every $x^{*}\in Y^{*}$ , $\max\{(x^{*},-1)(y,s):(y,s)\in C\}$ exists then $C$ is relatively weakly compact.

Proof.

Let $\pi:Y\times{\mathbb{R}}\to Y$ be defined by $\pi(y,r):=y$ , $A:=\pi(C)$ and $f:Y\to{\mathbb{R}}\cup\{\infty\}$ be defined by,

[TABLE]

Then $f$ is a proper function on $Y$ and $x^{*}-f$ attains it maximum for every $x^{*}\in Y^{*}$ . To see this, consider the following. Let $x^{*}\in Y^{*}$ , then

[TABLE]

Therefore, by Theorem 3.24, for each $a\in{\mathbb{R}}$ , $S(a):=\{(y,s)\in Y\times{\mathbb{R}}:f(y)\leq s\leq a\}$ is relatively weakly compact. Since $C$ is bounded there exists an $a\in{\mathbb{R}}$ such that $C\subseteq S(a)$ . $\Box$

Theorem 3.27 ([30]).

Let $(X,\|\cdot\|)$ be a Banach space and let $A$ and $B$ be bounded, closed and convex sets with $\mbox{dist}(A,B)>0$ . If every $x^{*}\in X^{*}$ with $\sup(x^{*},B)<\inf(x^{*},A)$ attains its infimum on $A$ and its supremum on $B$ , then both $A$ and $B$ are weakly compact.

Proof.

To show that both $A$ and $B$ are weakly compact it is sufficient (and necessary) to show that $B-A$ is weakly compact. This will be our approach. From the hypotheses it follows that if $C:=\overline{B-A}$ , then $C$ is a bounded nonempty closed and convex subset of $X$ with $0\not\in C$ . Furthermore, it follows that each $x^{*}\in X^{*}$ with $\sup(x^{*},C)<0$ attains it supremum on $C$ . Choose $y^{*}\in X^{*}$ such that $\sup(y^{*},C)<0$ . Note that such a functional exists by the Hahn-Banach theorem. Let $Y:=\mbox{ker}(y^{*})$ and choose $x_{0}\in C$ . Define $S:Y\times{\mathbb{R}}\to X$ by, $S(y,r):=y+rx_{0}$ and let us consider $Y\times{\mathbb{R}}$ endowed with the norm $\|(y,r)\|_{1}:=\|y\|+|r|$ . Then $S$ is an isomorphism and there exists an $0<\varepsilon$ such that $S^{-1}(C)\subseteq\{(y,r)\in Y\times{\mathbb{R}}:\varepsilon\leq r\}$ . Moreover, each $(x^{*},r)\in(Y\times{\mathbb{R}})^{*}$ with $\sup((x^{*},r),S^{-1}(C))<0$ attains its supremum over $S^{-1}(C)$ . Let $\pi:Y\times{\mathbb{R}}\to Y$ be defined by $\pi(y,r):=y$ , $A:=\pi(S^{-1}(C))$ and $f:Y\to{\mathbb{R}}\cup\{\infty\}$ be defined by,

[TABLE]

Next, we define $T:Y\times({\mathbb{R}}\setminus\{0\})\to Y\times({\mathbb{R}}\setminus\{0\})$ by $T(y,s):=s^{-1}(y,-1)$ . Then $T$ is a bijection. In fact, $T$ is a homeomorphism when $Y\times({\mathbb{R}}\setminus\{0\})$ is considered with the relative weak topology. Let $f^{*}:Y^{*}\to{\mathbb{R}}$ be defined by,

[TABLE]

It is routine to check that $f^{*}$ is real-valued and convex on $Y^{*}$ . To show that $C$ is weakly compact it is sufficient to show that $T(S^{-1}(C))$ is a relatively weakly compact subset of $Y\times{\mathbb{R}}$ . To achieve this we appeal to Lemma 3.26. First note that $T(S^{-1}(C))$ is a nonempty bounded subset of $Y\times{\mathbb{R}}$ . Then consider any $x^{*}\in Y^{*}$ . We consider two cases.

Case (I) Suppose that for every $0<\lambda$ , $f^{*}(\lambda x^{*})\leq-\lambda$ . Then $x^{*}(y)-\lambda^{-1}f(y)\leq-1$ for all $y\in Y$ and all $0<\lambda$ . In particular, $-\lambda^{-1}f(0)\leq-1$ for all $0<\lambda$ , i.e., $\lambda\leq f(0)$ for all $0<\lambda$ . On the other hand, $S(0,1)=x_{0}\in C$ , i.e., $(0,1)\in S^{-1}(C)$ and so $f(0)\leq 1$ . Thus, Case (I) does not occur.

Case(II) Suppose that for some $0<\lambda$ , $-\lambda<f^{*}(\lambda x^{*})$ . Then, since the mapping, $\lambda^{\prime}\mapsto f^{*}(\lambda^{\prime}x^{*})$ , is real-valued and convex, it is continuous. Furthermore, it follows from the intermediate value theorem applied to the function $g:[0,\lambda]\to{\mathbb{R}}$ , defined by,

[TABLE]

that there exists a $0<\mu<\lambda$ such that $g(\mu)=0$ , i.e., $f^{*}(\mu x^{*})=-\mu$ , since

[TABLE]

Thus, $\mu(x^{*},-1)=(\mu x^{*},f^{*}(\mu x^{*}))$ and so $f^{*}(\mu x^{*})=\sup((\mu x^{*},-1),S^{-1}(C))=-\mu<0$ .

Choose $(z,s)\in S^{-1}(C)$ such that $(\mu x^{*},-1)(z,s)=\sup((\mu x^{*},-1),S^{-1}(C))=f^{*}(\mu x^{*})$ . Note that $z\in A$ and $s=f(z)$ . We claim that $(x^{*},-1)$ attains its maximum value over $T(S^{-1}(C))$ at $T(z,f(z))=f(z)^{-1}(z,-1)$ . Now,

[TABLE]

On the other hand, if $(y,s)\in S^{-1}(C)$ then

[TABLE]

since $f(y)\leq s$ . This completes the proof. $\Box$

Remark 3.28.

It might be interesting to note the following: If $(X,\|\cdot\|)$ is a Banach space, $A$ and $B$ are nonempty bounded, closed and convex sets such that every $x^{*}\in X^{*}$ with $\inf(x^{*},A)<\sup(x^{*},B)$ attains its infimum on $A$ and its supremum on $B$ , then both $A$ and $B$ are weakly compact. To see this, note that $C:=\mbox{co}[\{0\}\cup\overline{B-A}]$ is a closed and bounded convex subset of $X$ with the property that every continuous linear function attains it supremum over $C$ .

A special case of the previous theorem was given in [4].

Example 3.29.

Let $(X,\|\cdot\|)$ be a non-trivial normed linear space. Then there exists an equivalent norm $|\!|\!|\cdot|\!|\!|$ on $X$ and a nonempty open subset $U$ of $X^{*}$ such that every member of $U$ attains its norm on $(X,|\!|\!|\cdot|\!|\!|)$ .

Proof.

Choose $x_{0}\in X$ with $\|x_{0}\|=2$ . Then, by the Hahn-Banach theorem, there exists a continuous linear functional $x^{*}\in S_{X^{*}}$ such that $x^{*}(x_{0})=2$ . Let $U:=\{y^{*}\in X^{*}:\|y^{*}-x^{*}\|<1/3\}$ and let $B:=\mathrm{co}(B_{X}\cup\{x_{0},-x_{0}\})$ . Then $B$ is convex, bounded, symmetric and $0\in\mathrm{int}(B)$ . Therefore, $B$ is the closed unit ball of some equivalent norm $|\!|\!|\cdot|\!|\!|$ on $X$ . Furthermore, every member of $U$ attains its maximum value over $B$ at $x_{0}$ . Indeed, if $y^{*}\in U$ then

[TABLE]

On the other hand, for any $x\in B_{X}$ ,

[TABLE]

and $y^{*}(-x_{0})=-y^{*}(x_{0})=-2<y^{*}(x_{0})$ . Therefore, $y^{*}$ attains its maximum value over $B$ at $x_{0}$ . $\Box$

Together, Example 3.29 and Theorem 3.23 give rise to the following conjecture.

Conjecture 3.30.

Let $(X,\|\cdot\|)$ be a Banach space. If there exists a weak open subset $U$ of $X^{*}$ such that $\varnothing\not=S_{X^{*}}\cap U$ and every member of $S_{X^{*}}\cap U$ attains its norm on $X$ , then $X$ is reflexive.

A special case of this conjecture was proven in [21]. For some further results in this direction see [6].

4 Convex analysis and minimal uscos

In this section we prove a generalisation of James’ weak compactness theorem. Unfortunately, to achieve this generalisation we will need to take an excursion into convex analysis and set-valued analysis. Hopefully, some of the results along the way are of some interest in their own right.

4.1 Convex functions and monotone operators

We shall need the following very important fact regarding the continuity of convex functions.

Proposition 4.1 ([39, Proposition 1.6]).

Let $U$ be a nonempty open convex subset of a Banach space $(X,\|\cdot\|)$ and let $\varphi:U\rightarrow\mathbb{R}$ be a convex function. If $\varphi$ is locally bounded above on $U$ , that is, for every $x_{0}\in U$ there exists an $M>0$ and a $\delta>0$ such that $B(x_{0},\delta)\subseteq U$ and $\varphi(x)\leq M$ for all $x\in B(x_{0},\delta)$ , then it is locally Lipschitz on $U$ ; that is, for every $x_{0}\in U$ , there exists an $L>0$ and $\delta>0$ such that $B(x_{0},\delta)\subseteq U$ and

[TABLE]

for all $x,y\in B(x_{0},\delta)$ .

Proof.

Let $x_{0}\in U$ . Choose $M^{*}>0$ and $\delta>0$ such that $B(x_{0},2\delta)\subseteq U$ and $\varphi(x)\leq M^{*}$ for all $x\in B(x_{0},2\delta)$ . Then for all $x\in B(x_{0},\delta)$ we have that $2x_{0}-x=x_{0}-(x-x_{0})\in B(x_{0},\delta)$ and $x_{0}=(1/2)(2x_{0}-x)+(1/2)x$ . Hence,

[TABLE]

so $-\varphi(x)\leq M^{*}+2|\varphi(x_{0})|$ ; that is, $|\varphi(x)|\leq(M^{*}+2|\varphi(x_{0})|)=:M^{\prime}$ for all $x\in B(x_{0},\delta)$ . So $|\varphi|$ is bounded by $M^{\prime}$ on $B(x_{0},\delta)$ . Let $\delta^{\prime}:=\delta/2$ . If $x$ and $y$ are distinct points in $B(x_{0},\delta^{\prime})$ , let $\alpha:=\|x-y\|$ and let $z:=y+(\delta/\alpha)(y-x)$ . Note that $z\in B(x_{0},2\delta^{\prime})$ . Since $y=[\alpha/(\alpha+\delta^{\prime})]z+[\delta^{\prime}/(\alpha+\delta^{\prime})]x$ is a convex combination (lying in $B(x_{0},2\delta^{\prime})$ ), we have that $\varphi(y)\leq[\alpha/(\alpha+\delta^{\prime})]\varphi(z)+[\delta^{\prime}/(\alpha+\delta^{\prime})]\varphi(x)$ and so

[TABLE]

Interchanging $x$ and $y$ gives the desired result, with $M:=2M^{\prime}/\delta^{\prime}$ . $\Box$

Suppose that $f:C\to{\mathbb{R}}$ is a convex function defined on a nonempty convex subset of a normed linear space $(X,\|\cdot\|)$ and $x\in C$ . Then we define the subdifferential $\partial f(x)$ by,

[TABLE]

We can also define the subdifferential in terms of the right-hand derivative of $f$ . Suppose that $f:U\to{\mathbb{R}}$ is a convex function defined on a nonempty open convex subset $U$ of a normed linear space $(X,\|\cdot\|)$ . Let $x_{0}\in U$ and let $v\in X$ . Then the right-hand directional derivative of $f$ , at the point $x_{0}\in U$ , in the direction $v$ , is defined to be

[TABLE]

Now there is a subtlety that we have overlooked. Namely, how do we know if the limit exists? Well, if we revisit Lemma 3.1, then we can see why. So suppose $f$ , $x_{0}$ and $v\not=0$ are as in the definition of $f_{+}^{\prime}(x_{0};v)$ and suppose that $0<\beta$ and $0<\beta^{\prime}$ Then,

[TABLE]

Therefore, $t\mapsto\frac{f(x_{0}+tv)-f(x_{0})}{t}$ is an increasing function over $(0,\delta)$ for some $\delta>0$ small enough so that $x_{0}+tv\in U$ whenever $0<t<\delta$ . Since one can also use Lemma 3.1 to show that

[TABLE]

we see that the limit in the definition of the right-hand directional derivative always exists.

We can now give the basic properties of the subdifferential mapping $x\mapsto\partial\varphi(x)$ .

Lemma 4.2 ([39, Proposition 1.11]).

Let $U$ be a nonempty open and convex subset of a normed linear space $(X,\|\cdot\|)$ and let $\varphi:U\rightarrow\mathbb{R}$ be a continuous convex function. If $x_{0}\in U$ then $\partial\varphi(x_{0})\not=\varnothing$ .

Proof.

Let $x_{0}\in U$ and define $p:X\to{\mathbb{R}}$ by, $p(x):=f_{+}^{\prime}(x_{0};x)$ for all $x\in X$ . Note that $p$ is well-defined. Let $0<\mu<\infty$ and let $x\in X$ then

[TABLE]

So $p$ is positively homogeneous on $X$ . Next, choose $\delta>0$ such that $B[x_{0},\delta]\subseteq U$ . We claim that $p$ is convex on $B[0,\delta]$ . Fix $n\in{\mathbb{N}}$ and define $p_{n}:B[0,\delta]\to{\mathbb{R}}$ by,

[TABLE]

Since, $x\mapsto x_{0}+(1/n)x$ , is an affine map, $x\mapsto\varphi(x_{0}+(1/n)x)$ , is convex, and so $p_{n}$ is also convex. Now, $p(x)=\lim_{n\to\infty}p_{n}(x)$ for each $x\in B[0,\delta]$ . Therefore, $p|_{B[0,\delta]}$ is convex, as the pointwise limit of convex functions is again convex. Since $p$ is also positively homogeneous on $X$ it is an easy exercise to show that $p$ is sublinear on $X$ .

Let $y_{0}$ be any element of $S_{X}$ and define $f:\mathrm{span}\{y_{0}\}\to{\mathbb{R}}$ by, $f(\lambda y_{0}):=\lambda p(y_{0})$ for all $\lambda\in{\mathbb{R}}$ . Then $f(\lambda y_{0})=\lambda p(y_{0})=p(\lambda y_{0})\leq p(\lambda y_{0})$ for all $0<\lambda<\infty$ . Now, fix $0<\lambda<\infty$ , then

[TABLE]

Therefore, $(-\lambda)p(y_{0})=-p(\lambda y_{0})\leq p((-\lambda)y_{0})$ . Thus,

[TABLE]

Hence, $f(\lambda y_{0})\leq p(\lambda y_{0})$ for all $\lambda\in{\mathbb{R}}$ . Thus, by the Hahn-Banach Theorem (Theorem 2.11) there exists a linear functional $F:X\to{\mathbb{R}}$ such that $F(x)\leq p(x)$ for all $x\in X$ . Note also, that by Proposition 4.1 and the definition of $p$ , there exists an $L>0$ such that $F(x)\leq p(x)\leq L\|x\|$ for all $x\in X$ . Thus, $F\in X^{*}$ . We claim that $F\in\partial\varphi(x_{0})$ . To see this, let $x\in U$ then

[TABLE]

This completes the proof. $\Box$

Proposition 4.3 ([39, Proposition 1.11]).

Let $U$ be a nonempty open and convex subset of a normed linear space $(X,\|\cdot\|)$ and let $\varphi:U\rightarrow\mathbb{R}$ be a continuous convex function. If $x_{0}\in U$ , then $\partial\varphi(x_{0})$ is a weak∗-compact convex subset of $X^{*}$ . Moreover, the map $x\mapsto\partial\varphi(x)$ is locally bounded at $x_{0}$ . That is, there exists an $L>0$ and a $\delta>0$ such that $B(x_{0},\delta)\subseteq U$ and $\|x^{*}\|\leq M$ whenever $x\in B(x_{0},\delta)$ and $x^{*}\in\partial\varphi(x)$ .

Proof.

For each $x\in U$ , let $F_{x}:=\{x^{*}\in X^{*}:x^{*}(x-x_{0})\leq\varphi(x)-\varphi(x_{0})\}=(\widehat{x-x_{0}})^{-1}(-\infty,\varphi(x)-\varphi(x_{0})]$ . Thus, each set $F_{x}$ is weak∗ closed and convex. Now, $\partial\varphi(x_{0})=\bigcap_{x\in U}F_{x}$ . Therefore, $\partial\varphi(x_{0})$ is weak∗ closed and convex. Let us now show that, $x\mapsto\partial\varphi(x)$ , is locally bounded at $x_{0}$ (Note: this will then automatically show that $\partial\varphi(x)$ is weak∗ compact, by Theorem 2.23). By Proposition 4.1, there exists a $L>0$ and a $\delta>0$ such that $B(x_{0},\delta)\subseteq U$ and $|\varphi(x)-\varphi(y)\|\leq L\|x-y\|$ for all $x,y\in B(x_{0},\delta)$ . We claim that $\|x^{*}\|\leq L$ whenever $x\in B(x_{0},\delta)$ and $x^{*}\in\partial\varphi(x)$ . To this end, let $x\in B(x_{0},\delta)$ and $x^{*}\in\partial\varphi(x)$ . Let $v\in S_{X}$ and choose $0<\mu$ such that $x+\mu v\in B(x_{0},\delta)$ . Then,

[TABLE]

Thus, $\|x^{*}\|\leq L$ . Note: we used here the simple fact that if $x^{*}(v)\leq L$ for all $v\in S_{X}$ then $\|x^{*}\|\leq L$ . $\Box$

One of the most important features of the subdifferential mapping of a convex function is that it belongs to a much studied class of set-valued mappings called “monotone operators”.

Let $T:X\to 2^{X^{*}}$ be a set-valued mapping from $(X,\|\cdot\|)$ be a normed linear space into subsets of its dual $X^{*}$ . $T$ is said to be a monotone operator provided $(x^{*}-y^{*})(x-y)\geq 0$ whenever $x,y\in X$ and $x^{*}\in T(x)$ , $y^{*}\in T(y)$ .

Proposition 4.4 ([39, Example 2.2]).

If $\varphi:U\rightarrow\mathbb{R}$ be a continuous convex function defined on a nonempty open convex subset $U$ of a normed linear space $(X,\|\cdot\|)$ then $T:X\to 2^{X^{*}}$ defined by,

[TABLE]

is a monotone operator on $X$ .

Proof.

Let $x^{*},y^{*}\in X^{*}$ and suppose that $x^{*}\in T(x)$ and $y^{*}\in T(y)$ for some $x,y\in X$ . Then $x,y\in U$ since $T(x)\not=\varnothing$ and $T(y)\not=\varnothing$ . In fact, $T(x)=\partial\varphi(x)$ and $T(y)=\partial\varphi(y)$ . Therefore,

[TABLE]

If we add these two inequalities together we get $(x^{*}-y^{*})(y-x)\leq 0$ and so $(x^{*}-y^{*})(x-y)\geq 0$ . Hence, $T$ is indeed a monotone operator. $\Box$

4.2 Minimal Uscos

In order to prove our final “convex analysts” proof of James’ theorem, we will need to briefly consider some notions from set-valued analysis.

A set-valued mapping $\varphi$ from a topological space $A$ into subsets of a topological space $(X,\tau)$ is $\tau$ -upper semicontinuous at a point $x_{0}\in A$ if for each $\tau$ -open set $W$ in $X$ , containing $\varphi(x_{0})$ , there exists an open neighbourhood $U$ of $x_{0}$ such that $\varphi(U)\subseteq W$ . If $\varphi$ is $\tau$ -upper semicontinuous at each point of $A$ then we say that $\varphi$ is $\tau$ -upper semicontinuous on $A$ . In the case when $\varphi$ also has nonempty compact images then we call $\varphi$ a $\tau$ -usco mapping. Finally, if $(X,\tau)$ is a linear topological space then we call a $\tau$ -usco mapping into convex subsets of $X$ a $\tau$ -cusco mapping.

Our interest in cusco mappings is revealed in the next proposition.

Proposition 4.5 ([39, Proposition 2.5]).

If $\varphi:U\to{\mathbb{R}}$ is a continuous convex function defined on a nonempty open convex subset $U$ of a normed linear space $(X,\|\cdot\|)$ , then the subdifferential mapping, $x\mapsto\partial\varphi(x)$ , is a weak∗-cusco on $U$ .

Proof.

It follows from Lemma 4.2 and Proposition 4.3 that we need only show that, $x\mapsto\partial\varphi(x)$ , is weak∗-upper semicontinuous on $U$ . So suppose, in order to obtain a contradiction, that $\partial\varphi$ is not weak∗ upper semicontinuous at some point $x_{0}\in U$ . Then there exists a weak∗ open subset $W$ of $X^{*}$ , containing $\partial\varphi(x_{0})$ , such that for every $0<\delta$ , $\partial\varphi(B(x_{0},\delta))\not\subseteq W$ . Therefore, in particular, there exist sequences $(x_{n}:n\in{\mathbb{N}})$ in $U$ and $(x^{*}_{n}:n\in{\mathbb{N}})$ in $X^{*}$ such that $\lim_{n\to\infty}x_{n}=x_{0}$ and $x^{*}_{n}\in\partial\varphi(x_{n})\setminus W$ . Furthermore, by Proposition 4.3, we can assume that the sequence $(x^{*}_{n}:n\in{\mathbb{N}})$ is norm bounded in $X^{*}$ . Hence, by the Banach-Alaoglu Theorem (Theorem 2.23), the sequence $(x^{*}_{n}:n\in{\mathbb{N}})$ has a weak∗ cluster-point $x_{\infty}^{*}$ , which must lie in $X^{*}\setminus W$ . We will obtain our desired contradiction by showing that $x_{\infty}^{*}\in\partial\varphi(x_{0})\subseteq W$ . To this end, fix $x\in U$ and $\varepsilon>0$ . Since $\varphi$ is continuous at $x_{0}$ there exists an $N\in{\mathbb{N}}$ such that $|\varphi(x_{n})-\varphi(x_{0})|<\varepsilon$ for all $n>N$ . Let $n>N$ then,

[TABLE]

Therefore, $x_{\infty}^{*}(x-x_{0})=(\widehat{x-x_{0}})(x_{\infty}^{*})\leq[\varphi(x)-\varphi(x_{0})]+\varepsilon$ . Since $\varepsilon>0$ was arbitrary, we have that $x_{\infty}^{*}(x-x_{0})\leq\varphi(x)-\varphi(x_{0})$ . Since $x\in U$ was arbitrary, we have that $x_{\infty}^{*}\in\partial\varphi(x_{0})$ , as desired. $\Box$

Among the class of usco (cusco) mappings, special attention is given to the so-called minimal usco (minimal cusco) mappings.

An usco (cusco) from a topological space $A$ into subsets of a topological space $X$ (linear topological space $X$ ) is said to be a minimal usco (minimal cusco) if its graph does not contain, as a proper subset, the graph of any other usco (cusco) on $A$ .

It is not immediately obvious from this definition that there are any interesting minimal usco mappings at all, apart from single-valued continuous functions (e.g. $f:A\to X$ ), which are trivially minimal uscos once one replaces $f(x)$ with $\{f(x)\}$ - to make them set-valued mappings. So our first task is to show that there are always many minimal uscos.

Proposition 4.6 ([5]).

Suppose that $(X,\tau)$ and $(Y,\tau^{\prime})$ are topological spaces and $\varphi:X\to 2^{Y}$ is an usco on $X$ . If $(Y,\tau^{\prime})$ is Hausdorff then there exists a minimal usco mapping $\Psi:X\to 2^{Y}$ such that $\mathrm{Gr}(\Psi)\subseteq\mathrm{Gr}(\varphi)$ (i.e., every usco contains a minimal usco).

Proof.

Let $\mathcal{U}$ denote the family of all usco mappings defined on $X$ whose graphs are contained in the graph of $\varphi$ . Obviously $\mathcal{U}\neq\varnothing$ as the mapping $\varphi$ is contained in $\mathcal{U}$ . We may now partially order $\mathcal{U}$ as follows. If $\Psi_{1}$ and $\Psi_{2}$ are members of $\mathcal{U}$ , then we write $\Psi_{1}\leq\psi_{2}$ if $\Psi_{1}(x)\subseteq\Psi_{2}(x)$ for each $x\in X$ . Next, we apply Zorn’s lemma to show that $(\mathcal{U},\leq)$ possesses a minimal element. To this end, let $\{\Psi_{\gamma}:\gamma\in\Gamma\}$ be a totally ordered subset of $\mathcal{U}$ and let $\varphi_{M}:X\rightarrow 2^{Y}$ be defined by, $\varphi_{M}(x):=\bigcap\{\Psi_{\gamma}(x):\gamma\in\Gamma\}$ . Since each $\Psi_{\gamma}(x)$ is nonempty and compact, $\varphi_{M}(x)$ too is nonempty and compact. Let $W$ be an open subset of $Y$ and consider $U:=\{x\in X:\varphi_{M}(x)\subseteq W\}$ . We need to show that $U$ is open in $X$ . We may, without loss of generality, assume that $U\neq\varnothing$ and consider $x_{0}\in U$ . By the finite intersection property, there exists some $\gamma_{0}\in\Gamma$ such that $\Psi_{\gamma_{0}}(x_{0})\subseteq W$ . Hence there exists an open neighbourhood $U_{0}$ of $x_{0}$ such that $\Psi_{\gamma_{0}}(U_{0})\subseteq W$ , which means that $\varphi_{M}(U_{0})\subseteq W$ . Therefore $x_{0}\in U_{0}\subseteq U$ and so $U$ is open in $X$ . From this, it follows that $\varphi_{M}\in\mathcal{U}$ and $\varphi_{M}\leq\Psi_{\gamma}$ for each $\gamma\in\Gamma$ . Thus, by Zorn’s lemma, $(\mathcal{U},\leq)$ possesses a minimal element. It is now easy to see that this element is in fact a minimal usco. $\Box$

A similar argument shows that every cusco contains a minimal cusco. However, there is a much more concrete supply of minimal cuscos.

Proposition 4.7.

Let $\varphi:A\to 2^{X^{*}}$ be a weak∗-cusco defined on a nonempty open subset $A$ of a normed linear space $(X,\|\cdot\|)$ . If the mapping $T:X\to 2^{X^{*}}$ defined by, $T(x):=\varphi(x)$ if $x\in A$ and by $T(x):=\varnothing$ if $x\in X\setminus A$ , is a monotone operator, then $\varphi$ is a minimal weak∗-cusco.

Proof.

Suppose, in order to obtain a contradiction, that $\varphi$ is not a minimal weak∗-cusco. Then there exists a weak∗-cusco $\Psi:A\to 2^{X^{*}}$ such that $\Psi(x)\subseteq\varphi(x)$ for all $x\in A$ , but $\Psi(x_{0})\not=\varphi(x_{0})$ for some $x_{0}\in A$ . Choose $x_{0}^{*}\in\varphi(x_{0})\setminus\Psi(x_{0})=T(x_{0})\setminus\Psi(x_{0})$ . By the Separation Theorem (Theorem 2.17), applied in $(X^{*},\mathrm{weak}^{*})$ , there exists a $y\in X$ such that $\sup_{y^{*}\in\Psi(x_{0})}\widehat{y}(y^{*})<\widehat{y}(x_{0}^{*})$ . Let $W:=\{x^{*}\in X^{*}:\widehat{y}(x^{*})<\widehat{y}(x_{0}^{*})\}$ . Then $W$ is a weak∗-open subset of $X^{*}$ , containing $\Psi(x_{0})$ . Therefore, there exists an open neighbourhood $U\subseteq A$ of $x_{0}$ such that $\Psi(U)\subseteq W$ . Choose $0<t<\infty$ such that $x_{0}+ty\in U$ . Let $y^{*}\in\Psi(x_{0}+ty)\subseteq\varphi(x_{0}+ty)=T(x_{0}+ty)$ . Since $T$ is a monotone operator, $x^{*}_{0}\in T(x_{0})$ and $y^{*}\in T(x_{0}+ty)$ , we have that:

[TABLE]

which implies that $y^{*}(y)\geq x_{0}^{*}(y)$ . However, this contradicts the fact that $y^{*}\in W$ , i.e., $\widehat{y}(y^{*})<\widehat{y}(x_{0}^{*})$ . Thus, $\varphi$ must be a minimal weak∗-cusco on $A$ . $\Box$

Corollary 4.8.

If $\varphi:U\to{\mathbb{R}}$ is a continuous convex function defined on a nonempty open convex subset $U$ of a normed linear space $(X,\|\cdot\|)$ , then the subdifferential mapping, $x\mapsto\partial\varphi(x)$ , is a minimal weak∗-cusco on $U$ .

Proof.

By Proposition 4.5 we have that, $x\mapsto\partial\varphi(x)$ , is a weak∗-cusco on $U$ . So the result follows from Proposition 4.4 and Proposition 4.7. $\Box$

We will end our detour into set-valued analysis by giving two more results concerning uscos. The first one shows that minimal usco behave a lot like quasi-continuous mappings, while the last result shows how to convert an usco into a cusco.

Proposition 4.9 ([5]).

Let $\varphi:A\to 2^{X}$ be a minimal $\tau$ -usco acting from a topological space $A$ into nonempty subsets of a topological space $(X,\tau)$ . Then, for every pair of open subsets $U$ of $A$ and $W$ of $X$ such that $\varphi(U)\cap W\not=\varnothing$ , there exists a nonempty open subset $V$ of $U$ such that $\varphi(V)\subseteq W$ .

Proof.

Let $U$ be an open subset of $A$ and let $W$ be an open subset of $X$ such that $\varphi(U)\cap W\not=\varnothing$ . We consider two cases.

Case(I): If there exists a $x\in U$ such that $\varphi(x)\subseteq W$ , then the result follows directly from the $\tau$ -upper semicontinuity of $\varphi$ .

Case(II): Suppose that for each $x\in U$ , $\varphi(x)\not\subseteq W$ . Let $\Psi:A\to 2^{X}$ be defined by, $\Psi(x):=\varphi(x)\cap(X\setminus W)$ if $x\in U$ and by $\Psi(x):=\varphi(x)$ if $x\not\in U$ . Then, by assumption, $\Psi$ has nonempty compact images. In fact, we claim that $\Psi$ is a $\tau$ -usco on $A$ . To show this, we need only show that $\Psi$ is $\tau$ -upper semicontinuous. Let $x_{0}\in A$ and let $W^{\prime}$ be a $\tau$ -open set in $X$ containing $\Psi(x_{0})$ . If $x_{0}\not\in U$ then clearly there exists an open neighbourhood $U$ of $x_{0}$ such that $\Psi(U)\subseteq W^{\prime}$ since, in this case, $\varphi(x_{0})=\Psi(x_{0})\subseteq W^{\prime}$ and $\Psi(x)\subseteq\varphi(x)$ for all $x\in A$ . So we are left to consider the case when $x_{0}\in U$ . Suppose $x_{0}\in U$ . Then $\varphi(x_{0})\subseteq W^{\prime}\cup W$ , since $\varphi(x_{0})\cap(X\setminus W)=\Psi(x_{0})\subseteq W^{\prime}$ . Since $\varphi$ is $\tau$ -upper semicontinuous there exists an open neighbourhood $U$ of $x_{0}$ such that $\varphi(U)\subseteq W^{\prime}\cup W$ . Therefore,

[TABLE]

This shows that $\Psi$ is an $\tau$ -usco. Since, $\varphi$ is a minimal $\tau$ -usco, we must have that $\varphi=\Psi$ , but then $\varphi(U)=\Psi(U)\subseteq(X\setminus W)$ , which contradicts our original assumption that $\varphi(U)\cap W\not=\varnothing$ . Therefore, Case(II) does not occur, and so the result follows from Case(I). $\Box$

Proposition 4.10 ([22, 39]).

Suppose that $\varphi:A\to 2^{X}$ is a $\tau$ -usco acting from a topological space $A$ into nonempty subsets of a locally convex space $(X,+,\cdot,\tau)$ . If for each $t\in A$ , $\overline{\mbox{co}}^{\tau}\varphi(t)$ is a compact subset of $X$ , then the mapping $\Psi:A\to 2^{X}$ defined by, $\Psi(t):=\overline{\mbox{co}}^{\tau}\varphi(t)$ for all $t\in A$ , is a $\tau$ -cusco on $A$ .

Proof.

Clearly, $\Psi$ has nonempty, compact convex images. So it is sufficient to show that $\Psi$ is $\tau$ -upper semicontinuous on $A$ . Let $x_{0}\in A$ and let $W$ be a $\tau$ -open subset of $X$ , containing $\Psi(x_{0})$ . Since vector addition is continuous, for each $x\in\Psi(x_{0})$ there exist $\tau$ -open convex neighbourhoods $U_{x}$ of $x$ and $V_{x}$ of [math] such that $x=x+0\subseteq U_{x}+V_{x}\subseteq W$ . Since linear topological spaces are also regular we can assume, by possibly making $V_{x}$ smaller, that $U_{x}+\overline{V_{x}}^{\tau}\subseteq W$ . Now, $\{U_{x}:x\in\Psi(x_{0})\}$ is an open cover of $\Psi(x_{0})$ . Therefore, there exists a finite subcover $\{U_{x_{k}}:1\leq k\leq n\}$ of $\{U_{x}:x\in\Psi(x_{0})\}$ . Let $V:=\bigcap_{1\leq k\leq n}V_{x_{k}}$ . Then $V$ is a convex open neighbourhood of [math] and futhermore,

[TABLE]

Since $\varphi(x_{0})\subseteq\Psi(x_{0})+V$ , which is $\tau$ -open, there exists an open neighbourhood $U$ of $x_{0}$ such that $\varphi(U)\subseteq\Psi(x_{0})+V$ . Let $x\in U$ . Then

[TABLE]

Here we used the fact that the sum of a closed set with a compact set is closed. $\Box$

4.3 A generalisation of James’ Theorem

By making an obvious modification to Corollary 3.13, we obtain the following lemma.

Lemma 4.11.

Let $\varphi:A\to{\mathbb{R}}$ be a $\tau$ -continuous convex function defined on a nonempty convex subset $A$ of a locally convex space $(X,\tau)$ and let $\tau^{\prime}$ is a Hausdorff locally convex topology on $X$ such that (i) $\tau^{\prime}\subseteq\tau$ and (ii) $\varphi$ is $\tau^{\prime}$ -lower semicontinuous. If $T$ is a nonempty $\tau^{\prime}$ -closed and convex subset of $X$ and $S$ is any $\tau$ -separable subset of $A$ such that $S-T\subseteq A$ then, for every sequence $\widetilde{x}:=(x_{n}:n\in\mathbb{N})$ in $T$ , there exists a subsequence, $\widetilde{x}|_{J}$ , of $\widetilde{x}$ such that $\varphi(y-aK_{\tau^{\prime}}(\widetilde{x}|_{J}))$ is at most a singleton for all $y\in S$ and all $a\in[0,1]$ .

The following lemma shows us that in Theorem 4.13 we get the weak∗ lower semicontinuity of $\varphi$ for free.

Lemma 4.12.

Let $(X,\|\cdot\|)$ be a Banach space and let $A$ be a nonempty, open, convex subset of $X^{*}$ . If $\varphi:A\rightarrow\mathbb{R}$ is a continuous, convex function and $\partial\varphi(x^{*})\cap\widehat{X}\neq\varnothing$ for all $x^{*}\in A$ , then $\varphi$ is weak∗-lower-semicontinuous on $A$ .

Proof.

Let $x^{*}_{0}\in A$ and let $\varepsilon>0$ . Then, there exists an $\widehat{x}\in\partial\varphi(x_{0}^{*})\cap\widehat{X}$ . Define $h:A\rightarrow\mathbb{R}$ to be

[TABLE]

Then observe that, since $\widehat{x}\in\partial\varphi(x_{0}^{*})$ , we have $h(x^{*})\leq\varphi(x^{*})$ for all $x^{*}\in A$ . Now the set

[TABLE]

is a weak∗-open neighbourhood of $x^{*}_{0}$ , and for all $x^{*}\in U$ , we have that

[TABLE]

Therefore, $\varphi$ is weak∗-lower-semicontinuous at $x^{*}_{0}$ . Since $x^{*}_{0}$ was arbitrary, we conclude that $\varphi$ is weak∗-lower-semicontinuous on $A$ . $\Box$

At last, we can present our “convex analysts” version of James’ weak compactness theorem.

Theorem 4.13.

Let $(X,\|\cdot\|)$ be a Banach space and let $A$ be a nonempty, open, convex subset of $X^{*}$ . If $\varphi:A\rightarrow\mathbb{R}$ is a continuous, convex function and $\partial\varphi(x^{*})\cap\widehat{X}\neq\varnothing$ for all $x^{*}\in A$ , then $\partial\varphi(x^{*})\subseteq\widehat{X}$ for all $x^{*}\in A$ .

Proof.

Let $x_{0}^{*}\in A$ . Without loss of generality, we may assume that $x_{0}^{*}=0$ . Indeed, if not, we consider the function $\psi:(A-x_{0}^{*})\rightarrow\mathbb{R}$ given by $\psi(x^{*}):=\varphi(x^{*}+x_{0}^{*})$ . Note that $\psi$ is continuous and convex and that $\partial\varphi(x^{*}+x^{*}_{0})=\partial\psi(x^{*})$ for all $x^{*}\in A$ . In particular, $\partial\psi(x^{*})\cap\widehat{X}\neq\varnothing$ for all $x^{*}\in(A-x^{*}_{0})$ and $\partial\varphi(x^{*}_{0})=\partial\psi(0)$ . So, if $x_{0}^{*}\neq 0$ , we can simply translate $\varphi$ and use the argument at 0.

Since $A$ is open and since $x^{*}\mapsto\partial\varphi(x^{*})$ is locally bounded (Proposition 4.3), there exist $m,L>0$ such that $mB_{X^{*}}\subseteq A$ and $\|x^{**}\|\leq L$ for all $x^{**}\in\partial\varphi(B(0,m))$ . Let $(\beta_{n}:n\in\mathbb{N})$ be a sequence of strictly positive numbers such that $\sum_{n=1}^{\infty}\beta_{n}<m/2$ and $\lim_{n\rightarrow\infty}\frac{1}{\beta_{n}}\sum_{i=n+1}^{\infty}\beta_{i}=0$ .

Since $x^{*}\mapsto\partial\varphi(x^{*})$ is a minimal weak∗-cusco, (see, Corollary 4.8) we know that there exists a minimal weak∗-usco, $M:A\rightarrow 2^{X^{**}}$ , such that $M(x^{*})\subseteq\partial\varphi(x^{*})$ for all $x^{*}\in A$ , by Proposition 4.6. In fact, by Proposition 4.10, we know that $\partial\varphi(x^{*})=\overline{\text{co}}^{w^{*}}[M(x^{*})]$ for all $x^{*}\in A$ .

Therefore, to show that $\partial\varphi(0)\subseteq\widehat{X}$ , it suffices to show that $M(0)\subseteq\widehat{X}$ . This is because if $M(0)\subseteq\widehat{X}$ , then $M(0)$ is weakly compact (see Remark 2.26) and then, by the Krein-Smulian Theorem (Corollary 3.18), $\overline{\text{co}}[M(0)]$ is also weakly compact. Since the weak∗ topology is weaker than the weak topology, $\overline{\text{co}}[M(0)]$ is clearly weak∗-compact and hence weak∗-closed. Therefore,

[TABLE]

So suppose, for a contradiction, that $M(0)\not\subseteq\widehat{X}$ . Then there exists an $F\in M(0)\setminus\widehat{X}$ . Since $\widehat{X}$ is a closed subspace of $X^{**}$ , this means there must exist an $0<\varepsilon<\text{dist}(F,\widehat{X})$ .

Part I: Let $f_{0}:=0$ . We inductively create sequences $(f_{n}:n\in\mathbb{N})$ in $S_{X^{*}}$ , $(v_{n}:n\in\mathbb{N})$ in $B(0,m)$ , and $(\widehat{x}_{n}:n\in\mathbb{N})$ in $\widehat{X}$ , such that the statements

•

$(A_{n}):-$ $\|v_{n}\|<m/n$ and $\widehat{x}_{n}\in\partial\varphi(v_{n})$ .

•

$(B_{n}):-$ $(F-\widehat{x}_{n})(f_{j})|\leq\varepsilon/2$ for all $0\leq j<n$ .

•

$(C_{n}):-$ $F(f_{n})>\varepsilon$ and $\widehat{x}_{j}(f_{n})=0$ for all $1\leq j\leq n$ .

are true for all $n\in\mathbb{N}$ . For the first step, choose any $v_{1}\in B(0,m)\subseteq A$ . Then $\partial\varphi(v_{1})\cap\widehat{X}\neq\varnothing$ and so we may choose $\widehat{x}_{1}\in\partial\varphi(v_{1})\cap\widehat{X}$ which clearly satisfies $|(F-\widehat{x}_{1})(f_{0})|=0\leq\varepsilon/2$ . Now note that

[TABLE]

and so, by Lemma 3.8, there exists $f_{1}\in S_{X}$ such that $F(f_{1})>\varepsilon$ and $\widehat{x}_{1}(f_{1})=0$ . So the statements $(A_{1})$ , $(B_{1})$ and $(C_{1})$ hold.

Now fix $k\in\mathbb{N}$ . Suppose that we have created $\{v_{1},\dots,v_{k}\}$ , $\{\widehat{x}_{1},\dots,\widehat{x}_{k}\}$ and $\{f_{1},\dots,f_{k}\}$ such that the statements $(A_{k})$ , $(B_{k})$ and $(C_{k})$ hold true. Then consider the set

[TABLE]

Note that $F\in M(0)\cap W$ and so $M(B(0,\frac{m}{k+1}))\cap W\neq\varnothing$ . Therefore, by the minimality of $M$ and Proposition 4.9, there exists a nonempty open set $V\subseteq B(0,\frac{m}{n+1})$ such that $M(V)\subseteq W$ .

Choose $v_{k+1}\in V$ . Then $\|v_{k+1}\|<m/(k+1)$ . By hypothesis, since $v_{k+1}\in A$ , we have that $\partial\varphi(v_{k+1})\cap\widehat{X}\neq\varnothing$ , and so we may choose $\widehat{x}_{k+1}\in\widehat{X}$ such that

[TABLE]

Thus the statements $(A_{k+1})$ and $(B_{k+1})$ hold. Finally, observe that

[TABLE]

and so, by Lemma 3.8, there exists $f_{k+1}\in S_{X}$ such that $F(f_{k+1})>\varepsilon$ and $\widehat{x}_{j}(f_{k+1})=0$ for all $1\leq j\leq k+1$ . Therefore the statement $(C_{k+1})$ also holds. This completes the induction.

Part II: Now let $(n_{k}:k\in\mathbb{N})$ be a strictly increasing sequence of natural numbers. Then for all $k\in\mathbb{N}$ , define $v^{\prime}_{k}:=v_{n_{k}}$ and $x^{\prime}_{k}:=x_{n_{k}}$ and $f^{\prime}_{k}:=f_{n_{k}}$ . Also define $f^{\prime}_{0}:=0$ . Then the sequences $(v^{\prime}_{n}:n\in\mathbb{N})$ , $(\widehat{x}^{\prime}_{n}:n\in\mathbb{N})$ and $(f^{\prime}_{n}:n\in\mathbb{N})$ still satisfy $(A_{n})$ , $(B_{n})$ and $(C_{n})$ for all $n\in\mathbb{N}$ . Therefore, the properties $(A_{n})$ , $(B_{n})$ and $(C_{n})$ are stable under passing to subsequences.

Now, since $\partial\varphi(x^{*})\cap\widehat{X}\neq\varnothing$ for all $x^{*}\in A$ , we have that $\varphi$ is weak∗-lower-semicontinuous on $A$ , by Lemma 4.12. Let $S:=\frac{m}{2}B_{X^{*}}\cap\text{span}\{f_{n}:n\in\mathbb{N}\}$ and $T:=\frac{m}{2}B_{X^{*}}$ and note that $S-T\subseteq mB_{X}\subseteq A$ . Then by passing to a subsequence and relabelling if necessary, we may assume that the set $\varphi(f-a[(m/2)K_{w^{*}}(f_{n}:n\in\mathbb{N})])$ is a singleton for all $0\leq a\leq 1$ and all $f\in S$ , by Lemma 4.11. Since $(f_{n}:n\in\mathbb{N})$ is a sequence in $B_{X^{*}}$ (which is weak∗-compact), it must a have a weak∗-cluster point, call it $f_{\infty}$ .

Part III: As in Part III of the proof of Theorem 3.9, we can derive that $\widehat{x}_{n}(f_{k}-f_{\infty})>\varepsilon/2$ for all $n>k$ from the statements $(B_{n})$ and $(C_{n})$ . We also note that, from the statement $(A_{n})$ , we have $v_{n}\rightarrow 0$ in norm. Therefore, since $\varphi$ is norm-continuous, there exists $N_{0}\in\mathbb{N}$ such that

[TABLE]

Lastly observe that for all $n\in\mathbb{N}$ , $v_{n}\in B(0,m)$ and $\widehat{x}_{n}\in\partial\varphi(v_{n})$ and thus $\|\widehat{x}_{n}\|\leq L$ by the local-boundedness of $\partial\varphi$ . Therefore, if $n>\frac{8Lm}{\beta_{1}\varepsilon}$ , we have that

[TABLE]

Part IV: For each $n\in\mathbb{N}$ , let $K_{n}:=\text{co}\{f_{k}:k\geq n\}-f_{\infty}$ and note that $(K_{n}:n\in\mathbb{N})$ is a decreasing sequence of nonempty, convex subsets of $X^{*}$ . Set $r:=\varepsilon/8$ . Then we have that

[TABLE]

To see this, let $f\in K_{1}$ . Then $f=\sum_{i=1}^{k}\lambda_{i}f_{n_{i}}-f_{\infty}$ where $\lambda_{i}\geq 0$ for all $1\leq i\leq k$ and $\sum_{i=1}^{k}\lambda_{i}=1$ . Set $N>\max\{n_{1},\dots,n_{k},N_{0},\frac{8Lm}{\beta_{1}\varepsilon}\}$ . Then we have,

[TABLE]

Therefore, since $f\in K_{1}$ was arbitrary, we have that $\beta_{1}r+\varphi(0)<\inf_{f\in K_{1}}\varphi(\beta_{1}f)$ as claimed. So, by Lemma 3.2, there exists a sequence $(g_{n}:n\in\mathbb{N})$ such that for all $n\in\mathbb{N}$ :

(i)

$g_{n}\in\text{co}\{f_{k}:k\geq n\}$ and 2. (ii)

$\displaystyle\varphi($$\sum_{i=1}^{n}\beta_{i}(g_{i}-f_{\infty})$$)+\beta_{n+1}r<\varphi($$\sum_{i=1}^{n+1}\beta_{i}(g_{i}-f_{\infty})).$ $(*)$

Part V: Since $(g_{n}:n\in\mathbb{N})$ is a sequence in $B_{X^{*}}$ , it must have a weak∗-cluster point. So, let $g_{\infty}$ be a weak∗-cluster point of $(g_{n}:n\in\mathbb{N})$ . Then, by Proposition 3.14, we have that $g_{\infty}\in K_{w^{*}}(f_{n}:n\in\mathbb{N})$ . Since for all $n\in\mathbb{N}$ , we have that $\sum_{i=1}^{n}\beta_{i}g_{i}\in S=\frac{m}{2}B_{X^{*}}\cap\text{span}\{f_{n}:n\in\mathbb{N}\}$ , and $0\leq\sum_{i=1}^{n}\beta_{i}\leq m/2$ , then

[TABLE]

by the observation made in Part II. As in Part V of the proof of Theorem 3.9, we set $g:=\sum_{i=1}^{\infty}\beta_{i}(g_{i}-g_{\infty})$ and deduce that $g\in X^{*}.$

Part VI: Lastly note that $\|g\|\leq 2\sum_{i=1}^{\infty}\beta_{i}\leq m$ and so $g\in mB_{X^{*}}\subseteq A$ . Therefore, in order to contradict our original assumption, and thus complete the proof, it suffices to show that $\partial\varphi(g)\cap\widehat{X}=\varnothing$ . So, suppose that there exists $\widehat{x}\in\widehat{X}$ such that $\widehat{x}\in\partial\varphi(g)$ . Then, if $n>1$ ,

[TABLE]

Rearranging gives us that

[TABLE]

Taking the limit as $n\rightarrow\infty$ we get that

[TABLE]

which contradicts the inequality $\displaystyle\liminf_{n\rightarrow\infty}g_{n}(x)\leq g_{\infty}(x)$ . Thus, $\partial\varphi(g)\cap\widehat{X}=\varnothing$ , which contradicts our original assumption concerning the function $\varphi$ . This completes the proof. $\Box$

Remark 4.14.

To see that Theorem 4.13 is indeed a generalisation of Theorem 3.15 consider the following. Suppose that $C$ is a nonempty closed and bounded convex subset of a Banach space $(X,\|\cdot\|)$ with $0\in C$ . Define $p:X^{*}\to{\mathbb{R}}$ by, $p(x^{*}):=\sup_{c\in C}x^{*}(c)$ for all $x^{*}\in X^{*}$ . Then $\widehat{C}\subseteq\partial p(0)$ . If every $x^{*}\in X^{*}$ attains its supremum over $C$ then $\partial p(x^{*})\cap\widehat{X}\not=\varnothing$ for every $x^{*}\in X^{*}$ . This last fact follows because, if $x^{*}\in X^{*}\setminus\{0\}$ , $c\in C$ and $p(x^{*})=x^{*}(c)$ then $\widehat{c}\in\partial p(x^{*})$ . Thus, by Theorem 4.13,

[TABLE]

Hence, $C$ is weakly compact by Remark 2.25. Let us also note that an earlier version of Theorem 4.13 appeared in [32, Theorem 2.2].

5 Variational Principle

The corner stone of this section is the Brøndsted-Rockafellar Theorem which gives the existence of subgradients for lower semicontinuous convex functions defined on Banach spaces. The key notion behind this theorem is the notion of an “ $\varepsilon$ -subgradient”. Suppose that $f:X\to{\mathbb{R}}\cup\{\infty\}$ is a convex proper lower semicontinuous function on a normed linear space $(X,\|\cdot\|)$ and $x\in\mbox{Dom}(f)$ . Then, for any $\varepsilon>0$ , we define the $\varepsilon$ -subdifferential $\partial_{\varepsilon}f(x)$ by,

[TABLE]

Theorem 5.1 ([2]).

Suppose that $f:X\to{\mathbb{R}}\cup\{\infty\}$ is a convex proper lower semicontinuous function on a Banach space $(X,\|\cdot\|)$ . Then, given any point $x_{0}\in\mbox{Dom}(f)$ , $\varepsilon>0$ and any $x_{0}^{*}\in\partial_{\varepsilon}f(x_{0})$ , there exists $x\in\mbox{Dom}(f)$ and $x^{*}\in X^{*}$ such that $x^{*}\in\partial f(x)$ , $\|x-x_{0}\|\leq\sqrt{\varepsilon}$ and $\|x^{*}-x_{0}^{*}\|\leq\sqrt{\varepsilon}$ .

We will use the Brøndsted-Rockafellar Theorem (Theorem 5.1) to show that certain functions attain their maximum value in a rather strong way, that we now make precise. We shall say that a function $f:X\to[-\infty,\infty)$ defined on a normed linear space $(X,\|\cdot\|)$ attains a (or has a) strong maximum at $x_{0}\in X$ if, $f(x_{0})=\sup_{x\in X}f(x)$ and $\lim_{n\to\infty}x_{n}=x_{0}$ whenever $(x_{n}:n\in{\mathbb{N}})$ is a sequence in $X$ such that $\lim_{n\to\infty}f(x_{n})=\sup_{x\in X}f(x)=f(x_{0})$ .

In addition to the Brøndsted-Rockafellar Theorem and the definition of a strong maximum, we shall require one more definition. Let $\varphi:X\to 2^{Y}$ be a set-valued mapping acting between a topological space $(X,\tau)$ and a normed linear space $(Y,\|\cdot\|)$ . Then we say that $\varphi$ is single-valued and norm upper semicontinuous at $x_{0}\in X$ if, $\varphi(x_{0})=:\{y_{0}\}$ is a singleton subset of $Y$ and for each $\varepsilon>0$ there exists an open neighbourhood $U$ of $x_{0}$ such that $\varphi(U)\subseteq B[y_{0},\varepsilon]$ .

We shall now combine the Brøndsted-Rockafellar Theorem with these definitions in order to obtain the following preliminary result.

Proposition 5.2.

Suppose that $f:X\to{\mathbb{R}}\cup\{\infty\}$ is a proper function on a Banach space $(X,\|\cdot\|)$ and suppose that $f^{*}:X^{*}\to{\mathbb{R}}\cup\{\infty\}$ (the Fenchel conjugate of $f$ ) is defined by,

[TABLE]

Then,

(i)

$f^{*}$ * is a convex and weak*∗* lower semicontinuous function on $\mathrm{Dom}(f^{*})$ ;* 2. (ii)

$f^{*}$ * is continuous on $\mathrm{int}(\mathrm{Dom}(f^{*}))$ ;* 3. (iii)

if $x^{*}\in\mathrm{Dom}(f^{*})$ and $x\in\mathrm{argmax}(x^{*}-f)$ then $\widehat{x}\in\partial f^{*}(x^{*})$ ; 4. (iv)

if $\varepsilon>0$ , $x^{*}\in\mathrm{Dom}(f^{*})$ , $x\in X$ and $f^{*}(x^{*})-\varepsilon<x^{*}(x)-f(x)$ then $\widehat{x}\in\partial_{\varepsilon}f^{*}(x^{*})$ ; 5. (v)

if $x_{0}^{*}\in\mathrm{int}(\mathrm{Dom}(f^{*}))$ , $x\in\mathrm{argmax}(x_{0}^{*}-f)$ and $x^{*}\mapsto\partial f^{*}(x^{*})$ is single-valued and norm upper semicontinuous at $x_{0}^{*}$ then $x_{0}^{*}-f$ has a strong maximum at $x$ .

Proof.

For those people familiar with the Fenchel conjugate, they may want to skip the proofs of (i)-(iv).

(i)

For each $x\in\mathrm{Dom}(f)$ define $g_{x}:X^{*}\to{\mathbb{R}}$ by, $g_{x}(x^{*}):=\widehat{x}(x^{*})-f(x)$ . Then each function $g_{x}$ is weak∗ continuous and affine. Now for each $x^{*}\in X^{*}$ , $f^{*}(x^{*})=\sup_{x\in\mathrm{Dom}(f)}g_{x}(x^{*})$ . Thus, as the pointwise supremum of a family of weak∗ continuous affine mappings, the Fenchel conjugate of $f$ , is itself convex and weak∗ lower semicontinuous. [Recall the general fact that the pointwise supremum of a family of convex functions is convex and the pointwise supremum of a family of lower semicontinuous mappings is again lower semicontinuous]. 2. (ii)

Since this statement is vacuously true when $\mathrm{int}(\mathrm{Dom}(f^{*}))=\varnothing$ , we will assume that $\mathrm{int}(\mathrm{Dom}(f^{*}))$ is nonempty. Let us first recall that by Proposition 4.1, and by the fact that $f^{*}$ is convex and the fact that $\mathrm{int}(\mathrm{Dom}(f^{*}))$ is also convex, it is sufficient to show that $f^{*}$ is locally bounded above on $\mathrm{int}(\mathrm{Dom}(f^{*}))$ . In fact, as we shall now show, it is sufficient to show that $f^{*}$ is locally bounded above at a single point $x_{0}^{*}\in\mathrm{int}(\mathrm{Dom}(f^{*}))$ . To this end, suppose that $f^{*}$ is locally bounded above at $x_{0}^{*}\in\mathrm{int}(\mathrm{Dom}(f^{*}))$ . Then there exist an $0<M$ and a $0<\delta$ such that $f^{*}(y^{*})\leq M$ for all $y^{*}\in B[x_{0}^{*},\delta]$ . Let $x^{*}$ be any point in $\mathrm{int}(\mathrm{Dom}(f^{*}))$ . Since $\mathrm{int}(\mathrm{Dom}(f^{*}))$ is an open convex set, there exists a point $y^{*}\in\mathrm{int}(\mathrm{Dom}(f^{*}))$ and a $0<\lambda<1$ such that $x^{*}=\lambda y^{*}+(1-\lambda)x_{0}^{*}$ . Let $M^{*}:=\max\{M,f^{*}(y^{*})\}$ and note that

[TABLE]

We claim that $f^{*}$ is bounded above by $M^{*}$ on $B[x^{*},(1-\lambda)\delta]$ . To see this, let $z^{*}$ be any element of $B[x^{*},(1-\lambda)\delta]$ . Then $z^{*}=\lambda y^{*}+(1-\lambda)w^{*}$ for some $w^{*}\in B[x_{0}^{*},\delta]$ since,

[TABLE]

Therefore,

[TABLE]

Next, we will use that fact that since $\mathrm{int}(\mathrm{Dom}(f^{*}))$ is a nonempty open subset of a complete metric space, it is itself a Baire space with the relative topology. Now, for each $n\in{\mathbb{N}}$ , let

[TABLE]

Since $f^{*}$ is weak∗ lower semicontinuous, it is lower semicontinuous with respect to the norm topology too. Therefore, each set $F_{n}$ is closed with respect to the relative norm topology on $\mathrm{int}(\mathrm{Dom}(f^{*}))$ . Since $\mathrm{int}(\mathrm{Dom}(f^{*}))=\bigcup_{n\in{\mathbb{N}}}F_{n}$ , there exists an $n_{0}\in{\mathbb{N}}$ such that $\mathrm{int}(F_{n_{0}})\not=\varnothing$ . Hence, $f^{*}$ is locally bounded above at each point of $\mathrm{int}(F_{n_{0}})$ . This completes the proof of part (ii). 3. (iii)

Let $y^{*}$ be any element of $\mathrm{Dom}(f^{*})$ . Then,

[TABLE]

Therefore, $\widehat{x}\in\partial f^{*}(x^{*})$ . 4. (iv)

Let $y^{*}$ be any element of $\mathrm{Dom}(f^{*})$ . Then,

[TABLE]

Therefore, $\widehat{x}\in\partial_{\varepsilon}f^{*}(x^{*})$ . 5. (v)

Let $(x_{n}:n\in{\mathbb{N}})$ be a sequence in $X$ such that

[TABLE]

We will show that $(x_{n}:n\in{\mathbb{N}})$ converges to $x$ . Let $\varepsilon>0$ . By (iii) and the assumption that $\partial f^{*}(x_{0}^{*})$ is a singleton we have that $\partial f^{*}(x_{0}^{*})=\{\widehat{x}\}$ . Since, $x^{*}\mapsto\partial f^{*}(x^{*})$ , is norm upper semicontinuous at $x_{0}^{*}$ there exists a $0<\delta<\varepsilon$ such that if $\|x^{*}-x_{0}^{*}\|\leq\delta$ then $\|F-\widehat{x}\|<\varepsilon$ for all $F\in\partial f^{*}(x^{*})$ . Choose $N\in{\mathbb{N}}$ such that $(x_{0}^{*}-f)(x_{n})>f^{*}(x_{0})-\delta^{2}$ for all $n>N$ . Then, by (iv), $\widehat{x_{n}}\in\partial_{\delta^{2}}f^{*}(x_{0}^{*})$ for all $n>N$ . Let $n>N$ . Then, by the Brøndsted-Rockafellar Theorem, there exist $x_{n}^{*}\in\mathrm{Dom}(f^{*})$ and $F_{n}\in X^{**}$ such that $F_{n}\in\partial f^{*}(x_{n}^{*})$ , $\|x_{n}^{*}-x_{0}^{*}\|\leq\delta$ and $\|F_{n}-\widehat{x_{n}}\|\leq\delta<\varepsilon$ . Therefore,

[TABLE]

This completes the proof. $\Box$

Our first variational principle applies to dual differentiation spaces, [13]. Recall that a Banach space $(X,\|\cdot\|)$ is called a dual differentiability space (or DD-space for short) if every continuous convex function $\varphi:A\to{\mathbb{R}}$ defined on a nonempty open convex subset $A$ of $X^{*}$ such that $\{x^{*}\in A:\partial\varphi(x^{*})\cap\widehat{X}\not=\varnothing\}$ contains a dense and $G_{\delta}$ subset of $A$ , has the property that its subdifferential mapping $\partial\varphi:A\to 2^{X^{**}}$ is single-valued and norm upper semicontinuous at each point of a dense and $G_{\delta}$ subset of $A$ (or equivalently, $\varphi$ is Fréchet differentiable at the points of a dense and $G_{\delta}$ subset of $A$ , [39, Proposition 2.8]).

Theorem 5.3.

Let $f:X\to{\mathbb{R}}\cup\{\infty\}$ be a proper function on a dual differentiation space $(X,\|\cdot\|)$ . If there exists a nonempty open subset $A$ of $\mathrm{Dom}(f^{*})$ and a dense and $G_{\delta}$ subset $R$ of $A$ such that $\mathrm{argmax}(x^{*}-f)\not=\varnothing$ for each $x^{*}\in R$ , then there exists a dense and $G_{\delta}$ subset $R^{\prime}$ of $A$ such that $(x^{*}-f):X\to{\mathbb{R}}\cup\{-\infty\}$ has a strong maximum for each $x^{*}\in R^{\prime}$ . In addition, if $0\in A$ and $\varepsilon>0$ then there exists an $x_{0}^{*}\in X^{*}$ with $\|x^{*}_{0}\|<\varepsilon$ such that $(x_{0}^{*}-f):X\to{\mathbb{R}}\cup\{-\infty\}$ has a strong maximum.

Proof.

Consider $\partial f^{*}:A\to 2^{X^{**}}$ . Then by Proposition 5.2 part (iii)

[TABLE]

contains a dense and $G_{\delta}$ subset of $A$ . Since $X$ is a dual differentiation space,

[TABLE]

contains a dense and $G_{\delta}$ subset of $A$ . Let $R^{\prime}:=R_{1}\cap R_{2}$ . Then $R^{\prime}$ contains a dense and $G_{\delta}$ subset of $A$ and by Proposition 5.2 part (v), $(x^{*}-f)$ has a strong maximum for each $x^{*}\in R^{\prime}$ . $\Box$

Remark 5.4.

There are two main weaknesses with this theorem: (i) although it is known that many Banach spaces (e.g. all spaces with the Radon-Nikodým property, [13], all weakly Lindelöf spaces, [26], all spaces that admit an equivalent locally uniformly rotund norm [12] and all spaces whose dual space $X^{*}$ is weak Asplund, [13]) are dual differentiation spaces, it is still an open question as to whether every Banach space is a dual differentiation space; (ii) it is not clear how one would go about showing that there exists a “large” subset $R$ of $\mathrm{int}(\mathrm{Dom}(f^{*}))$ with the property that $\mathrm{argmax}(x^{*}-f)\not=\varnothing$ for each $x^{*}\in R$ .

For our next result, and main variational principle, we address concern (i) of Remark 5.4 by giving a variational principle that holds in all Banach spaces. Unfortunately, there is a cost for this level of generality. Namely, we need to impose a strong assumption upon the mapping $x^{*}\mapsto\mathrm{argmax}(x^{*}-f)$ . We also need to employ the following non-trivial result concerning minimal weak cuscos, which was first proved by J. Christensen in [5], using topological games (in the domain space), and later rephrased in [13].

Theorem 5.5.

A minimal weak∗ cusco $\varphi:A\to 2^{X^{**}}$ from a complete metric space $A$ into subsets of the second dual $X^{**}$ of a Banach space $(X,\|\cdot\|)$ , where the set $\{x\in A:\varphi(x)\subseteq\widehat{X}\}$ contains a dense and $G_{\delta}$ subset of $A$ , is single-valued and norm upper semicontinuous at the points of a dense and $G_{\delta}$ subset of $A$ .

In order to extend the applicability of Theorem 5.5, we will show that some sets that are not necessarily complete metric spaces under their given metrics can be “re-metrized” to become a complete metric space under a new metric, while retaining the same topology. Indeed, suppose that $A$ is a nonempty open subset of a complete metric space $(M,d)$ . Then $(M\times{\mathbb{R}},\rho)$ is also a complete metric space under the metric,

[TABLE]

Let $f:A\to{\mathbb{R}}$ be defined by, $f(x):=\inf\{d(x,y)\in{\mathbb{R}}:y\in M\setminus A\}=\mathrm{dis}(x,M\setminus A)$ . Note that $f$ is continuous on $A$ . Let $G:=\{(x,r)\in M\times{\mathbb{R}}:x\in A\mbox{ and }r=1/f(x)\}$ . Then $G$ is a closed subset of $M\times{\mathbb{R}}$ , and hence is a complete metric space with respect to the restriction of the metric $\rho$ to $G$ . Finally, let us note that $G$ is homeomorphic to $A$ . Indeed, the mapping $\pi:G\to A$ defined by, $\pi(x,r):=x$ , is such a homeomorphism. Thus, a nonempty open subset of a complete metric space is “completely metrisable”.

Theorem 5.6.

Let $f:X\to{\mathbb{R}}\cup\{\infty\}$ be a proper function on a Banach space $(X,\|\cdot\|)$ . If there exists a nonempty open subset $A$ of $\mathrm{Dom}(f^{*})$ such that $\mathrm{argmax}(x^{*}-f)\not=\varnothing$ for each $x^{*}\in A$ , then there exists a dense and $G_{\delta}$ subset $R^{\prime}$ of $A$ such that $(x^{*}-f):X\to{\mathbb{R}}\cup\{-\infty\}$ has a strong maximum for each $x^{*}\in R^{\prime}$ . In addition, if $0\in A$ and $\varepsilon>0$ then there exists an $x_{0}^{*}\in X^{*}$ with $\|x^{*}_{0}\|<\varepsilon$ such that $(x_{0}^{*}-f):X\to{\mathbb{R}}\cup\{-\infty\}$ has a strong maximum.

Proof.

Consider $\partial f^{*}:A\to 2^{X^{**}}$ . Then, by Proposition 5.2 part (iii), $\partial f^{*}(x^{*})\cap\widehat{X}\not=\varnothing$ for all $x^{*}\in A$ . Thus, by Theorem 4.13, $\partial f^{*}(x^{*})\subseteq\widehat{X}$ for all $x^{*}\in A$ . Hence, $x^{*}\mapsto\partial f^{*}(x^{*})$ , is a minimal weak cusco on $A$ . Therefore, by Theorem 5.5, there exists a dense and $G_{\delta}$ subset $R^{\prime}$ of $A$ such that $\partial f^{*}$ is single-valued and norm upper semicontinuous at each point of $R^{\prime}$ . So, by Proposition 5.2 part (v), $(x^{*}-f)$ has a strong maximum for each $x^{*}\in R^{\prime}$ . $\Box$

Note that the conclusion of this theorem is identical to that of Stegall’s variational principle, see [44].

Question 5.7.

Is every Banach space $(X,\|\cdot\|)$ a dual differentiation space?

If the answer to this question is “yes” then Theorem 5.3 will supersede Theorem 5.6.

Index of notation and assumed knowledge

•

The natural numbers, $\mathbb{N}:=\{1,2,3,\ldots\}$ .

•

The integers, $\mathbb{Z}:=\{\ldots,-2,-1,0,1,2\ldots\}$ .

•

The rational numbers, $\mathbb{Q}:=\left\{a/b:a,b\in\mathbb{Z},b\neq 0\right\}$ .

•

The real numbers, ${\mathbb{R}}$ .

•

For any set $X$ , $\mathcal{P}(X)$ is the set of all subsets of $X$ .

•

For any subset $A$ of a topological space $\left(X,\tau\right)$ , we define

–

$\mathrm{int}(A)$ , called the interior of $A$ , is the union of all open sets contained in $A$ ;

–

$\overline{A}$ , called the closure of $A$ , is the intersection of all closed sets containing $A$ ;

–

$\mathrm{Bd}(A)$ , called the boundary of $A$ , is $\overline{A}\setminus\mathrm{int}(A)$ ,

•

For any points $x$ and $y$ in a vector space $X$ , we define the following intervals:

–

$[x,y]:=\{x+\lambda(y-x):0\leq\lambda\leq 1\};$

–

$(x,y):=\{x+\lambda(y-x):0<\lambda<1\};$

–

$[x,y):=\{x+\lambda(y-x):0\leq\lambda<1\};$

–

$(x,y]:=\{x+\lambda(y-x):0<\lambda\leq 1\}.$

•

For any normed linear space $\left(X,\|\cdot\|{\cdot}\right)$ , we define

–

$B[x,r]:=\left\{y\in X:\|x-y\|\leq r\right\}$ , for any $x\in X$ and $r>0$ ;

–

$B(x;r):=\left\{y\in X:\|x-y\|<r\right\}$ , for any $x\in X$ and $r>0$ ;

–

$B_{X}:=B[0,1]$ ;

–

$S_{X}:=\left\{x\in X:\|x\|=1\right\}$ .

•

Given a compact Hausdorff space $K$ , we write $C(K)$ for the set of all real-valued continuous functions on $K$ . This is a vector space under the operations of pointwise addition and pointwise scalar multiplication. $C(K)$ becomes a Banach space when equipped with the uniform norm $\|\cdot\|_{\infty}$ , defined by

[TABLE]

•

Let $A$ and $B$ be sets. Given a function $f:A\rightarrow B$ , we define $f(A):=\bigcup_{a\in A}\{f(x)\}$ . Similarly, given a set valued mapping $\varphi:A\rightarrow\mathcal{P}(B)$ , we define $\varphi(A):=\bigcup_{a\in A}\varphi(x)$ .

•

For a normed linear space $(X,\|\cdot\|{\cdot})$ , $X^{*}$ , the set of bounded linear maps from $X$ to $\mathbb{R}$ , is called the dual space of $X$ . $X^{*}$ is a Banach space when equipped with the operator norm, given by

[TABLE]

•

Let $X$ be a set and $Y$ a totally ordered set. For any function $f:X\rightarrow Y$ we define

[TABLE]

•

Let $A$ be a subset of a vector space $X$ . Then the convex hull of $A$ , denoted by $\mathrm{co}(A)$ , is defined to be the intersection of all convex subsets of $X$ that contain $A$ .

•

Let $X$ be a set and let $f:X\rightarrow\mathbb{R}\cup\{\infty\}$ a function. Then

[TABLE]

We say that the function $f$ is a proper function if $\mathrm{Dom}(f)\not=\varnothing$ .

•

Let $(X,\|\cdot\|)$ be a normed linear space and $f:X\to[-\infty,\infty]$ . Then the Fenchel conjugate of $f$ is the function $f^{*}:X^{*}\to[-\infty,\infty]$ defined by,

[TABLE]

The function $f^{*}$ is convex and if $f$ is a proper function then $f^{*}$ never takes the value $-\infty$ .

•

If $f$ is a convex function defined on a nonempty convex subset $K$ of a normed linear space $(X,\|\cdot\|{\cdot})$ and $x\in K$ , then we define the subdifferential of $f$ at $x$ to be the set $\boldsymbol{\partial f(x)}$ of all $x^{*}\in X^{*}$ satisfying

[TABLE]

•

It is assumed that the reader has a basic working knowledge of metric spaces, normed linear spaces and even basic general topology. In particular, it is assumed that the reader is familiar with Tychonoff’s theorem.

Theorem (Tychonoff’s Theorem [9]).

The Cartesian product $\prod_{s\in S}S_{s}$ , where $X_{s}\not=\varnothing$ for all $s\in S$ , is compact if, and only if, all spaces $X_{s}$ are compact.

Bibliography44

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Alaoglu, Leon, Weak topologies of normed linear spaces , Ann. of Math. (2) 41 (1940), 252–267.
2[2] Brøndsted, A. and Rockafellar, R. T., On the subdifferentiability of convex functions , Proc. Amer. Math. Soc. 16 (1965), 605–611.
3[3] B. Cascales, J. Orihuela, and M. Ruiz Galán, Compactness, optimality, and risk , Computational and analytical mathematics, Springer Proc. Math. Stat., vol. 50, Springer, New York, 2013, pp. 161–218. MR 3108428
4[4] Cascales, B. and Orihuela, J. and Pérez, A., One-sided James’ compactness theorem , J. Math. Anal. Appl. 445 (2017), no. 2, 1267–1283.
5[5] Christensen, Jens Peter Reus, Theorems of Namioka and R. E. Johnson type for upper semicontinuous and compact valued set-valued mappings , Proc. Amer. Math. Soc. 86 (1982), no. 4, 649–655.
6[6] Gabriel Debs, Gilles Godefroy, and Jean Saint-Raymond, Topological properties of the set of norm-attaining linear functionals , Canad. J. Math. 47 (1995), no. 2, 318–329. MR 1335081
7[7] Robert Deville, Gilles Godefroy, and Václav Zizler, Smoothness and renormings in Banach spaces , Pitman Monographs and Surveys in Pure and Applied Mathematics, vol. 64, Longman Scientific & Technical, Harlow; copublished in the United States with John Wiley & Sons, Inc., New York, 1993. MR 1211634
8[8] Dunford, Nelson and Schwartz, Jacob T., Linear Operators. I. General Theory , Interscience Publishers, Inc., New York; Interscience Publishers, Ltd., London, 1958.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

1 Introduction

2 Preliminaries

2.1 Weak topologies on sets

Proposition 2.1**.**

Proof.

Proposition 2.2**.**

Proof.

2.2 Linear topologies

Proposition 2.3**.**

Proof.

Remark 2.4**.**

Proposition 2.5**.**

Proposition 2.6**.**

Proof.

Lemma 2.7**.**

Proof.

Proposition 2.8**.**

Proof.

Remark 2.9**.**

Proposition 2.10**.**

Proof.

2.3 Hahn-Banach Theorem

Theorem 2.11** (Hahn-Banach Theorem [8]).**

Proof.

Corollary 2.12**.**

Proof.

Corollary 2.13**.**

Proof.

Proposition 2.14**.**

Proof.

Theorem 2.15**.**

Proof.

Remark 2.16**.**

Theorem 2.17** (Separation Theorem).**

Proof.

Proposition 2.18**.**

Proof.

2.4 Weak∗ topology

Proposition 2.19**.**

Proof.

Proposition 2.20**.**

Proof.

Theorem 2.21** (Bipolar Theorem).**

Proof.

Corollary 2.22** (Goldstine’s Theorem).**

Proof.

Theorem 2.23** (Banach-Alaoglu Theorem [1]).**

Proof.

Proposition 2.24**.**

Proof.

Remark 2.25**.**

Theorem 2.26** (​​[8]).**

Proof.

3 James’ Theorem on weak compactness

3.1 James’ Theorem on weak compactness: the separable case

Lemma 3.1** (​​ [34]).**

Proof.

Lemma 3.2** (​​[34]).**

Proof.

Lemma 3.3**.**

Theorem 3.4** (​​[10, 11]).**

Proof.

Remark 3.5**.**

Theorem 3.6** (James’ Theorem: version 1, [18]).**

Proof.

3.2 James’ Theorem on weak compactness: the weak∗ sequentially compact case

Proposition 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

Theorem 3.9** (James’ Theorem: version 2).**

Proof.

3.3 James’ Theorem on weak compactness: the general case

Proposition 2.1.

Proposition 2.2.

Proposition 2.3.

Remark 2.4.

Proposition 2.5.

Proposition 2.6.

Lemma 2.7.

Proposition 2.8.

Remark 2.9.

Proposition 2.10.

Theorem 2.11 (Hahn-Banach Theorem [8]).

Corollary 2.12.

Corollary 2.13.

Proposition 2.14.

Theorem 2.15.

Remark 2.16.

Theorem 2.17 (Separation Theorem).

Proposition 2.18.

Proposition 2.19.

Proposition 2.20.

Theorem 2.21 (Bipolar Theorem).

Corollary 2.22 (Goldstine’s Theorem).

Theorem 2.23 (Banach-Alaoglu Theorem [1]).

Proposition 2.24.

Remark 2.25.

Theorem 2.26 ([8]).

Lemma 3.1 ( [34]).

Lemma 3.2 ([34]).

Lemma 3.3.

Theorem 3.4 ([10, 11]).

Remark 3.5.

Theorem 3.6 (James’ Theorem: version 1, [18]).

Proposition 3.7.

Lemma 3.8.

Theorem 3.9 (James’ Theorem: version 2).

Lemma 3.10.

Lemma 3.11.

Corollary 3.12.

Corollary 3.13.

Proposition 3.14.

Theorem 3.15 (James’ Theorem: version 3, [20]).

Theorem 3.16 ([19]).

Theorem 3.17 ([38]).

Corollary 3.18 (Krein-Smulian Theorem,[27]).

Theorem 3.19 (Simons, [43]).

Corollary 3.20.

Corollary 3.21 (Simons).

Proposition 3.22.

Theorem 3.23 ([21]).

Theorem 3.24 ([42]).

Corollary 3.25 ([42]).

Lemma 3.26 ([30]).

Theorem 3.27 ([30]).

Remark 3.28.

Example 3.29.

Conjecture 3.30.

Proposition 4.1 ([39, Proposition 1.6]).

Lemma 4.2 ([39, Proposition 1.11]).

Proposition 4.3 ([39, Proposition 1.11]).

Proposition 4.4 ([39, Example 2.2]).

Proposition 4.5 ([39, Proposition 2.5]).

Proposition 4.6 ([5]).

Proposition 4.7.

Corollary 4.8.

Proposition 4.9 ([5]).

Proposition 4.10 ([22, 39]).

Lemma 4.11.

Lemma 4.12.

Theorem 4.13.