In
mathematics, 'Jensen's inequality', named after the Danish mathematician
Johan Jensen, relates the value of a
convex function of an
integral to the integral of the convex function. It was proved by Jensen in
1906[1]. Given its generality, the inequality appears in many forms depending on the context, some of which are presented below.
The finite form of the equation was the logo of
Institute for Mathematical Sciences at
University of Copenhagen until
2006.
Statements
The classical form of Jensen's inequality involves several numbers and weights. The inequality can be stated quite generally using
measure theory, and can be further generalized to its ''full strength'' in a probabilistic setting.
Finite form
For a real
convex function φ, numbers ''x
i'' in its domain, and positive weights ''a
i'', Jensen's inequality can be stated as:
:
and the inequality is clearly reversed if φ is
concave.
As a particular case, if the weights ''a
i'' are all equal to unity, then
:
For instance, the
function is ''concave'', so substituting
in the previous formula, this establishes the (logarithm of) the familiar
arithmetic mean-geometric mean inequality:
:
The variable ''x'' may, if required, be a function of another variable (or set of variables) ''t'', so that
. All of this carries directly over to the general continuous case: the weights ''a
i'' are replaced by a non-negative integrable function ''f''(''x''), such as a probability distribution, and the summations replaced by integrals.
In measure-theoretic notation
Let (Ω,A,μ) be a
measure space, such that μ(Ω) = 1. If ''g'' is a
real-valued function that is μ-
integrable, and if φ is a
measurable convex function on the real axis, then:
:
In probability-theory notation (real space)
The same result can be stated in a
probability theory setting. Let
be a
probability space,
an
integrable real-valued
random variable and φ a measurable
convex function. Then:
:
In this probability setting, the measure μ is intended as a probability
, the integral with respect to μ as an
expected value , and the function ''g'' as a
random variable .
In probability-theory notation (general)
More generally, let ''T'' be a real
topological vector space, and
a ''T''-valued
integrable random variable. In this general setting, ''integrable'' means that for any element ''z'' in the
dual space of ''T'':