In
mathematics, 'Muirhead's inequality', also known as the "bunching" method, generalizes the
inequality of arithmetic and geometric means.
Two preliminary definitions
The "''a''-mean"
For any
real vector
:
define the "''a''-mean" [''a''] of nonnegative real numbers ''x''
1, ..., ''x''
''n'' by
:
where the sum extends over all
permutations σ of { 1, ..., ''n'' }.
In case ''a'' = (1, 0, ..., 0), this is just the ordinary
arithmetic mean of ''x''
1, ..., ''x''
''n''. In case ''a'' = (1/''n'', ..., 1/''n''), it is the
geometric mean of ''x''
1, ..., ''x''
''n''. (When ''n'' = 2, this is the
Heinz mean.)
Doubly stochastic matrices
An ''n'' × ''n'' matrix ''P'' is ''
doubly stochastic'' precisely if both ''P'' and its transpose ''P''
T are
stochastic matrices. A ''stochastic matrix'' is a square matrix of nonnegative real entries in which the sum of the entries in each column is 1. Thus, a doubly stochastic matrix is a square matrix of nonnegative real entries in which the sum of the entries in each row and the sum of the entries in each column is 1.
The inequality
Muirhead's inequality states that [''a''] ≤ [''b''] for all ''x''
''i'' ≥ 0 if and only if there is some doubly stochastic matrix ''P'' for which ''a'' = ''Pb''.
The proof makes use of the fact that every doubly stochastic matrix is a weighted average of
permutation matrices (
Birkhoff-von Neumann theorem).
Another equivalent condition
Because of the symmetry of the sum, no generality is lost by sorting the exponents into decreasing order:
:
:
Then the existence of a doubly stochastic matrix ''P'' such that ''a'' = ''Pb'' is equivalent to the following system of inequalities:
:
:
:
:
:
:
(The ''last'' one is an equality; the others are weak inequalities.)
The sequence
is said to 'majorize' the sequence
.
Symmetric sum-notation tricks
It is useful to use a kind of special notation for the sums. A success in reducing an inequality in this form means that the only condition for testing it is to verify whether one exponent sequence (
) majorizes the other one.
:
This notation requires developing every permutation, developing an expression made of n! monomials, for instance:
:
Deriving the arithmetic-geometric mean inequality
Let
:
:
we have
:
:
:
:
then
:[''a
A''] ≥ [''a
G'']
which is
:
yielding the inequality.
Examples
Suppose you want to prove that ''x''
2 + ''y''
2 ≥ 2''xy'' by using bunching (Muirhead's inequality):
We transform it in the symmetric-sum notation:
:
The sequence (2, 0) majorizes the sequence (1, 1), thus the inequality holds by bunching. Again,
:
:
which yields
:
the sequence (3, 0, 0) majorizes the sequence (1, 1, 1), thus the inequality holds by bunching.
References
★ ''Combinatorial Theory'' by John N. Guidi, based on lectures given by
Gian-Carlo Rota in 1998, MIT Copy Technology Center, 2002.
★ Kiran Kedlaya's guide to solving inequalities at
[1].
★
Simple explanation with examples
★
Reference on PlanetMath (Muirhead's theorem)