The Heisenberg uncertainty principle looks like a fundamentally experimental result, but it can actually be derived from theory. The derivation rests on three concepts: the Cauchy-Schwarz inequality, measurement operators, and commutators.

1. The Cauchy-Schwarz inequality

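For any two vectors $|v\rangle$ and $|w\rangle$, the Cauchy-Schwarz inequality states that $|\langle v|w\rangle|^2 \le \langle v|v\rangle\langle w|w\rangle$. This can be proved through the completeness relation $\sum_i |i\rangle\langle i| = I$ by treating $|w\rangle/\sqrt{\langle w|w\rangle}$ as one of the orthonormal basis vectors. In the proof below, $\{|i\rangle\}$ is the set of orthonormal basis vectors.

$$\langle v|v\rangle\langle w|w\rangle = \sum_i \langle v|i\rangle\langle i|v\rangle\langle w|w\rangle \ge \frac{\langle v|w\rangle\langle w|v\rangle}{\langle w|w\rangle}\langle w|w\rangle = \langle v|w\rangle\langle w|v\rangle = |\langle v|w\rangle|^2$$

The inequality keeps only the term of the sum that belongs to the basis vector $|w\rangle/\sqrt{\langle w|w\rangle}$. Every dropped term has the form $|\langle i|v\rangle|^2\langle w|w\rangle$ and is therefore non-negative.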
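As a quick sanity check of the inequality $|\langle v|w\rangle|^2 \le \langle v|v\rangle\langle w|w\rangle$, here is a minimal sketch in plain Python. The helper names (`inner`, `random_vector`, `cauchy_schwarz_gap`), the dimension, and the random seed are all illustrative choices, not part of the original derivation.

```python
import random

def inner(v, w):
    """Inner product <v|w>, conjugate-linear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(v, w))

def random_vector(dim, rng):
    """Random complex vector with entries in the square [-1, 1] x [-1, 1]."""
    return [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(dim)]

def cauchy_schwarz_gap(v, w):
    """Return <v|v><w|w> - |<v|w>|^2, which Cauchy-Schwarz says is never negative."""
    return (inner(v, v) * inner(w, w)).real - abs(inner(v, w)) ** 2

rng = random.Random(0)  # fixed seed so the run is repeatable

# The gap should be non-negative (up to rounding) for every random pair.
gaps = [
    cauchy_schwarz_gap(random_vector(3, rng), random_vector(3, rng))
    for _ in range(1000)
]
min_gap = min(gaps)

# Equality is reached when one vector is a scalar multiple of the other.
v = random_vector(3, rng)
parallel_gap = cauchy_schwarz_gap(v, [2j * a for a in v])

print("smallest gap over random pairs:", min_gap)
print("gap for parallel vectors:", parallel_gap)
```

The parallel case mirrors the proof: when $|v\rangle$ is proportional to $|w\rangle$, the single term kept from the completeness sum is the only non-zero one, so nothing is lost.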

2. Measurement Operators

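An observable, or in linear algebra terms a basis, is made up of a set of basis states $|i\rangle$ together with a set of measurement operators. The measurement operator for some state $|i\rangle$ can be expressed as $M_i = |i\rangle\langle i|$, such that $\langle\psi|M_i|\psi\rangle$ is the probability that the measurement of $|\psi\rangle$ yields $|i\rangle$. This can easily be seen by substitution: $\langle\psi|M_i|\psi\rangle = \langle\psi|i\rangle\langle i|\psi\rangle = |\langle i|\psi\rangle|^2$. Each state also has some associated energy $E_i$. Using this, one can multiply the probability of each state by its energy and sum over all states to arrive at an expected value for the energy. The mathematical representation of this is shown below.

$$\langle E\rangle = \sum_i E_i \langle\psi|M_i|\psi\rangle$$

The wavefunction can be factored out to arrive at the following form.

$$\langle E\rangle = \langle\psi|\Big(\sum_i E_i M_i\Big)|\psi\rangle$$

The sum of scaled operators on the inside can be treated as a single operator $M = \sum_i E_i M_i$. This is the decomposition of an observable in its own basis. Note that this can be done with different observables that have different states. Now, it can be seen that $\langle\psi|M|\psi\rangle$ is the expected energy of the observable, often written $\langle M\rangle$ for simplicity.

One can use this average value to measure variance by defining it as the average square of the difference between the measured value and the expected value. The square is used so that positive and negative deviations from the mean don’t cancel out. The standard deviation $\Delta(M)$ is the square root of the variance. In mathematical terms, the variance is $\langle (M - \langle M\rangle)^2\rangle$. This is simplified below.

$$[\Delta(M)]^2 = \langle (M - \langle M\rangle)^2\rangle = \langle M^2\rangle - 2\langle M\rangle\langle M\rangle + \langle M\rangle^2 = \langle M^2\rangle - \langle M\rangle^2$$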
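To make the expected value and the variance concrete, the following sketch works through a hypothetical two-level system in plain Python. The basis, the energies `1.0` and `3.0`, and the state `psi` are made-up illustration values, and the helpers (`matvec`, `expectation`, `projector`) are assumptions of this sketch rather than anything from the original text.

```python
import math

def matvec(m, v):
    """Apply the matrix m (nested lists) to the column vector v."""
    return [sum(m[r][c] * v[c] for c in range(len(v))) for r in range(len(m))]

def expectation(m, psi):
    """<psi|m|psi>; real for a Hermitian m, so only the real part is kept."""
    return sum(a.conjugate() * b for a, b in zip(psi, matvec(m, psi))).real

def projector(ket):
    """Measurement operator |i><i| built from a basis state |i>."""
    return [[a * b.conjugate() for b in ket] for a in ket]

# Hypothetical two-level system: basis states |0>, |1> with made-up energies.
basis = [[1 + 0j, 0j], [0j, 1 + 0j]]
energies = [1.0, 3.0]
psi = [0.5 + 0j, math.sqrt(0.75) * 1j]  # normalized: 0.25 + 0.75 = 1

projectors = [projector(ket) for ket in basis]
probabilities = [expectation(p, psi) for p in projectors]

# The observable is the energy-weighted sum of the measurement operators.
observable = [
    [sum(e * p[r][c] for e, p in zip(energies, projectors)) for c in range(2)]
    for r in range(2)
]

# Expected value computed two ways: from probabilities and from the operator.
mean_from_probabilities = sum(e * p for e, p in zip(energies, probabilities))
mean_from_operator = expectation(observable, psi)

# Variance as <M^2> - <M>^2, where <M^2> applies the observable twice.
m_squared_psi = matvec(observable, matvec(observable, psi))
mean_square = sum(a.conjugate() * b for a, b in zip(psi, m_squared_psi)).real
variance = mean_square - mean_from_operator ** 2

# Variance straight from the definition: average squared deviation.
variance_from_probabilities = sum(
    p * (e - mean_from_probabilities) ** 2 for e, p in zip(energies, probabilities)
)

print(mean_from_probabilities, mean_from_operator)
print(variance, variance_from_probabilities)
```

Both routes to the mean agree, and so do both routes to the variance, which is the content of factoring the wavefunction out of the sum.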

3. Commutators

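If $A$ and $B$ are two operators, the commutator and anti-commutator below measure how far they are from commuting and from anti-commuting.

$$[A, B] = AB - BA \qquad \{A, B\} = AB + BA$$

Assume $A$ and $B$ are Hermitian and $\langle\psi|AB|\psi\rangle = x + iy$ with $x$ and $y$ real. Because $\langle\psi|BA|\psi\rangle$ is the complex conjugate of $\langle\psi|AB|\psi\rangle$, it equals $x - iy$. Then, $\langle\psi|[A, B]|\psi\rangle = 2iy$ and $\langle\psi|\{A, B\}|\psi\rangle = 2x$. This allows for the relationship shown below.

$$|\langle\psi|[A, B]|\psi\rangle|^2 + |\langle\psi|\{A, B\}|\psi\rangle|^2 = 4|\langle\psi|AB|\psi\rangle|^2$$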
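The relationship between the commutator, the anti-commutator, and $\langle\psi|AB|\psi\rangle$ can be checked numerically. The sketch below uses two arbitrary Hermitian $2\times 2$ matrices and an arbitrary normalized state; the matrices, the state, and the helper names are all illustrative assumptions.

```python
def matvec(m, v):
    """Apply the matrix m (nested lists) to the column vector v."""
    return [sum(m[r][c] * v[c] for c in range(len(v))) for r in range(len(m))]

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)] for r in range(n)]

def expectation(m, psi):
    """<psi|m|psi> as a complex number (AB alone need not be Hermitian)."""
    return sum(a.conjugate() * b for a, b in zip(psi, matvec(m, psi)))

# Arbitrary Hermitian matrices: each equals its own conjugate transpose.
A = [[1 + 0j, 1 - 1j], [1 + 1j, -2 + 0j]]
B = [[0.5 + 0j, 2j], [-2j, 1 + 0j]]
psi = [0.6 + 0j, 0.8j]  # normalized: 0.36 + 0.64 = 1

ab = matmul(A, B)
ba = matmul(B, A)
commutator = [[ab[r][c] - ba[r][c] for c in range(2)] for r in range(2)]
anticommutator = [[ab[r][c] + ba[r][c] for c in range(2)] for r in range(2)]

exp_ab = expectation(ab, psi)            # x + iy
exp_comm = expectation(commutator, psi)  # should equal 2iy
exp_anti = expectation(anticommutator, psi)  # should equal 2x

lhs = abs(exp_comm) ** 2 + abs(exp_anti) ** 2
rhs = 4 * abs(exp_ab) ** 2

print("<AB> =", exp_ab)
print("<[A,B]> =", exp_comm, " <{A,B}> =", exp_anti)
print("lhs =", lhs, " rhs =", rhs)
```

The check only works because both matrices are Hermitian; for a non-Hermitian pair, $\langle\psi|BA|\psi\rangle$ is no longer the conjugate of $\langle\psi|AB|\psi\rangle$ and the identity can fail.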

The Derivation

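Through the Cauchy-Schwarz inequality, the two vectors $A|\psi\rangle$ and $B|\psi\rangle$ can be related in the following way, where $A$ and $B$ are Hermitian.

$$|\langle\psi|AB|\psi\rangle|^2 \le \langle\psi|A^2|\psi\rangle\langle\psi|B^2|\psi\rangle$$

Using earlier knowledge about commutators, $4|\langle\psi|AB|\psi\rangle|^2 = |\langle\psi|[A, B]|\psi\rangle|^2 + |\langle\psi|\{A, B\}|\psi\rangle|^2 \ge |\langle\psi|[A, B]|\psi\rangle|^2$, so the inequality can be rewritten as below.

$$|\langle\psi|[A, B]|\psi\rangle|^2 \le 4\langle\psi|A^2|\psi\rangle\langle\psi|B^2|\psi\rangle$$

This can be put in the nicer notation described earlier.

$$|\langle [A, B]\rangle|^2 \le 4\langle A^2\rangle\langle B^2\rangle$$

Now assume $A = C - \langle C\rangle$ and $B = D - \langle D\rangle$, where $C$ and $D$ are observables. The constants $\langle C\rangle$ and $\langle D\rangle$ commute with everything, so $[A, B] = [C, D]$, while $\langle A^2\rangle = [\Delta(C)]^2$ and $\langle B^2\rangle = [\Delta(D)]^2$. These new definitions are substituted into the above inequality.

$$|\langle [C, D]\rangle|^2 \le 4[\Delta(C)]^2[\Delta(D)]^2$$

If one treats $C$ as position $X$ and $D$ as momentum $P$, then one can make use of the canonical commutation relation in quantum mechanics, which states that the commutator between position and momentum is $[X, P] = i\hbar$.

$$\hbar^2 \le 4[\Delta(X)]^2[\Delta(P)]^2$$

Because variance is merely the square of the standard deviation, one can take the square root of both sides to arrive at the Heisenberg uncertainty principle.

$$\Delta(X)\Delta(P) \ge \frac{\hbar}{2}$$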
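The general form of the result, $\Delta(C)\Delta(D) \ge |\langle [C, D]\rangle|/2$, can be tested numerically. No pair of finite matrices satisfies $[X, P] = i\hbar$ exactly, because the trace of a commutator is zero, so this sketch swaps in the Pauli matrices $X$ and $Y$ on a single qubit as the two observables. That substitution, the helper names, and the random seed are assumptions of the example.

```python
import math
import random

PAULI_X = [[0j, 1 + 0j], [1 + 0j, 0j]]
PAULI_Y = [[0j, -1j], [1j, 0j]]

def matvec(m, v):
    """Apply the matrix m (nested lists) to the column vector v."""
    return [sum(m[r][c] * v[c] for c in range(len(v))) for r in range(len(m))]

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)] for r in range(n)]

def expectation(m, psi):
    """<psi|m|psi> as a complex number."""
    return sum(a.conjugate() * b for a, b in zip(psi, matvec(m, psi)))

def std_dev(m, psi):
    """Standard deviation sqrt(<M^2> - <M>^2) of a Hermitian observable."""
    mean = expectation(m, psi).real
    mean_square = expectation(matmul(m, m), psi).real
    return math.sqrt(max(mean_square - mean ** 2, 0.0))  # clamp rounding noise

def commutator_bound(c, d, psi):
    """Right-hand side of the inequality: |<[C, D]>| / 2."""
    cd, dc = matmul(c, d), matmul(d, c)
    n = len(c)
    comm = [[cd[r][k] - dc[r][k] for k in range(n)] for r in range(n)]
    return abs(expectation(comm, psi)) / 2

def random_state(rng):
    """Random normalized qubit state."""
    amps = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(2)]
    norm = math.sqrt(sum(abs(a) ** 2 for a in amps))
    return [a / norm for a in amps]

# For |0> the bound is tight: both sides equal 1.
ket0 = [1 + 0j, 0j]
lhs0 = std_dev(PAULI_X, ket0) * std_dev(PAULI_Y, ket0)
rhs0 = commutator_bound(PAULI_X, PAULI_Y, ket0)

# For random states the left side should never fall below the right side.
rng = random.Random(1)
states = [random_state(rng) for _ in range(1000)]
min_slack = min(
    std_dev(PAULI_X, s) * std_dev(PAULI_Y, s) - commutator_bound(PAULI_X, PAULI_Y, s)
    for s in states
)

print("|0>: lhs =", lhs0, " rhs =", rhs0)
print("smallest slack over random states:", min_slack)
```

For position and momentum the right-hand side is the constant $\hbar/2$ regardless of the state. For the Pauli pair it depends on the state through $\langle Z\rangle$, which is why the sketch samples many states.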

If you want to know more or see where I learned it from, read “Quantum Computation and Quantum Information” by Michael Nielsen and Isaac Chuang. It should be in the “Books” section of this site.