In this part of the lesson, we'll discuss the fidelity between quantum states, which is a measure of their similarity — or how much they "overlap."
Given two quantum state vectors, the fidelity between the pure states associated with these quantum state vectors equals the absolute value of the inner product between the quantum state vectors.
This provides a basic way to measure their similarity: the result is a value between 0 and 1, with larger values indicating greater similarity.
In particular, the value is zero for orthogonal states (by definition), while the value is 1 for states equivalent up to a global phase.
Intuitively speaking, the fidelity can be seen as an extension of this basic measure of similarity, from quantum state vectors to density matrices.
It's fitting to begin with a definition of fidelity.
At first glance, the definition that follows might look unusual or mysterious, and perhaps not easy to work with.
The function it defines, however, turns out to have many interesting properties and multiple alternative formulations, making it much nicer to work with than it may initially appear.
Definition
Let ρ and σ be density matrices representing quantum states of the same system.
The fidelity between ρ and σ is defined as
F(ρ,σ)=Trρσρ.
Remark
Although this is a common definition, it is also common that the fidelity is defined as the square of the quantity defined here, which is then referred to as the root-fidelity.
Neither definition is right or wrong — it's essentially a matter of preference.
Nevertheless, one must always be careful to understand or clarify which definition is being used.
To make sense of the formula in the definition, notice first that ρσρ is a positive semidefinite matrix:
ρσρ=M†M
for M=σρ.
Like all positive semidefinite matrices, this positive semidefinite matrix has a unique positive semidefinite square root, the trace of which is the fidelity.
For every square matrix M, the eigenvalues of the two positive semidefinite matrices M†M and MM† are always the same, and hence the same is true for the square roots of these matrices.
Choosing M=σρ and using the fact that the trace of a square matrix is the sum of its eigenvalues, we find that
F(ρ,σ)=Trρσρ=TrM†M=TrMM†=Trσρσ=F(σ,ρ).
So, although it is not immediate from the definition, the fidelity is symmetric in its two arguments.
An equivalent way to express the fidelity is by this formula:
F(ρ,σ)=σρ1.
Here we see the trace norm, which we encountered in the previous lesson in the context of state discrimination.
The trace norm of a (not necessarily square) matrix M can be defined as
∥M∥1=TrM†M,
and by applying this definition to the matrix σρ we obtain the formula in the definition.
An alternative way to express the trace norm of a (square) matrix M is through this formula.
∥M∥1=UunitarymaxTr(MU).
Here the maximum is over all unitary matrices U having the same number of rows and columns as M.
Applying this formula in the situation at hand reveals another expression of the fidelity.
One last point on the definition of fidelity is that every pure state is (as a density matrix) equal to its own square root, which allows the formula for the fidelity to be simplified considerably when one or both of the states is pure.
In particular, if one of the two states is pure we have the following formula.
F(∣ϕ⟩⟨ϕ∣,σ)=⟨ϕ∣σ∣ϕ⟩
If both states are pure, the formula simplifies to the absolute value of the inner product of the corresponding quantum state vectors, as was mentioned at the start of the section.
The fidelity has many remarkable properties and several alternative formulations.
Here are just a few basic properties listed without proofs.
For any two density matrices ρ and σ having the same size, the fidelity F(ρ,σ) lies between zero and one: 0≤F(ρ,σ)≤1. It is the case that F(ρ,σ)=0 if and only if ρ and σ have orthogonal images (so they can be discriminated without error), and F(ρ,σ)=1 if and only if ρ=σ.
The fidelity is multiplicative, meaning that the fidelity between two product states is equal to the product of the individual fidelities:
F(ρ1⊗⋯⊗ρm,σ1⊗⋯⊗σm)=F(ρ1,σ1)⋯F(ρm,σm).
The fidelity between states is nondecreasing under the action of any channel. That is, if ρ and σ are density matrices and Φ is a channel that can take these two states as input, then it is necessarily the case
that
F(ρ,σ)≤F(Φ(ρ),Φ(σ)).
The Fuchs-van de Graaf inequalities establish a close (though not exact) relationship between fidelity and trace distance: for any two states ρ and σ we have
1−21∥ρ−σ∥1≤F(ρ,σ)≤1−41∥ρ−σ∥12.
The final property can be expressed in the form of a figure:
Specifically, for any choice of states ρ and σ of the same system, the horizontal line that crosses the y-axis at F(ρ,σ) and the vertical line that crosses the x-axis at 21∥ρ−σ∥1 must intersect within the gray region bordered below by the line y=1−x and above by the unit circle.
The most interesting region of this figure from a practical viewpoint is the upper left-hand corner of the gray region:
if the fidelity between two states is close to one, then their trace distance is close to zero, and vice versa.
Next we'll take a look at a simple but important fact, known as the gentle measurement lemma, which connects fidelity to non-destructive measurements.
It's a very useful lemma that comes up from time to time, and it's also noteworthy because the seemingly clunky definition for the fidelity actually makes the lemma very easy to prove.
The set-up is as follows.
Let X be a system in a state ρ and let {P0,…,Pm−1} be a collection of positive semidefinite matrices representing a general measurement of X.
Suppose further that if this measurement is performed on the system X while it's in the state ρ, one of the outcomes is highly likely.
To be concrete, let's assume that the likely measurement outcome is 0, and specifically let's assume that
Tr(P0ρ)>1−ε
for a small positive real number ε>0.
What the gentle measurement lemma states is that, under these assumptions, the non-destructive measurement obtained from {P0,…,Pm−1} through Naimark's theorem causes only a small disturbance to ρ in case the likely measurement outcome 0 is observed.
More specifically, the lemma states that the fidelity-squared between ρ and the state we obtain from the non-destructive measurement, conditioned on the outcome being 0, is greater than 1−ε.
F(ρ,Tr(P0ρ)P0ρP0)2>1−ε.
We'll need a basic fact about measurements to prove this.
The measurement matrices P0,…,Pm−1 are positive semidefinite and sum to the identity, which allows us to conclude that all of the eigenvalues of P0 are real numbers between 0 and 1.
This follows from the fact that, for any unit vector ∣ψ⟩, the value
⟨ψ∣Pa∣ψ⟩ is a nonnegative real number for each a∈{0,…,m−1} (because each Pa is positive semidefinite), together with these numbers summing to one.
a=0∑m−1⟨ψ∣Pa∣ψ⟩=⟨ψ∣(a=0∑m−1Pa)∣ψ⟩=⟨ψ∣I∣ψ⟩=1.
Hence ⟨ψ∣P0∣ψ⟩ is always a real number between 0 and 1, and this implies that every eigenvalue of P0 is a real number between 0 and 1 because we can choose ∣ψ⟩ specifically to be a unit eigenvector corresponding to whichever eigenvalue is of interest.
From this observation we can conclude the following inequality for every density matrix ρ.
Tr(P0ρ)≥Tr(P0ρ)
In greater detail, starting from a spectral decomposition
from the fact that ⟨ψk∣ρ∣ψk⟩ is a nonnegative real number and λk≥λk for each k=0,…,n−1. (Squaring numbers between 0 and 1 can never make them larger.)
Now we can prove the gentle measurement lemma by evaluating the fidelity and then using our inequality.
First, let's simplify the expression we're interested in.
Notice that these are all equalities — we've not used our inequality (or any other inequality) at this point, so we have an exact expression for the fidelity.
We can now use our inequality to conclude
To conclude the lesson, we'll take a look at Uhlmann's theorem, which is a fundamental fact about the fidelity that connects it with the notion of a purification.
What the theorem says, in simple terms, is that the fidelity between any two quantum states is equal to the maximum inner product (in absolute value) between two purifications of those states.
Theorem
Uhlmann's theorem: Let ρ and σ be density matrices representing states of a system X, and let Y be a system having at least as many classical states as X. The fidelity between ρ and σ is given by
where the maximum is taken over all quantum state vectors ∣ϕ⟩ and ∣ψ⟩ of (X,Y).
We can prove this theorem using the unitary equivalence of purifications — but it isn't completely straightforward and we'll make use of a trick along the way.
To begin, consider spectral decompositions of the two density matrices ρ and σ.
ρσ=a=0∑n−1pa∣ua⟩⟨ua∣=b=0∑n−1qb∣vb⟩⟨vb∣
The two collections {∣u0⟩,…,∣un−1⟩} and {∣v0⟩,…,∣vn−1⟩} are orthonormal bases of eigenvectors of ρ and σ, respectively, and p0,…,pn−1 and q0,…,qn−1 are the corresponding eigenvalues.
We'll also define ∣u0⟩,…,∣un−1⟩ and
∣v0⟩,…,∣vn−1⟩ to be the vectors obtained by taking the complex conjugate of each entry of ∣u0⟩,…,∣un−1⟩ and ∣v0⟩,…,∣vn−1⟩.
That is, for an arbitrary vector ∣w⟩ we can define ∣w⟩ according to the following equation for each c∈{0,…,n−1}.
⟨c∣w⟩=⟨c∣w⟩
Notice that for any two vectors ∣u⟩ and ∣v⟩ we have
⟨u∣v⟩=⟨v∣u⟩.
More generally, for any square matrix M we have the following formula.
⟨u∣M∣v⟩=⟨v∣MT∣u⟩
It follows that ∣u⟩ and ∣v⟩ are orthogonal if and only if ∣u⟩ and ∣v⟩ are orthogonal, and therefore
{∣u0⟩,…,∣un−1⟩} and