adapted from L.Piela, Ideas of Quantum Chemistry.
Variational principle
Solving the exact Hamiltonian for complicated systems is generally undesirable, and involves several approximations to be made possible. One of the most widely used technique in approximating a solution is called the Variational principle. In mathematical terms, it states that
for some arbitrary that is:
- the same coordinates (e.g. , or ) of the corresponding Schrodinger equation
- normalizable
Then, we could say , where is the true ground state energy, and if and only if is the exact ground state wavefunction of .
Proof:
We know that of is a set of complete orthonormal basis. That means any function can be written as:
where
Plugging into the expectation value of energy , we get:
With the theorem in mind, how do we actually apply it?
Variational Method
Suppose a trial function and we know its analytical form. We introduce parameters , which we may modify, into . Then, for some , we can say . Recall the variational principle, equation , since on the right hand side we integrate out , then the left hand side depends exclusively on the variational parameter, which is in this case. i.e.
And the problem amounts to an optimization problem of finding such that is minimized, approaching the number we want.
In general, it’s not an easy task, because
is a necessary but not sufficient condition to find the global minimum of . More details of this can be found in texts related to optimization.
The Linear Variational Principle (Ritz Method)
Here we will talk about a special case of variational method, where we represent the trial function as a linear combination of some other functions that we know of, called the basis functions. The previously parameter will be the linear coefficients of the basis functions. i.e.
This special case allows us to find the global minimum by just looking at the derivative. We can rewrite the equation for energy as
where we define
These matrix elements can be obtained by explicitly integrating because we know the exact forms of .
Also notice that is a function of 's, so let’s look at its derivative.
Let
Assume all 's are real for convenience, then for each , by the product rule we have:
The individual derivatives for and are
Similarly,
If you are having trouble thinking of these derivatives, try expanding out the individual sums with the appropriate indices, set , and evaluate which terms are not relevant to . Then apply this for general . Now we can write
Setting the above to 0, we get
This is also called the secular equation, and one way to solve it is by requiring the determinant of the matrix be 0, which reduces to diagonalizing the matrix . The resulting eigenvalues are the energies, and the eigenvectors are the linear coefficients of basis functions that corresponds to the energy. The smallest eigenvalue corresponds to the approximated ground state energy, and the larger eigenvalues can be thought of in some sense excited state energies.