Basis Sets Notes

Notes from reading Andrew Leach’s Molecular Modelling book.

To reiterate, basis sets are the orbital sets used to approxmiate the atomics orbitals, which are linearly combined (weighted) to form molecular orbitals.

Slater type orbitals (STOs) is one way to simplify atomic orbitals obtained from solving the Schrondinger equation on the hydrogen atom. It replaces the Legendre polynomial spherical harmonics in the radial parts with a simpler form involving an empirical quantity, the effect charge.

Simplifying the functional form is all about computaional efficiency. The adaptation of Gaussian type orbital (GTOs) over STOs is due to its efficiency in computation. Such orbital has a general form of:

$φ = x^ay^bz^c exp(-\alpha r^2)$

The exponential format allows orbital joining (multiplication) very cheap to calculate.

The powers a,b,c which can take on values of 1 or 0. When all of them equal to zero, it is a zeroth-order Gaussian that resembles the formalism of s-type atomic orbitals. When one of the three is one it is a first-order Gaussian, resembling p-type orbitals.

Often a linear combinations of GTOs are used to approximate STOs. Fitting is done via minimising least square error for example. After optimisation the linear combination, the coefficient in front of each GTO and the $\alpha$ is often fixed when calculating the final MO form. In addition, the $\alpha$ term is fixed for s and p type orbitals. This is again for saving computational cost.

Some Basis Sets

Minimal basis set includes all the filled orbitals for an atom, each such orbital is represented by a set of Gaussian orbitals. The notation “STO-3G” therefore means each Slater type orbital is approximated by 4 Gaussian orbitals. For ab initio calculation, approximation by 3 Gaussians is the bare minimum. Reminder of the hierarchy, MOs are approxmiate by LCAO, AOs are approximated by STO, and STOs are approximated by GTOs.

Split valence basis set accounts for anistropy problems for the minimal basis set by allowing more than one type of STOs to be used to model the valence atomic orbitals. The most used type is the double split zeta basis set (zeta is the coefficient in front of the exponent in the radial part of STO) which models valence orbitals with two different STOs and inner shell orbitals with one. For example, in the 3-21G basis set, each inner shell atomic orbital is approximated by one STO, which is in turn approximated by three GTOs. While the valence shell orbitals are modeled by two different types of STOs, one of the two is approximated by two GTOs, the other by one.

One can further account for the charge distortion effect on the orbitals by modelling polarisations - reducing the charge symmetry by including un-occupied p orbitals to the basis set to atoms only having filled s orbitals, similarly adding d to atoms only with s and p characters. 3-21G* means polarisations is done on all heavy atom (with occupied d orbitals). Two asterisks means polarisation is done on hydrogen and helium (p type polarisation) too. To account for species with large electron density away from the nuclei (where all Gaussians for a given orbital center on), one can add extra set of diffused orbitals. This is what the “+” notation means.

So, 3-21++G* means for a given atom, each of its inner shell orbital is modelled by one STO - approximated by three GTOs. Each of its valence orbital is modelled by two STOs, one approximated by two GTOs and the other by one. On top, each atom has a extra set of diffuse s and p type GTOs added (indicated by + +, + would be for each non-hydrogen atom). Additionally, extra unoccupied (d) orbitals (still approximated by GTOs) is added for each non-hydrogen atom. The asterisk can be replaced by more precise notation such as 3-21++G(3df, 3pd), which means 3 d-type GTOs 1 f-type GTOs are added for heavy atoms and 3 p-type GTOs and 1 d-type GTO is added to each hydrogen.

There are other basis sets too, e.g. see wikipedia.

At this point, a quick comparison between molecular mechanics and quantuam mechanics can show why the latter is so much more expensive than the former. Say for evaluation of the system evolution over time. For each time step, a MM approach only requires the evaluation of the analytical functions composing of the different energy contributions (torsion, electrostatic, etc) due to the nuclear coordinate. But a QM calculation would need to iteratively solve for the new MOs for the changed nuclei coordinates before even starting the energy evaluations.