Molecular vibrations and rotations Prof. dr. Ad van der Avoird
by user
Comments
Transcript
Molecular vibrations and rotations Prof. dr. Ad van der Avoird
Molecular vibrations and rotations Prof. dr. Ad van der Avoird January 28, 2010 Contents 0.1 Books . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 0.2 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 0.3 Vibrations of polyatomic molecules (harmonic) . . . . . . . . . 3 0.3.1 Harmonic oscillator in one dimension x . . . . . . . . . 3 0.3.2 Two coupled oscillators . . . . . . . . . . . . . . . . . . 5 0.3.3 Harmonic vibrations of molecules in mass-weighted coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . 8 0.3.4 Harmonic vibrations of molecules in cartesian (displacement) coordinates . . . . . . . . . . . . . . . . . . . . . 12 0.3.5 Invariance conditions on the matrix F . . . . . . . . . . 14 0.3.6 Vibrations in internal coordinates (3n-6) . . . . . . . . 18 0.4 Rotations of rigid (non-linear) molecules . . . . . . . . . . . . 21 0.4.1 Rotation matrices and infinitesimal rotations . . . . . . 22 0.4.2 The rigid rotor in classical mechanics . . . . . . . . . . 24 0.4.3 The rigid rotor Hamiltonian . . . . . . . . . . . . . . . 31 0.4.4 The rigid rotor in quantum mechanics . . . . . . . . . 34 0.5 The molecular vibration-rotation Hamiltonian . . . . . . . . . 38 0.5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . 38 0.5.2 The (classical) kinetic energy of a semi-rigid molecule . 39 0.5.3 Semi-rigid molecules in quantum mechanics . . . . . . 44 0.5.4 Floppy molecules and Van der Waals molecules . . . . 46 1 0.1 Books 1. Molecular vibrations, E.B. Wilson, J.C. Decius, and P.C. Cross, McGrawHill, New York (1955) 2. Vibrational states, S. Califano, Wiley, London (1976) 3. Molecular Vibrational-Rotational spectra, D. Papousek and M.R. Aliev, Elsevier, Amsterdam (1982) 4. (Angular momentum in quantum physics, L.C. Biedenharn and J.D. Louck, Addison-Wesley, Reading (1981)) 0.2 Introduction Nuclear motion — Born-Oppenheimer approximation, second step For n nuclei with coordinates r k , k = 1, . . . , n the Schrödinger equation for the nuclear motion problem is given by: Ĥ(r 1 , . . . , rn )Ψ(r 1 , . . . , rn ) = EΨ(r1 , . . . , r n ) (1) with the Hamiltonian Ĥ consisting of the kinetic and potential energy operators: Ĥ(r 1 , . . . , rn ) = T̂(r1 , . . . , rn ) + V̂(r1 , . . . , r n ). (2) The potential energy V (r 1 , . . . , r n ) can be obtained from the electronic Schrödinger equation (Born-Oppenheimer approximation, first step). It is an energy eigenvalue of the electronic Hamiltonian with coordinates of the nuclei fixed at rk , k = 1, . . . , n. Variation of the nuclear coordinates r k yields the potential energy surface V (r 1 , . . . , r n ). Solution of the nuclear motion Schrödinger equation for molecules → translations, rotations, vibrations Spectroscopy: microwave, submilimeter, IR / Raman 0 − 30 cm−1 100 − 5000 cm−1 rotations vibrations 2 Lasers → very high resolution (in principle) (7 significant digits) Doppler and collision broadening occurring in the gas phase are avoided in molecular beams (“supersonic nozzle beams”: very low T) Globally: rotational spectrum → molecular structure vibrational spectrum → intramolecular forces (“force field”) associated with chemical bonds spectra of van der Waals molecules → intermolecular (noncovalent) forces 0.3 0.3.1 Vibrations of polyatomic molecules (harmonic) Harmonic oscillator in one dimension x Momentum operator, in atomic units (h̄ = 1): p̂x = 1 d i dx (3) The Hamiltonian of a harmonic oscillator is: Ĥ = T̂ + V̂ = p̂2x 1 1 d2 1 2 + f x̂2 = − + f x̂ , 2m 2 2m dx2 2 (4) d2 V dV is the force constant. It is assumed that = 0 (potential 2 dx dx minimum) and that higher than second derivatives of V can be neglected. where f = It is convenient to introduce mass-weighted coordinates: √ q = mx 1 d 1 d = p̂ = √ i m dx i dq 3 (5) (6) and write: Ĥ = 1 2 1 2 p̂ + λ q̂ 2 2 (7) f m √ The frequency of the harmonic oscillator is ω0 = λ. where λ = Eigenvalues are the energies Eigenfunctions are where Hv are Hermite polynomials. 1 Ev = ω0 (v + ) v = 0, 1, 2, . . . 2 √ Ψv (q) = Hv ( ω0 q) exp − 12 ω0 q 2 , 4 (8) (9) 0.3.2 Two coupled oscillators Example 1: Figure 1: Linear CO2 with two internal coordinates. Example 2: Figure 2: Diatomic molecule in laboratory coordinates. Ĥ = 1 p̂2 1 p̂21 + 2 + f11 q̂21 + f22 q̂22 +f12 q̂1 q̂2 2m1 2m2 2 2 (10) Let us assume symmetry, so that m1 = m2 = m and f11 = f22 . Define new coordinates: √ √ Q1 = (q1 + q2 )/ 2 q1 = (Q1 + Q2 )/ 2 (11) √ √ Q2 = (q1 − q2 )/ 2 q2 = (Q1 − Q2 )/ 2 (12) The associated momentum operators are: 1 ∂q1 ∂ ∂q2 ∂ 1 ∂ = + P̂1 = i ∂Q1 i ∂Q1 ∂q1 ∂Q1 ∂q2 1 ∂ = P̂2 = i ∂Q2 5 √ = (p̂1 + p̂2 )/ 2 (13) √ = (p̂1 − p̂2 )/ 2 (14) Hence: 1 ∂ p̂1 = = i ∂q1 1 ∂ = p̂2 = i ∂q2 1 ∂Q1 ∂ ∂Q2 ∂ + i ∂q1 ∂Q1 ∂q1 ∂Q2 √ = P̂1 + P̂2 / 2 √ = P̂1 − P̂2 / 2 Substitution of these new coordinates gives: 2 2 2 1 1 P̂1 + P̂2 + P̂1 − P̂2 + f11 Q̂1 + Q̂2 + Ĥ = 4m 4 1 2 1 + f12 Q̂1 + Q̂2 Q̂1 − Q̂2 + f22 Q̂1 − Q̂2 2 4 1 1 2 2 2 = P̂1 + P̂2 + (f11 + f22 + 2f12 ) Q̂1 + 2m 4 1 2 1 + (f11 + f22 − 2f12 ) Q̂2 + (f11 − f22 ) Q̂1 Q̂2 4 2 (15) (16) (17) (18) The last term vanishes because f11 = f22 and we get: Ĥ = 2 2 1 2 1 1 2 1 P̂1 + (f11 + f12 ) Q̂1 + P̂2 + (f11 − f12 ) Q̂2 = Ĥ1 + Ĥ2 . 2m 2 2m 2 (19) As one can see, our system separates into two uncoupled oscillators, with: √ √ coordinates Q1 = (q1 + q2 )/ 2 Q2 = (q1 − q2 )/ 2 (20) 1/2 1/2 f11 + f12 f11 − f12 frequencies ω1 = ω2 = (21) m m The energy of the system is: E = Ev1 + Ev2 = ω1 v1 + 6 1 2 + ω2 v2 + 1 2 (22) Example 1: CO2 Figure 3: Q1 and Q2 for CO2 . √ Q1 = (q1 + q2 )/√2 — symmetric stretch vibration Q2 = (q1 − q2 )/ 2 — asymmetric stretch vibration Q1 and Q2 are called “normal mode” coordinates Example 2: Diatomic molecule Figure 4: Q1 and Q2 for a diatomic molecule. √ Q1 = (q1 + q2 )/√2 — overall translation (center of mass motion) Q2 = (q1 − q2 )/ 2 — stretch vibration (relative motion) In Example 2 the molecule (center of mass) freely moves in space, the energy does not depend on Q1 , and we get ω1 = 0. This corresponds to the invariance condition: f11 + f12 ( = f12 + f22 ) = 0. 7 (23) √ For the stretch vibration Q2 = (q1 − q2 )/ 2 we find: 1/ 1/ 1/2 f11 − f12 2 2f11 2 f11 ω2 = = = , (24) m m µ m where µ = is the reduced mass. 2 Conclusion: √ Both examples show that the simple transformation Q1,2 = (q1 ± q2 )/ 2 separates the potential energy, while the kinetic energy remains separated. As a result we get two uncoupled harmonic oscillators. 0.3.3 Harmonic vibrations of molecules in mass-weighted coordinates Coordinates of atoms: rk = ak + dk (k = 1, . . . , n) (25) where ak denotes equilibrium positions and dk stands for displacements. The components of the coordinates are: xk akx dkx yk = aky + dky (26) zk akz dkz Altogether: rkα = akα + dkα ri = ai + di k = 1, . . . , n; α = 1, 2, 3(x, y, z) i = 1, . . . . . . , 3n (27) (28) Atomic masses: mk , k = 1, . . . , n, or mi , i = 1, . . . , 3n m1 = m2 = m3 m4 = m5 = m6 etc. Mass-weighted coordinates (for displacements): √ qi = mi di (29) Kinetic energy Classical expression: T = 1X 1X 2 1X 2 2 mi ḋi = q̇ = p 2 i 2 i i 2 i i 8 (30) Quantum expression: T̂ = 1X 2 1 X ∂2 p̂i = − 2 i 2 i ∂qi2 (31) Potential energy: V = V0 + X ∂V i = V0 + X ∂qi qi 0 fi qi i 1 X X ∂2V + qi qj 2 i j ∂qi ∂qj 0 1 XX fij qi qj + 2 i j + ... (32) + ... (33) Forces in equilibrium position are equal to zero: ∂V fi = =0 for all i = 1, . . . , 3n ∂qi 0 (34) V0 is a constant (we will omit it). We use the harmonic approximation: V = 1X fij qi qj 2 i,j Matrix - vector notation: q1 p1 .. .. . . q = qi p = pi . . .. .. q3n p3n (35) f11 . . . f1,3n .. .. f = ... . . f3n,1 . . . f3n,3n (36) We use column-vectors, f is the force constant matrix. Now the Hamiltonian may be written: 1 1 H = pT p + q T f q (37) 2 2 We transform the coordinates as follows: Q = L−1 q q=LQ (38) For the momenta we get: P = LT p p = LT 9 −1 P (39) Proof (comment by PG): we know that qk = Lkm Qm . For coordinate Qj the conjugate momentum is (we use the Einstein’s summation convention and the chain rule) 1 ∂ 1 ∂qk ∂ 1 ∂ ∂ = = (Lkm Qm ) = i ∂Qj i ∂Qj ∂qk i ∂Qj ∂qk 1 ∂ 1 ∂ = Lkm δjm = Lkj = Lkj p̂k = LT jk p̂k i ∂qk i ∂qk P̂j = (40) Substitution of the new coordinates and momenta into H yields: T 1 1 H = P T L−1 L−1 P + QT LT f L Q 2 2 (41) Let us choose L such that it is real and: L−1 L−1 T =E (42) Where E is the identity matrix. Taking the inverse, it follows that: LT L = E (43) And for non-singular L it also follows that: LT = L−1 (44) which means that L is an orthogonal (and unitary, since it is real) matrix. We may further choose L such that: LT f L = Λ (45) λ1 0 .. where Λ = is a diagonal matrix. Multiplying by L gives . 0 λ3n f L=LΛ (46) For ith column of L, denoted by Li , we have: f Li = Li λi (47) which shows that Li is an eigenvector of f with the eigenvalue λi . Thus to find L we have to diagonalize f, i.e., to find eigenvectors and eigenvalues of f. 10 Substitution of these results into H yields: 1 1 X 2 1 1X 2 1X H = P T P + QT Λ Q = Pi + λi Q2i = Pi + λi Q2i (48) 2 2 2 i 2 i 2 i In other words the transformation q → Q, p → P gave us 3n uncoupled harmonic oscillators. The frequencies are p ω i = λi (49) The Qi are called “normal coordinates”: X X Lji qj Qi = L−1 ij qj = (50) j j or: Q = L−1 q = LT q √ We used mass-weighted coordinates qi = mi di , and we can write: m1 0 m2 1/ 2 q = M d with M = . . . 0 m3n 1 (51) (52) 1/ M /2 has mi 2 on the diagonal. The normal coordinates are then given in terms of atomic displacements by: 1 Q = LT M /2 d Example: H2 O normal modes. Figure 5: Normal modes of water molecule. · Q1 — symmetric stretch vibration 11 (53) · Q2 — asymmetric stretch vibration · Q3 — bending vibration + 3 overall translations (frequencies = 0) + 3 overall rotations (frequencies = 0) 0.3.4 Harmonic vibrations of molecules in cartesian (displacement) coordinates d — coordinates, F — force constants Fij = ∂2V ∂di ∂dj 1 1/ 2 = mi 0 ∂2V ∂qi ∂qj 1/ 1/ 1 f = M − /2 F M − /2 F = M /2 f M /2 (54) 0 1 1 1/ mj 2 = mi 2 fij mj 2 (55) The Hamiltonian of the system in simple cartesian coordinates (d and pd , 1 ∂ 1 1/ ∂ pd,i = = mi 2 ) is: i ∂di i ∂qi X (pd )2 1 X 1 T 1 T −1 i Fij di dj + H = pd M pd + d F d = 2 2 2m 2 i,j i i (56) To uncouple H we return to normal coordinates Q: Q = L −1 d pd = L T Applying this to H: −1 d=L Q (57) P (58) T 1 1 H = P T L −1 M −1 L −1 P + QT L T F L Q 2 2 which becomes: (59) 1 1 H = P T P + QT Λ Q 2 2 (60) L TM L = E (61) if: 12 and L TF L = Λ (62) From the last equation we get: F L = LT Substituting the previous one gives: −1 (63) Λ F L =M L Λ (64) so L must satisfy a generalized eigenvalue problem of F with M as the metric (or “overlap”) matrix. 1 1 From the equation F = M /2 f M /2 follows directly the relation between the eigenvectors L and L : 1 1 L = M − /2 L L = M /2 L (65) 1 (It can be also easily seen from q = L Q, d = L Q and q = M /2 d.) Because L−1 = LT we have: L −1 = L T M (66) and the normal coordinates are given in terms of atomic displacements by: Q = L −1 d = L T M d or: Qi = L Ti M d = X Lji mj dj (67) (68) i where L i is the ith column of L . Remember: • the index i runs over eigenvalues λi and the associated normal coordinates Qi • the index j = kα, k = 1, . . . , n, α = 1, 2, 3 (x, y, z) Thus Qi = XX k Lkα,i mk dkα α 13 (69) 0.3.5 Invariance conditions on the matrix F We will now derive invariance conditions which show that six eigenvalues λi of the matrix F are equal to zero. The associated normal coordinates correspond to overall translations and rotations of the whole molecule. 2 X ∂V 1 XX ∂ V V = V0 + di + di dj = ∂di 0 2 i j ∂di ∂dj 0 i n n (70) X X ∂V 1 XX Fkα,lβ dkα dlβ = V0 + dkα + ∂dkα 0 2 k,l=1 α,β k=1 α=1,2,3 | {z } =0 The force acting on atom k in the direction α = x, y, z is: X n X ∂V ∂V ∂V + Fkα,lβ dlβ = = ∂di ∂dkα ∂dkα 0 l=1 β | {z } (71) =0 Translational invariance: this force should not change if we move all atoms by the same distance: dlβ = δβ for l = 1, . . . , n, hence 0= n X X l=1 Fkα,lβ δβ = X β β δβ n X Fkα,lβ (72) l=1 As this should apply to any δ we must require that n X Fkα,lβ = 0 l=1 (73) for all k = 1, . . . ,n; α = 1, 2, 3; β = 1, 2, 3(x, y, z) This is the first set of invariance conditions. Note that summation here is over l, with fixed k, α and β. Different k, α and β give different conditions, so indeed we have here a set of invariance conditions. Equation (64) (F L = M L Λ) written in components is: X X Fij Ljr = mi Lij Λjr = mi Lir λr j (74) j The matrix elements Ljr for j = 1, . . . , 3n form the r th column of L i.e. the r th eigenvector L r i = (k, α) j = (l, β) n X X l=1 Fkα,lβ Llβ,r = mk Lkα,r λr β 14 (75) Now suppose that Llβ,r ( 1 = 0 for all l = 1, . . . , n and for β = 1 for β = 2, 3 (76) It follows immediately from the invariance condition, eq. (73) that the left hand side of equation (75) equals 0. Thus also the right hand side must be equal to 0 and, since not all components of L r are equal to zero, λr must be equal to zero. So we have found one eigenvalue λr = 0 and the associated eigenvector L r with components = 1 for β = 1 and = 0 for β = 2, 3. Similarly we find two more eigenvalues λr′ = λr′′ = 0 with eigenvectors L r′ and L r′′ that have components in the directions β = 2 and β = 3. Let us call these three eigenvalues: λ1 = λ2 = λ3 = 0 The associated normal coordinates are: X Lkα,r mk dkα Qr = (77) (78) kα Substituting the conditions for L r into this general formula leads to: Q1 = n X mk dk1 k=1 Q2 = Q3 = n X k=1 n X mk dk2 (79) mk dk3 k=1 This can be written in a more compact way: n Q1 X Q2 = mk dk k=1 Q3 (80) These are the normal coordinates for the overall translations of the molecule (λ1 = λ2 = λ3 = 0). In fact, they are the coordinates of the center of mass of the molecule. Rotational invariance 15 We will now see how all the atoms move when we rotate the whole molecule and we will again require that the forces on the atoms do not change during this movement. We rotate the molecule around the axis eγ , γ = 1, 2, 3(= x, y, z) over an angle ∆ϕγ . For very small (infinitesimal) rotations it turns out that a vector r 0 changes as follows: r = r 0 + ∆ϕγ eγ×r 0 (81) (This will be demonstrated in chapter 3.) Atom k has equilibrium position ak . After the rotation of the molecule its position becomes rk = ak + dk = ak + ∆ϕγ eγ×ak (82) Thus the displacement of the k th atom is: dk = ∆ϕγ eγ×ak (83) The force acting on atom k in the direction α was zero when the atom was in its equilibrium position, and it must still be zero after we rotated the whole molecule (because the relative positions of the atoms did not change): XX ∂V = Fkα,lβ dlβ (84) 0= ∂dkα l β After substituting dlβ : 0= n X X l=1 β Fkα,lβ ∆ϕγ eγ×al β (85) As this should apply to any ∆ϕγ for γ = 1, 2 and 3 we must require that the following invariance conditions are fulfilled: n X X β=1,2,3 l=1 Fkα,lβ eγ×al β = 0 (86) for k = 1, . . . , n, α = 1, 2, 3, γ = 1, 2, 3(= x, y, z) It follows that the vector L r with components Lkα,r = eγ×ak α is an eigenvector of F with eigenvalue λr = 0. There are three such vectors: γ = 1, 2, 3. Let us label these eigenvectors with r = 4, 5, 6. Associated normal coordinates are: Qγ+3 = n X X k=1 α=1,2,3 mk eγ×ak α dkα = 16 n X k=1 mk eγ×ak · dk (87) The vector product satisfies the relation: (a × b) · c = a · (b × c) (88) Using this relation we get: Qγ+3 = n X k=1 mk eγ · (ak×dk ) (89) The vectors e1 , e2 and e3 are unit vectors along the axes of the Cartesian coordinate system: 1 0 0 e2 = 1 e3 = 0 (90) e1 = 0 0 0 1 As a result we have: n Q4 X Q5 = mk ak×dk k=1 Q6 (91) These are the normal coordinates of overall rotations of the molecule. The associated eigenvalues are λ4 = λ5 = λ6 = 0. So far we have found six normal coordinates of the molecule (Q1 , . . . , Q6 ), that correspond with eigenvalues λ1 = . . . = λ6 = 0. The remaining normal coordinates Qi XX (92) Qi = mk Lkα,i dkα = L Ti M d k α must be found as the eigenvectors of the matrix F , with “overlap matrix” M . For these eigenvectors we have: L TM L = E (93) where E is the unit matrix. Thus L Ti M L j = δij (94) The coordinates Q1 , Q2 , Q3 describe the translations of (the center of mass of) the molecule, the coordinates Q4 , Q5 , Q6 describe the overall rotations, the remaining Qi are the (3n − 6) “real” vibrations. We want to separate these “real” vibrations from the rotations and translations. This can be achieved by using a “molecule-fixed” coordinate system that rotates and 17 translates with the molecule. Such a system is defined by requiring that at every moment the following conditions are satisfied: n Q1 X Q2 = mk dk = 0 k=1 Q3 n Q4 X Q5 = mk ak×dk = 0 k=1 Q6 (95) These are the so-called Eckhart conditions. (Compare to equations (80) and (91).) Setting Q1 , Q2 , Q3 equal to zero implies that the origin of the molecule-fixed coordinate system is the center of mass of the molecule. Setting Q4 , Q5 , Q6 equal to zero minimizes the coupling between rotations and vibrations. The latter coupling cannot be completely removed, however, in contrast with the coupling between vibrations and translations. All of this will be explained below. 0.3.6 Vibrations in internal coordinates (3n-6) 1. Bond lengths rkl 2. Bending angles Θkln 3. Out-of-plane wagging Θklmn 4. Torsion angles τklmn Figure 6: Example: Internal coordinates of water molecule (s1 = ∆rkl , s2 = ∆rkm , s3 = ∆Θklm ). 18 The Cartesian coordinates r k are functions of the internal coordinates, and this also applies to the displacements dk . There are 3n Cartesian coordinates dk , for k = 1, . . . , n, but 3n − 6 internal coordinates s. Besides the internal coordinates s, there are also 3 coordinates that describe the position of the molecule in space (actually the position of the center of mass) and 3 angle coordinates that describe the orientation of the molecule. The force constants Fij of the molecule in Cartesian coordinates have little physical meaning. Rather, we know the force constants related to bond lengths, bond angles, etc. — the internal coordinates s — these force constants form the matrix F : 2 ∂ V (96) Fij = ∂si ∂sj The harmonic approximation of the potential in the internal coordinates s is: 1 (97) V = sT F s 2 We need to write also the kinetic energy in terms of the internal coordinates s. To achieve this, we first define the transformation s=B d (98) • d is a column vector of length 3n • s is a column vector of length 3n − 6 • B is a matrix of dimension (3n − 6) × (3n) If s is a small (infinitesimal) change of the internal coordinates, and d is a small change of the Cartesian displacements, then: Bij = ∂si ∂dj (99) Example H2 O - see Figure 6: s1 = ∆rkl 1/2 rkl = (xk − xl )2 + (yk − yl )2 + (zk − zl )2 1 ∂s1 1 xk − xl = [. . .]− /2 · 2(xk − xl ) = ∂xk 2 rkl ∂s1 xl − xk ∂s1 = =− ∂xl rkl ∂xk 19 (100) (101) (102) (103) These elements, together with derivatives with respect to yk , yl , zk and zl , form the 1st row of the matrix B. The remaining elements of this row are equal to zero. s3 = ∆Θklm r ·r cos Θklm = kl km rkl rkm (104) (105) Differentiation would give us the elements of the third row of the B matrix. In a similar way we compute all the elements of the matrix. For the usual types of coordinates 1 — 4 you can find the derivatives in the books of Wilson, Decius, and Cross and of Califano. All derivatives must be calculated for the equilibrium structure of the molecule. B is not a square matrix, so B −1 is not defined. The remedy is to add to s the 6 coordinates Q1 , . . . , Q6 of translation and rotation of the whole molecule. These also are (already presented) linear combinations of d (see section E). After adding these 6 rows B becomes a square and non-singular matrix, so it is possible to write: d = B −1 s (106) For the momenta ps , conjugate to s we have: ps = B −1 T pd = B T ps pd (107) Now we can write the Hamiltonian of the molecule as follows: 1 1 H = T + V = pTd M −1 pd + dT F d = 2 2 1 T 1 = ps B M −1 B T ps + sT F s 2 2 where F = B −1 We define the matrix G : T F B −1 . G = B M −1 B T . (108) (109) (110) Then we can write 1 1 H = pTs G ps + sT F s 2 2 We want to uncouple the Hamiltonian: H= 1 1X 2 1 Pi + λi Q2i = P T P + QT Λ Q 2 i 2 2 20 (111) (112) To do this we must return to the normal coordinates. We use the transformation: s = L ′Q (113) Through an analogy to what we have already seen, we derive conditions: L′ L T G −1 L ′ = E ′ T F L′ = Λ (114) (115) (116) We can combine these two equations into one: −1 T F L′ = L′ Λ = G −1 L ′ Λ (117) This is a generalised eigenvalue problem of the matrix F with “overlap matrix” G −1 . As we can see, G −1 is indeed symmetric: G −1 = B −1 T M B −1 (118) After multiplying equation (117) by G from the left side: G F L ′ = L ′Λ G F L ′i = L ′i λi (119) This is the eigenvalue problem of the matrix G F . This method is called “Wilson’s GF matrix method”. The dimension of this method is 3n − 6, as we can omit the 6 rows associated with Qi in B (we don’t need B −1 in G = B M −1 B T ) and we define 2 V . The internal coordinates do not change when the F as Fij = ∂s∂i ∂s j whole molecule is translated or rotated. That is, the Eckart conditions are automatically taken into account. 0.4 Rotations of rigid (non-linear) molecules See also: Paul Wormer, Courses and teaching material http://www.theochem.ru.nl/ pwormer/teachmat.html The rigid rotor in classical and quantum mechanics 21 0.4.1 Rotation matrices and infinitesimal rotations Choose a coordinate frame that “rotates with the molecule” — such a frame is called a body-fixed (BF) frame. So the orientation of the molecule is defined by the orientation of the BF frame relative to the laboratory = space fixed (SF) frame. Hence, the orientation of the molecule, i.e., the orientation of the BF axes, with respect to the SF frame is described by a 3 × 3 matrix of direction cosines: the rotation matrix C. Any orientation (rotation) can be described by three Euler angles ϕ, ϑ, χ: rotate around the z-axis over ϕ → x′ , y ′ , z ′ axes (z ′ axis = z axis) rotate around the y ′-axis over ϑ → x′′ , y ′′, z ′′ axes (y ′′ axis = y ′ axis) rotate around the z ′′ -axis over χ The corresponding rotation matrices (containing direction cosines) are: cos ϕ − sin ϕ 0 cos ϑ 0 sin ϑ 1 0 C z (ϕ) = sin ϕ cos ϕ 0 C y (ϑ) = 0 0 0 1 − sin ϑ 0 cos ϑ (120) For rotations around the x-axis we have: 1 0 0 C x (α) = 0 cos α − sin α (121) 0 sin α cos α The Euler (three-angle) rotation matrix is: C(ϕ, ϑ, χ) = C z ′′ (χ)C y′ (ϑ)C z (ϕ), (122) which can also be written as (prove this!): C(ϕ, ϑ, χ) = C z (ϕ)C y (ϑ)C z (χ) (123) We define the so-called Lie derivatives or infinitesimal rotations J x , J y , J z: − sin ϕ − cos ϕ 0 0 −1 0 dC z (ϕ) = cos ϕ − sin ϕ 0 = 1 0 0 Jz = dϕ ϕ=0 0 0 0 ϕ=0 0 0 0 (124) 0 0 0 0 0 1 J x = 0 0 −1 J y = 0 0 0 0 1 0 −1 0 0 22 Any vector can be decomposed as: x r = y = xe1 + ye2 + ze3 z in term of unit vectors: 1 e1 = 0 0 0 e2 = 1 0 (125) 0 e3 = 0 1 The infinitesimal rotations of the vector r are: 0 = e1 × r −z J xr = y z J y r = 0 = e2 × r −x −y J z r = x = e3 × r 0 ωx Any infinitesimal rotation ω = ωy can be described as ωz J ω = ωx J x + ωy J y + ωz J z = 0 −ωz ωy 0 −ωx = Ω = ωz −ωy ωx 0 (126) (127) (128) — this is the most arbitrary antisymmetric 3 × 3 matrix (any 3 × 3 antisymmetric matrix can be written in this way). It is easy to verify that Jω r =Ω r =ω×r Furthermore, it follows that: − sin ϕ − cos ϕ 0 dC z (ϕ) = cos ϕ − sin ϕ 0 = J z C z (ϕ) = C z (ϕ)J z dϕ 0 0 0 23 (129) (130) 0.4.2 The rigid rotor in classical mechanics A set of particles (atoms) has the coordinates r k (t) in the laboratory (SF) frame. Their masses are mk . The position vector of the center of mass is: R(t) = M −1 n X mk r k (t) (131) k=1 with M denoting the total mass, M = P k mk . In a coordinate frame which moves with R(t), but has axes parallel to the axes of the laboratory (SF) frame, the atoms have coordinates: ρk (t) = rk (t) − R(t) It follows that: X mk ρk (t) = 0 at each time t (132) (133) k (prove this). In other words: the position vector of the center of mass is 0 in the center of mass frame. We assume that the particles (atoms) belong to a moving rigid body (molecule). We define a rotating (BF) coordinate frame so that any time t: ρk (t) = C(ϕ(t), ϑ(t), χ(t))ak (134) The vectors ak are the constant position vectors of the atoms in the BF coordinate frame that rotates with the molecule. The angles ϕ(t), ϑ(t), χ(t) are the Euler angles describing the orientation of the BF frame with respect to the SF frame at time t. Finally, we can write: r k (t) = R(t) + C(ϕ(t), ϑ(t), χ(t))ak (135) with · r k (t) — coordinates of atom k in the laboratory (SF) frame · R(t) — coordinates of the center of mass in the laboratory (SF) frame · C(ϕ(t), ϑ(t), χ(t)) — rotation matrix containing direction cosines of the rotating BF axes in the laboratory (SF) frame 24 · ak — constant position vector of atom k in BF coordinates The velocities of the atoms are: v k (t) = ∂r k (t) = ṙ k (t) ∂t (136) ṙk (t) = Ṙ(t) + Ċ(ϕ(t), ϑ(t), χ(t))ak = Ṙ(t) + Ċ(ϕ, ϑ, χ)C −1 (ϕ, ϑ, χ)ρk (t) (137) Rotation matrices are orthogonal: C −1 = C T (138) Theorem: The matrix Ċ C −1 = Ċ C T is an antisymmetric matrix. Proof: C CT = E identity matrix T Ċ C T + C Ċ = 0 Hence T T Ċ C = −C Ċ = − Ċ C Let us assume that: T T 0 −ωz ωy 0 −ωx Ċ C T = Ω = ωz −ωy ωx 0 (139) (140) (141) Theorem: The relation between the angular velocity ω and the time derivatives ϕ̇ ϕ ∂ q̇ = ϑ̇ = ϑ of the Euler angles is given by: ∂t χ χ̇ with ϕ̇ ω = U ϑ̇ = U q̇ χ̇ (142) 0 − sin ϕ cos ϕ sin ϑ U = 0 cos ϕ sin ϕ sin ϑ 1 0 cos ϑ (143) 25 Proof: Ċ(ϕ, ϑ, χ) = Ċ z (ϕ)C y (ϑ)C z (χ) + C z (ϕ)Ċ y (ϑ)C z (χ) + C z (ϕ)C y (ϑ)Ċ z (χ) Ċ z (ϕ) = ∂C z (ϕ) ∂ϕ ∂ C z (ϕ) = = J z C z (ϕ)ϕ̇ ∂t ∂ϕ ∂t (144) (145) Ω = Ċ C T = J z C z (ϕ) C y (ϑ) C z (χ)C Tz (χ) C Ty (ϑ) C Tz (ϕ) ϕ̇ + C z (ϕ)J y C y (ϑ) C z (χ)C Tz (χ) C Ty (ϑ) C Tz (ϕ) ϑ̇ (146) + C z (ϕ)C y (ϑ)J z C z (χ)C Tz (χ) C Ty (ϑ)C Tz (ϕ) χ̇ Each “box” yields the identity matrix E (because rotation matrices are orthogonal), so we have: Ω = J z ϕ̇ + C z (ϕ)J y C Tz (ϕ) ϑ̇ + C z (ϕ)C y (ϑ)J z C Ty (ϑ)C Tz (ϕ) χ̇ (147) Substitution of eq. (120) for the matrices C z and C y and eq. (124) for J z and J y , and using the relation in eq. (128) between Ω and ω proves eqs. (142) and (143). As a result we can express the velocities of the atoms as: ṙk (t) = Ṙ(t) + Ċ C T ρk (t) = Ṙ(t) + Ω ρk (t) (148) One may remember that multiplication of the vector ρk (t) by the antisymmetric matrix Ω can be replaced by taking the vector product ω × ρk (t): ṙ k (t) = Ṙ(t) + ω × ρk (t) (149) Because ρk (t) is the velocity of atom k in a coordinate frame that moves with the center of mass, but does not rotate, it follows that ω is the angular velocity in the laboratory (SF) frame ω SF = ω 26 (150) We may further write ṙk (t) = Ṙ(t) + ω × ρk (t) = Ṙ(t) + C C −1 ω × ρk (t) i h = Ṙ(t) + C C −1 ω × C −1 ρk (t) (151) The last line follows from the properties of the vector product under rotation: C (a × b) = (C a) × (C b). Here we can substitute ak = C −1 ρk (t) and we get: ṙk (t) = Ṙ(t) + C ω BF × ak The angular velocity with respect to the rotating (BF) frame is: ϕ̇ ϕ̇ BF −1 −1 SF −1 ω = C ω = C ω = C U ϑ̇ = V ϑ̇ χ̇ χ̇ with − sin ϑ cos χ sin χ 0 V = C −1 U = sin ϑ sin χ cos χ 0 cos ϑ 0 1 (152) (153) (154) This can be easily shown, if one remembers eq. (143): 0 − sin ϕ cos ϕ sin ϑ V = C −1 U = C Tz (χ)C Ty (ϑ)C Tz (ϕ) 0 cos ϕ sin ϕ sin ϑ = 1 0 cos ϑ = C Tz (χ)C Ty (ϑ)C Tz (ϕ) e3 , C z (ϕ)e2 , C z (ϕ)C y (ϑ)e3 = (155) − sin ϑ cos χ sin χ 0 T T T sin ϑ sin χ cos χ 0 = C z (χ)C y (ϑ)e3 , C z (χ)e2 , e3 = cos ϑ 0 1 We have now expressed the velocities of the atoms ṙ k in terms of the velocity ϕ̇ SF BF of the center of mass Ṙ and the angular velocity ω , or ω , or q̇ = ϑ̇ . χ̇ 27 We want to derive the expression for the kinetic energy, but first we define: xk (t) 0 −zk (t) yk (t) 0 −xk (t) ρk (t) = yk (t) → X k (t) = zk (t) (156) zk (t) −yk (t) xk (t) 0 — an antisymmetric matrix with the cartesian (SF) coordinates of atom k. We define a similar matrix for the BF frame: akx 0 −akz aky 0 −akx ak = aky → A k = akz (157) −aky akx 0 akz When we consider the elements of the product matrix X Tk X k : = yk2 + zk2 = (x2k + yk2 + zk2 ) − x2k X Tk X k 11 X Tk X k = −xk yk (158) 12 we see that the matrix X I SF (t) = mk X Tk (t)X k (t) (159) k is the inertia tensor of the molecule in the laboratory frame. Similarly, we see that: X mk ATk A k I BF = (160) k is the inertia tensor in the BF frame. It is obvious that I BF is constant in time. The kinetic energy T = 1 2 X mk vk2 = 1 2 k X k mk ṙk · ṙ k = 1 2 X mk ṙ Tk ṙk (161) k can be written in different ways. In the laboratory (SF) frame we get: T = 1 2 X k T mk Ṙ Ṙ + X k mk Ṙ T T X ω × ρk ω × ρk + 21 mk ω × ρk k (162) 28 The second therm vanishes, because X k T T mk Ṙ ω × ρk = Ṙ ω× X mk ρk k ! T = Ṙ (ω × 0) = 0 (163) see eq. (133). We may further write that: Ω ρk = ω × ρk = −ρk × ω = −X k ω (164) and the result becomes: T T = 21 M Ṙ Ṙ + 21 ω T X mk X Tk X k k T T = 12 M Ṙ Ṙ + 1 2 ! ω ω T I SF ω (165) (166) The first term corresponds to the translation of the center of mass, the second to the rotation. As we see, there is an exact separation between translation and rotation. In the BF frame we obtain (also here the cross term vanishes): X T T mk ω BF × ak C T C ω BF × ak T = 21 M Ṙ Ṙ + 12 k = T 1 M Ṙ Ṙ 2 + 1 2 ω BF T T X T = 12 M Ṙ Ṙ + mk ATk A k k 1 2 ω BF T ! ω (167) BF I BF ω BF (168) ϕ Let us now express the kinetic energy in terms of the Euler angles q = ϑ . χ To this end we must substitute: ω SF = ω = U q̇ ω BF = C −1 U q̇ = V q̇ (169) Ater this substitution we get: T + 21 q̇ T U T I SF U q̇ T + 21 q̇ T V T I BF V q̇ T = 12 M Ṙ Ṙ T = 12 M Ṙ Ṙ 29 (170) which we can also write as T T = 12 M Ṙ Ṙ + 21 q̇ T g q̇ (171) g = U T I SF U = V T I BF V (172) where is the metric matrix (metric tensor — “overlap matrix”) belonging to the Euler angles q (N.B. we have seen this previously, in Wilson’s GF method, where instead of q we used internal coordinates s. The matrix G , defined there, is the inverse of the metric tensor, G −1 = g . Of course, g and g are different matrices, s s but they play a similar role.) Finally, we present an expression for the kinetic energy in terms of angular momenta X X Jk = mk r k × ṙ k (173) J= k In SF coordinates: k r k = R + ρk ṙ k = Ṙ + ω × ρk X X J= mk R × Ṙ + mk ρk × Ṙ k + X k P k mk R × ω × ρk + X k mk ρk × ω × ρk (174) (175) Because k mk ρk = 0, the second and the third term vanish. The fourth term can be rewritten as follows X X mk ρk × ρk × ω = mk ρk × ω × ρk = − k =− X k k mk X k X k ω = X mk X Tk X k ω = I SF ω (176) k so that we get J = MR × Ṙ + I SF ω (177) In this equation J CM = MR × Ṙ is the angular momentum due to the center of mass motion, and I SF ω is the rotational angular momentum. 30 In BF coordinates: r k = R + C ak ṙ k = Ṙ + C ω BF × ak (178) J = J CM + C I BF ω BF Summarizing: J = J CM + J SF rot SF SF J SF ω rot = I J = J CM + C J BF rot BF BF J BF ω rot = I (179) Apart from the term related to the motion of the center of mass, we then get for the kinetic energy: T SF −1 SF T = 12 J SF I J (180) T −1 BF T = 12 J BF I BF J 0.4.3 The rigid rotor Hamiltonian To go from classical mechanics to quantum mechanics, we must first write the classical kinetic energy in terms of the coordinates q and conjugated momenta p. In classical mechanics, if qi are the (possibly curvilinear) cordinates, then the conjugated momenta are defined as: pi = ∂T ∂ q̇i (181) x In cartesian coordinates, r = y , the expression for the kinetic energy is: z T = 21 mv 2 = 21 m(ẋ2 + ẏ 2 + ż 2 ) ∂T = mẋ, etc., and p = This is consistent with eq. (181), because px = ∂ ẋ mṙ = mv. ϕ Let us now apply this to the Euler angles q = ϑ , and first derive the χ conjugated momenta p. We return to eq. (171): X gij q̇i q̇j (182) T = 21 q̇ T g q̇ = 21 i,j 31 with the metric tensor given by g = U T I SF U = V T I BF V , see eq. (172). The momenta are: pi = X ∂T = gij q̇j ∂qi j p = g q̇ q̇ = g −1 p (183) and the classical Hamiltonian for the kinetic energy is: T T T = 21 pT g −1 g g −1 p = 21 pT g −1 p (184) T = 12 pT g −1 p (185) According to the postulates of quantum mechanics, the momentum operators conjugated to a set of (curvilinear) coordinates qi are: pi = h̄ ∂ i ∂qi (186) The quantum mechanical momentum operators conjugated to the Euler angles are: ∂/∂ϕ h̄ (187) p = ∂/∂ϑ i ∂/∂χ From here on, we will use atomic units with h̄ = 1. We want to quantize equation (185). But we must be careful, because the matrices U and V are functions of ϕ, ϑ, χ, and so is the matrix g. So, p does not commute with g −1 . B. Podolski [Quantum-mechanically correct form of Hamiltonian function for conservative system, Phys. Rev., 32, 812 (1928)]: We cannot simply replace the classical momenta in eq. (185) by operators, but we have to use the general formula for the Laplace operator ∆ = ∇2 = ∂2 ∂2 ∂2 + + ∂x2 ∂y 2 ∂x2 (188) in curvilinear coordinates q. This yields: 1 1 T = 12 g − /2 pT g /2 g −1 p 32 (189) where g = det(g). We see that, if g did not depend on q, then g would commute with p and we would have the simpler expression: T = 21 pT g −1 p. The expression for g is known in terms of the Euler angles, g = V T I BF V , (Euler angles appear in V , I BF is constant for a rigid rotor). So, we can derive an explicit quantum mechanical formula for T . We start by calculating 2 (190) g = det g = det V T I BF V = det V det I BF = sin2 ϑ I1 I2 I3 I1 , I2 , I3 are the so-called principal moments of inertia — the eigenvalues of I BF . We can then write: ∂/∂ϕ 2 −1 T −1 ∂ ∂ ∂ h̄ ∂/∂ϑ (191) sin ϑ V −1 I BF , , V T =− 2 sin ϑ ∂ϕ ∂ϑ ∂χ ∂/∂χ Further we should plug in the explicit formula for V , eq. (154). It appears that the result can simply be written in terms of the angular momenta J SF or J BF , but we must first quantize these. In classical mechanics: J SF = I SF ω = I SF U q̇ q̇ = g −1 p = U −1 I SF Combining these two equations gives: J SF = U T −1 −1 UT −1 p p (192) (193) ∂/∂ϕ 1 After inverting U T and substituting p = ∂/∂ϑ we derive the expressions i ∂/∂χ SF for J in terms of the Euler angles: J SF x J SF y J SF z ∂ ∂ cos ϕ ∂ = i cos ϕ cot ϑ + sin ϕ − ∂ϕ ∂ϑ sin ϑ ∂χ ∂ ∂ sin ϕ ∂ = i sin ϕ cot ϑ − cos ϕ − ∂ϕ ∂ϑ sin ϑ ∂χ ∂ =i − ∂ϕ 33 (194) Analogously, we find J BF = V T −1 p = C −1 U T −1 p = C T J SF After a similar elaboration we get: ∂ ∂ cos χ ∂ BF Jx =i − sin χ − cot ϑ cos χ sin ϑ ∂ϕ ∂ϑ ∂χ sin χ ∂ ∂ ∂ BF − cos χ + cot ϑ sin χ Jy =i − sin ϑ ∂ϕ ∂ϑ ∂χ ∂ J BF z = i − ∂χ (195) (196) Now we know that J BF ∂/∂ϕ −1 −1 ∂/∂ϑ = VT p= VT ∂/∂χ and we can also show (via substitution and differentiation) that T −ih̄ ∂ ∂ ∂ sin ϑ V −1 = JxBF , JyBF , JzBF = J BF , , sin ϑ ∂ϕ ∂ϑ ∂χ (197) (198) When we apply this to equation (191), we get simple quantum mechanical expressions for the kinetic energy: T BF −1 BF I J T = 12 J BF (199) and similarly T = 1 2 J SF T I SF −1 J SF (200) These are the exact quantum mechanical expressions in which the operators J BF and J SF are known. It appears that in terms of the angular momenta, the classical and quantum mechanical expressions are the same! To some extent this is a coincidence. 0.4.4 The rigid rotor in quantum mechanics It is simpler to work in the BF frame, because (in contrast with I SF , which depends on the Euler angles) I BF is a constant matrix X mk ATk A k I BF = k 34 depending on the constant position vectors ak . It is a symmetric 3×3 matrix, so we can easily diagonalise it and its eigenvalues are real. It is customary to choose a rotating coordinate frame that makes I BF diagonal −1 I1 0 0 I 0 0 1 −1 = 0 I2−1 0 (201) I BF I BF = 0 I2 0 −1 0 0 I3 0 0 I3 and define the rotational constants: A= 1 2I1 B= 1 2I2 C= 1 2I3 Then, we obtain the well known kinetic energy expression: 2 2 2 T = A JxBF + B JyBF + C JzBF (202) (203) The components of J SF satisfy the following commutation relations: SF SF Jx , Jy = ih̄JzSF (204) (with cyclic permutations of the indices x, y, z). The components of the BF angular momentum operators, J BF , obey anomalous commutation relations: BF BF Jx , Jy = −ih̄JzBF (205) (cyclic permutations of the indices x, y, z are allowed; pay attention to the minus sign!) Since J SF = C J BF and C T C = E it follows that 2 T T 2 J SF = J SF J SF = J BF C T C J BF = J BF = J 2 All components of J SF commute with J SF 2 of J BF commute with J BF . 2 (206) and, similarly, all components ∂ ∂ commutes with JzBF = −i . So, we have a set ∂ϕ ∂χ of commuting operators: J 2 , JzSF , JzBF (207) Furthermore, JzSF = −i The question is: are these operators constants of the motion, i.e., do they commute with the Hamiltonian? 2 2 2 T = A JxBF + B JyBF + C JzBF The answer is: 35 · J 2 commutes with T , because it commutes with JxBF , JyBF , and JzBF . ∂ and the operators ∂ϕ ∂ ) do not contain ϕ (they contain only ∂ϕ · JzSF also commutes with T , because JzSF = −i JxBF , JyBF · JzBF does not commute with T , because it does not commute with 2 2 JxBF , and JyBF Let us now consider some special cases. Symmetric top A = B 6= C · prolate A = B < C (like a cigar) · oblate A = B > C (like a pancake) Examples of a symmetric top: benzene, NH3 (we will use the notation J ≡ J BF ) H = T = A Jx2 + Jy2 + CJz2 = AJ 2 + (C − A) Jz2 (208) This Hamiltonian does commute with Jz = JzBF . Hence, there are three constants of the motion in this case, and three quantum numbers: j, k, and m. H |j, k, mi = Ej,k |j, k, mi J 2 |j, k, mi = j(j + 1) |j, k, mi JzBF |j, k, mi = k |j, k, mi JzSF |j, k, mi = m |j, k, mi −j ≤k ≤j (209) −j ≤m≤j The energy eigenvalues are Ej,k = Aj(j + 1) + (C − A)k 2 (210) The energy does not depend on the quantum number m, so it is (2j + 1)-fold degenerate. Since T commutes with JzBF , the eigenfunctions of T have to be also eigen∂ functions of JzBF = −i , with eigenvalue k. So, they contain a factor ∂χ 36 ∂ , with ∂ϕ eigenvalue m, which yields the factor exp(imϕ). Thus, they can be written as: exp(ikχ). Similarly, they have to be eigenfunctions of JzSF = −i (j) (j) |j, k, mi = Ψj,k,m(ϕ, ϑ, χ) = exp(imϕ)dmk (ϑ) exp(ikχ) = Dmk (ϕ, ϑ, χ)∗ (211) (j) The functions Dmk (ϕ, ϑ, χ) are called Wigner rotation matrices. The func(j) tions dmk (ϑ), which are called Wigner (small) d-matrices, are the solutions of a differential equation in cos ϑ; they can be obtained by means of recursion relations. Spherical top A = B = C, examples: CH4 , SF6 H = T = AJ 2 Ej = Aj(j + 1) (212) (213) The energy does not depend on the quantum numbers k and m, so it is (2j + 1)2 -fold degenerate. The eigenfunctions of the spherical top Hamiltonian are the same functions (j) Dkm (ϕ, ϑ, χ)∗ Asymmetric top The three rotational constants are different, A > B > C. JzBF is not a constant of the motion, thus k is not a good quantum number. But j and m remain good quantum numbers. The eigenfunctions of H can be written as: Ψj,m,i(ϕ, ϑ, χ) = j X (j) Dmk (ϕ, ϑ, χ)∗ cki (214) k=−j The coefficients cki follow from diagonalization of the matrix of the Hamil(j) tonian H in the basis of Wigner D-functions, Dkm (ϕ, ϑ, χ)∗ , for fixed j and m, and k = −j, . . . , j The dimension of this matrix is 2j + 1. The matrix elements can be easily derived by writing T in terms of raising (or step-up) and lowering (or step-down) operators: J±BF = JxBF ∓ iJyBF 37 (215) These operators act on the basis functions as: 1 J±BF |j, k, mi = [j(j + 1) ± k(k ± 1)] /2 |j, k ± 1, mi (216) Note the interchange of the + and − signs in the definition of these operators, eq. (215), which is due to the anomalous commutation relations between the components of J BF . The alternative is not to interchange the + and − signs in the definition of the operators, but then the interchange occurs in eq. (216). For more details we refer to Paul Wormer’s lecture notes, mentioned at the beginning of this section, and to the textbooks on angular momentum theory listed in these notes. 0.5 0.5.1 The molecular vibration-rotation Hamiltonian Introduction In Chapter 2 we discussed pure vibrations (in the harmonic approximation), in Chapter 3 pure rotations. Now we will consider the vibrations and rotations of molecules simultaneously. The question is: how do we define the rotating (BF) axes in a vibrating molecule? Or: how can we separate vibrations and rotations? The answer to the latter question is that we cannot do this exactly. By a suitable choice of the rotating axes we can minimize the coupling between vibrations and rotations, however. Our task is then to: 1. Derive the Hamiltonian for which the rot-vib coupling is minimal. 2. Find the approximations needed to arrive at a harmonic oscillator model for the vibrations an a rigid rotor model for the rotations. A lot of attention has been given to these issues, by Eckhart, Sayvetz, Van Vleck, Wigner & Hirschfelder, Wilson & Howard, Darling & Dennison, Watson, and others. 38 0.5.2 The (classical) kinetic energy of a semi-rigid molecule The word semi-rigid implies that the vibrations involve only small displacements of the atoms from their equilibrium positions in the molecule. To derive the classical expression for the kinetic energy T we write the 3n cartesian coordinates r k (t) of the atoms with respect to a laboratory (SF) frame in terms of: 3 coordinates R(t) — the origin of the moving BF frame 3 Euler angles ϕ, ϑ, χ — the orientation of the rotating BF frame axes 3n − 6 internal coordinates — bond lengths, angles between bonds, etc. by means of the equation: r k (t) = R(t) + C (ϕ(t), ϑ(t), χ(t)) ak (t) (217) Because of the vibrations, also the ak (t) depend on t now. However, they are not internal coordinates, but (3n, not 3n − 6) cartesian coordinates of the atoms with respect to the rotating frame. The atomic velocities are: ṙ k = Ṙ(t) + Ċ(ϕ, ϑ, χ)ak + C ȧk = Ṙ(t) + C(ω BF × ak ) + C ȧk (218) When comparing this to the previously derived equations, we see an extra term C ȧk , because ak (t) depends on time. First, let us choose R(t) as the position of the center of mass: X mk r k (t) (219) R(t) = M −1 k Then, it follows that at each time t X mk ak (t) = 0 (220) k so, it is also true that X mk ȧk (t) = 0 k 39 (221) The kinetic energy can be written as X mk ṙTk ṙ k = T = 21 k T = 12 M Ṙ Ṙ + X k + X T T mk Ṙ C ω BF × ak + 1 2 mk ω BF × ak T k + X k X mk Ṙ C ȧk + k mk ω BF × ak ȧk + 1 2 X T mk ȧTk ȧk ω BF × ak + (222) k We have used here that C T C = E (C is an orthogonal matrix). The second and third term vanish, because of the translational conditions for ak and ȧk , eqs. (220) and (221): ! X X T T mk Ṙ C ω BF × ak = Ṙ C ω BF × mk ak = (223) k k T = Ṙ C ω BF × 0 = 0 X k T T mk Ṙ C ȧk = Ṙ C X mk ȧk k ! T = Ṙ C 0 = 0 (224) So, there is an exact separation between the translations in the kinetic energy and the rotations and vibrations. Because of this separation, we can now T “forget” about the term 21 M Ṙ Ṙ that corresponds to the kinetic energy of the (center-of-mass) translation, as it does not interfere with the rotations and vibrations. The fourth term was already elaborated in Chapter 3, where we wrote it in terms of the inertia tensor I BF . It contains the coordinates ak (t) and is now time-dependent (because of the vibrations). The fifth term is the rotation-vibration coupling. We will also rewrite this term, with the aid of the relation (a × b) · c = a · (b × c). The result is: X X T T mk ω BF (ak × ȧk ) + 21 T = 12 ω BF I BF ω BF + mk ȧTk ȧk (225) {z } | rotation |k {z } | k {z } rot-vib coupling vibration We cannot eliminate the rot-vib coupling term but we can minimize it, by an appropriate choice of the rotating axes. 40 Eckart’s first attempt (1934) At each time t, choose the principal axes of the inertia tensor I BF (t) as the rotating axes, i.e., choose a BF coordinate frame, that makes I BF diagonal. The principal moments of inertia I1 , I2 and I3 are the eigenvalues of I BF . This led to the so-called Eckart paradox: the inclusion of the rotationvibration coupling (with the aid of first order perturbation theory) yields a kinetic energy term that has the same form, AJx2 + BJy2 + CJz2 , as the rigid rotor Hamiltonian. The constants, however, are given by expressions such 1 1 as A = 2(I2I−I 2 , instead of 2I , as for the rigid rotor. This is a paradox, 3) 1 because one expects that in the limit of zero amplitude vibrations T should be precisely the rigid rotor kinetic energy. It was shown later by Van Vleck that the inclusion of second order terms restores the correct expressions. A derivation of this result was given by F. Jørgensen in his thesis. Eckart’s second attempt (1935) Let us assume an equilibrium structure of the molecule with atomic positions ak (0) and small displacements dk (t) ak (t) = ak (0) + dk (t) ȧk (t) = d˙k (t) since ak (0) is constant. (226) It is obvious that, if dk (t) = 0, we may choose the axes of the rotating frame as the principal axes of I BF . But how do we choose the frame when dk (t) 6= 0? Eckart (1935): choose the rotating frame such that the vibrational displacements at each time t satisfy the Eckart conditions (see Chapter 2): X a) mk dk (t) = 0 k b) X k (227) mk ak (0) × dk (t) = 0 The three conditions a) follow directly from putting the origin of the coordinate system at the center of mass, which implies that: X X X mk ak (0) + mk ak (t) = mk dk (t) (228) 0= k |k {z =0 41 } |k {z =0 } The first, time-independent, term in the above equation equals 0, because we assumed that dk (0) = 0. Then, it follows that also the second term must be equal to zero at all times t. We knew this already: the translations separate exactly from both the rotations and vibrations. The three conditions b) give the best possible separation of rotations and vibrations, because: T X Trot-vib = ω BF mk ak (t) × ȧk (t) Coriolis coupling k = ω BF T X k mk ak (0) × d˙k (t) + ω BF T X k mk dk (t) × d˙k (t) (229) The first term, which is linear in the vibrational displacements dk , equals 0 because of (the time derivative of) conditions b). The second term that is quadratic in P dk remains. This term is (the time derivative of) the minimum value of k mk aTk ak . One can see this by looking at the change in P mk aTk ak under an infinitesimally small rotation ∆ω, which is equal to kP −2 k mk ∆ω T (ak (0) × ak (t)). The conditions b) implicitly fix the directions of the axes of the rotating (BF) frame. The orientation of this frame relative to the laboratory (SF) frame is determined by the rotation matrix C(ϕ, ϑ, χ). An explicit formula for C(ϕ, ϑ, χ) can be derived directly from the conditions b). Write b) as: 0= X k mk ak (0) × dk (t) = X k mk ak (0) × ak (t) (230) which holds because ak (0) × ak (0) = 0. Recall that rk (t) = R(t) + C ak (t) (231) ak (t) = C T [r k (t) − R(t)] (232) or Combined with b) this gives: X 0= mk ak (0) × C T [r k (t) − R(t)] (233) k This equation, written in components, is XX T 0= mk akα (0)Cβγ [rkγ (t) − Rγ (t)] k − γ XX k γ T mk akβ (0)Cαγ [rkγ (t) − Rγ (t)] 42 (234) Now we define a matrix Y with components: X mk [rkα (t) − Rα (t)] akβ (0) Yαβ (t) = (235) k or, in matrix notation Y (t) = X k mk [rk (t) − R(t)] ⊗ ak (0) (236) where ⊗ defines the outer (or tensor) product of two vectors. We find that: X (Cγβ Yγα − Cγα Yγβ ) = 0 (237) γ or, in matrix form Y T C = CT Y (238) Y T = CT Y CT (239) C is orthogonal, C T = C −1 , so: Y T Y = C T Y C T Y = (C T Y )2 (240) 1/ 2 is well defined, and we Y T Y is symmetric and positive-definite, so (Y T Y ) can write: 1 (Y T Y ) /2 = C T Y So we get: (241) 1 C = Y (Y T Y )− /2 (242) This equation gives the rotation matrix C, which defines the orientation of the body-fixed frame, in terms of the matrix Y (t). The matrix Y (t) contains the equilibrium positions ak (0) of the atoms in the molecule, but also depends on their instantaneous positions r k (t) − R(t) = ρk (t) relative to the space fixed frame (with the origin at the center of mass). So, at each time t, we can determine the BF frame that satifies the Eckart conditions b) by computing the matrix C. The result of applying the Eckart conditions is that: X T T X T T = 21 ω BF I BF ω BF + ω BF mk d˙k d˙k mk dk × d˙k + 12 (243) k k Let us assume now that the 3n displacements dk that satisfy the six Eckart conditions depend on 3n − 6 internal coordinates, namely by: 1 d = L Q = M − /2 L Q 43 (244) (see Chapter 2). The first six Qi are zero because of the Eckart conditions (see Section 3.5). The other Qi are the normal coordinates of vibration, which in turn depend on 3n − 6 internal coordinates s (bond lengths, angles) −1 as Q = L ′ s. Substitution of these results into the kinetic energy, while remembering that LT = L−1 , yields: T = 1 2 ω BF T I BF ω BF + X T ωαBF QT ζ α Q̇ + 12 Q̇ Q̇ (245) α The matrices ζ α are antisymmetric Coriolis coupling matrices, with elements defined as: 3 X X mk ǫαβγ Lkβ,i Lkγ,j ζijα = −ζjiα = (246) k β,γ=1 where ǫαβγ is the Levi-Civita symbol which is +1 if (α, β, γ) is an even permutation of (1,2,3), −1 if it is an odd permutation, and 0 if two or three of its subscripts are equal. The result in eq. (245) is the classical kinetic energy, in terms of the rotations ω BF and the normal coordinates of vibration Q. 0.5.3 Semi-rigid molecules in quantum mechanics Before we can quantize the rotational-vibrational Hamiltonian T , we first have to switch to conjugate momenta also for the vibrations. The vibrational momenta Pi conjugated with the normal coordinates are: Pi = XX ∂T = Q̇i − ωαBF ζijα Qj ∂ Q̇i α j (247) The vibrational angular momentum Π is defined as: Πα = QT ζ α P (248) and the components of the total angular momentum are defined as: JαBF = ∂T = I BF ω BF α + QT ζ α Q̇ ∂ωα (249) Substituting these expressions into the classical expression for the kinetic energy of a semi-rigid molecule (eq. 245), we obtain: T = 1 2 T J BF − Π µ J BF − Π + 12 P T P 44 (250) where µ = I′ −1 (251) and the components of I ′ — the effective inertia tensor — are ′ BF Iαβ = Iαβ + QT ζ α ζ β Q (252) The effective inertia tensor depends both indirectly, through I BF , and directly on the vibrational coordinates Qi . Note that equation (250) is not quantum mechanical, it is just a classical expression in terms of conjugate momenta. The quantum expression is then, according to Podolski T 1 1 1 1 T = 12 µ /2 J BF − Π µ− /2 µ J BF − Π + 12 µ /2 P T µ− /2 P (253) with µ = det(µ) and the vibrational linear and angular momentum operators P and Π given by: h̄ ∂ i ∂Qi Πα = QT ζ α P Pi = (254) (255) The total angular momentum operator J BF was already defined in eq. (196). The total Hamiltonian is H =T +V (256) The potential V depends only on the internal coordinates, it describes the so-called molecular force field. This “Eckart” Hamiltonian was obtained by Wilson and Howard in 1936, [E.B. Wilson and J.B. Howard, The Vibration-Rotation Energy Levels of Polyatomic Molecules I. Mathematical Theory of Semirigid Asymmetrical Top Molecules, J. Chem. Phys. 4, 260-268 (1936)] who followed the procedure described above, and it was further refined by Darling and Dennison in 1940 [B.T. Darling and D.M. Dennison, The water vapor molecule, Phys. Rev. 57, 128-139 (1940)]. It remained the standard until 1968, when Watson [J.K.G. Watson, Simplification of the molecular vibration-rotation Hamiltonian, Mol. Phys. 15, 479-490 (1968)] was able to simplify it drastically by commuting the determinant of the metric tensor through the derivatives. The ro-vibrational Hamiltonian obtained by Watson, often referred to as the “Watson Hamiltonian”, is T H = 12 J BF − Π µ J BF − Π + 21 P T P + U + V (257) 45 The potential-like term U is the “Watson term” 3 1X U =− µαα 8 α=1 (258) It is proportional to the trace of the effective reciprocal inertia tensor and depends only on the internal coordinates, not on the momenta. The next question of interest is how to arrive at the harmonic oscillator + rigid rotor model? Three approximations must be made: 1. Assume that the effective inertia tensor I ′ is constant, i.e., does not depend on the internal coordinates Q. Then µ commutes with Π and the terms quadratic in Π vanish. Also, replace I ′ with I BF (0), i.e., by the inertia tensor of the equilibrium structure. This implies a neglect of distortions of the structure by centrifugal forces. 2. Neglect Coriolis interactions, i.e., the cross terms between the total angular momentum J BF and the vibration angular momentum Π. 3. Neglect anharmonic terms in the potential V , i.e., terms with higher than quadratic dependence on the vibrational coordinates Q. After these three approximations we get the rigid rotor + harmonic oscillator Hamiltonian: H =T +V = T BF −1 BF 1 T I (0) J + 2 P P + 12 QT Λ Q = = 21 J BF X X = 21 JαBF I −1 (0) αβ JβBF + 12 Pi2 + λi Q2i α,β | {z } rotations | i {z harmonic vibrations (259) } The Coriolis and anharmonic terms that were neglected are sometimes taken into account by means of perturbation theory. 0.5.4 Floppy molecules and Van der Waals molecules The Eckart and Watson Hamiltonians were derived with the assumption that the vibrations of the molecule involve only small displacements of the atoms, with respect to a well-defined equilibrium structure. There are also many examples where this assumption is not valid, they are called floppy molecules (or molecular complexes). 46 Floppy molecules These are molecules where some vibrations have large amplitudes. Examples: · internal rotation in ethane, CH3 —CH3 · inversion-tunneling in ammonia, NH3 A generalization of the Eckart approach was given by Sayvetz: in essence, he treated the “floppy” internal coordinates in the same way as Eckart treated the external coordinates. This imposes additional (Eckart-Sayvetz) conditions on the remaining internal coordinates. Van der Waals molecules These are weakly bound complexes of “normal” molecules. The molecules are held together by noncovalent forces: hydrogen bonding or Van der Waals (dispersion) forces. All intermolecular coordinates are “floppy” in this case. The “Van der Waals vibrations” against the weak intermolecular bonds have large amplitudes and much lower frequencies than the intramolecular vibrations. Except for the mode that involves the stretch of the intermolecular bond length, they are more like (hindered) internal rotations, rather than small-amplitude vibrations. The potential V is strongly anharmonic and often shows multiple equivalent global (and local) minima, instead of a single equilibrium structure. The barriers between these minima are relatively low and the complex can quantum mechanically tunnel through these barriers, which gives rise to additional splittings of the rotation-vibration levels. These splittings can be observed in spectra, especially if these are measured with high resolution. Also here one can define a rotating (BF) frame and transform the coordinates from the SF to the BF frame. Coriolis coupling between the internal and overall rotations is usually quite important, however. The potential V is simpler in BF coordinates, because it only depends on the internal coordinates. 47 Symmetry In semi-rigid molecules one can use the point group symmetry of the equilibrium structure to simplify the calculation of the (normal mode) vibrations and to derive selection rules. In “floppy” molecules or complexes one may apply the permutation-inversion symmetry group, which is based on the fact that the atomic nuclei are bosons or fermions. Not all permutations of identical nuclei are “feasible”, however, because some permutations may interconvert structures that are separated by high energy barriers. Only those permutation-inversion operations are called feasible that interconvert structures separated by small barriers, and give rise to observable level splittings due to tunneling through these barriers. The only permutations of identical nuclei that are feasible for semi-rigid molecules correspond to the overall rotations of the whole molecule. The permutation-inversion group is isomorphic to the point group of the equilibrium structure in this case. For “floppy” molecules it contains additional feasible permutation-inversion operations corresponding to tunneling between different, but equivalent, equilibrium structures (minima in the potential surface). For more information, see for example: P. R. Bunker and P. Jensen, Molecular Symmetry and Spectroscopy, NRC Research Press, Ottawa, Second edition (1998). Acknowledgement I am very grateful to Piotr Gniewek, for his excellent contributions to the preparation of these lecture notes. 48