Spin-orbit coupling: atom versus semiconductor crystal
Monique Combescot, Shiue-Yuan Shiau, and Valia Voliotis

TL;DR
This paper presents a novel approach to understanding spin-orbit coupling in semiconductors, emphasizing the role of crystal symmetry and degeneracy, and clarifies the nature of exciton-photon interactions.
Contribution
It introduces a symmetry-based method for analyzing valence band splitting due to spin-orbit coupling without relying on atomic-like angular momentum concepts.
Findings
Valence band eigenstates are determined by crystal symmetry and degeneracy.
Atomic and semiconductor spin-orbit eigenstates share the same structure despite different potentials.
A simple derivation of exciton-photon interaction based on spin conservation.
Abstract
We reconsider a key point in semiconductor physics, the splitting of the valence band states induced by the spin-orbit interaction, through a novel approach which uses neither the group theory formalism, nor the usual formulation valid for atoms but conceptually incorrect for periodic lattices, the angular momenta and having no meaning due to the absence of spherical symmetry. We show that for zinc-blende structures, the valence band eigenstates resulting from spin-orbit coupling are uniquely determined by: (i) the equivalence of the () crystal axes, (ii) the three-fold degeneracy of the valence band. The fact that these two conditions are also fulfilled by atomic states allows us to understand why the spin-orbit eigenstates for three-fold atomic and valence electrons have exactly the same structure, albeit the drastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Spin-orbit coupling: atom versus semiconductor crystal
Monique Combescot
Sorbonne Université, CNRS, Institut des NanoSciences de Paris, INSP, 75005-Paris, France
Shiue-Yuan Shiau
Physics Division, National Center for Theoretical Sciences, Hsinchu, 30013, Taiwan
Valia Voliotis
Sorbonne Université, CNRS, Institut des NanoSciences de Paris, INSP, 75005-Paris, France
Abstract
We reconsider a key point in semiconductor physics, the splitting of the valence band states induced by the spin-orbit interaction, through a novel approach which uses neither the group theory formalism, nor the usual formulation valid for atoms but conceptually incorrect for periodic lattices, the angular momenta L and J having no meaning due to the absence of spherical symmetry. We show that for zinc-blende structures, the valence band eigenstates resulting from spin-orbit coupling are uniquely determined by: (i) the equivalence of the () crystal axes, (ii) the three-fold degeneracy of the valence band. The fact that these two conditions are also fulfilled by atomic states allows us to understand why the spin-orbit eigenstates for three-fold atomic and valence electrons have exactly the same structure, albeit the drastic differences in the potential and electronic symmetries. We also come back to the commonly accepted understanding of the exciton-photon interaction in terms of bright and dark excitons having total angular momenta respectively and present a simple derivation of this interaction which only relies on spin conservation.
I Introduction
Spin-orbit coupling is a relativistic effect that is easy to study in the atomic physics context For a general introduction see, C. Cohen-Tannoudji et al. (1977). Regrettably, the same approach is commonly used for semiconductor crystals which do not have the spherical symmetry as do atomsYu and Cardona (1996); Lew-Yan-Voon and Willatzen (2009); Fishman (2010); Wenckebach (1999). In practical terms, it means that the orbital angular momentum L associated with the spherical harmonics , and the total angular momentum , have no meaning for electrons in a periodic crystal. Yet, taking the valence band in zinc-blende-like semiconductors as a atomic state leads to the same spin-orbit eigenstates and energy splittings as the ones correctly derived from group theory Ivchenko and Pikus (1997). In view of the physically obscure derivation based on group theory, it is tempting to adopt a pragmatic attitude by forgetting about the inconsistency of using the total angular momentum as quantum indices to characterize the spin-orbit eigenstates of semiconductor crystals.
The purpose of this work is to understand why the spin-orbit eigenstates for three-fold level in a zinc-blende-like semiconductor crystal with cubic symmetryDresselhaus et al. (1955), have the same structure as the atomic spin-orbit eigenstates, without resorting to group theory and the hard-to-grasp classification of the various bands. Here, we derive the effects of the spin-orbit coupling on crystal structures from scratch, by using only the fact that the valence band is three-fold and the () crystal axes are equivalent in a zinc-blende structure.
To better understand the deep reason why the atomic procedure incorrectly used for semiconductor crystals still leads to the correct spin-orbit eigenstates, we first rederive the spin-orbit splitting for atoms using two different methods: (i) in the first, standard, method, we from the very first line, make use of the spherical symmetry of the electrostatic potential felt by the electrons, as induced by the nucleus. This readily leads us to write the spin-orbit coupling in terms of and then of J; (ii) in the second method, we propose a pedestrian approach that does not start with any assumption on the symmetry of the potential. This second method is the one that can be used for semiconductor crystals because their electrostatic potential is not spherically symmetric, but has the lattice periodicity, , with a being a lattice vector. We show that the Clebsch-Gordan coefficients which relate J to L states and provide the spin-orbit atomic eigenstates, appear in a natural way for semiconductor valence-band eigenstates having a cubic symmetry. This result is not trivial at all because the electrostatic potentials felt by atomic and semiconductor electrons do not have the same symmetry; moreover, atomic states have an odd parity while orbital states for valence electrons can be even or odd.
Our pedestrian approach provides a transparent way to understand the structure of the semiconductor valence band, and its further coupling to photons, as necessary to correctly predict the very rich pattern of polarization effects that result from band symmetries. In particular, the spin-orbit coupling does not mix up and down spins in two valence states only, these two states being the ones out of which the two dark excitons are constructed.
The paper is organized as follows: in Sec. II, we recall the microscopic expression of the spin-orbit interaction and provide a short derivation of the spin-orbit interaction along Thomas’ understanding Thomas (1926), with details given in Appendix A. Section III deals with atoms. The spin-orbit eigenstates are derived in two different ways, the standard one that uses the total angular momentum J being definitely the smartest. Section IV deals with semiconductor crystals. We consider three-fold degenerate states with periodic symmetry and even or odd parity. We show that the zinc-blende-like semiconductor spin-orbit eigenstates have exactly the same structure as the ones of atomic states, and that this structure does not depend on the valence state parity. In Sec. V, we provide some key results on spin-orbit eigenstates within the language of group theory. We also provide a physical understanding that a posteriori explains the similarity in the spin-orbit eigenstates of atoms and cubic semiconductors. We discuss some implications for two-dimensional materials. We finally discuss the importance of separating the (intrinsic) spin subspace from the subspace in which electrons and photons move. This will in particular avoid the incorrect understanding of the exciton-photon interaction in terms of photons having a spin and dark excitons having a spin . We ultimately conclude.
II Spin-orbit interaction
The general expression of the spin-orbit interaction for an electron having an electrostatic energy , reads as For a general introduction see, C. Cohen-Tannoudji et al. (1977)
[TABLE]
where is the electron momentum operator and is the electron spin operator, the components of the operators being the Pauli matrices, . The prefactor is given by
[TABLE]
where is the free electron mass and is the speed of light. We wish to stress that the spin-orbit interaction given in Eq. (1) is valid for whatever potential . This general expression reduces to the well-known formula in the case of atoms due to the spherical symmetry. In the case of crystals having a periodic symmetry, we must stay with Eq. (1), which is less convenient to handle than the form.
The physical origin of the factor contained in has puzzled the leading physicists of the 1920’s for quite a long timeTomonaga (1997). Appendix A presents a detailed derivation of this factor along Thomas’ ideaThomas (1926). It relies on a succession of different physical effects that we find of interest to outline below.
(1) An electron with mass , charge , and spin S, has a magnetic moment ,
[TABLE]
the Landé factor being equal to 2 for the electron spin.
(2) In an external magnetic field , the electron energy associated with its magnetic moment reads
[TABLE]
(3) We now consider an electron moving with a velocity in the laboratory frame F in which exists an electromagnetic field . The Lorentz transformation gives the magnetic part of this field in a frame F*′* that moves at a velocity v with respect to the F frame, as
[TABLE]
(4) In the F*′* frame, the electron also feels an electric force, which at lowest order in is equal to . So, the charged electron feels an acceleration a given by
[TABLE]
Consequently, the frame in which the electron is at rest is not the F*′* frame that moves at a constant velocity v, but a frame that accelerates. As a result, Eq. (5), which results from a Lorentz transformation valid for constant velocity, does not give the correct magnetic field felt by the electron.
(5) It is possible to solve this problem by using the Dirac equationLandau and Lifschitz (1977a). This equation gives relativistic corrections associated with spin, up to the order, as
[TABLE]
The second term corresponds to the spin-orbit interaction. It is absent from the Pauli equation (see Eq. (67)), which only is correct up to the order.
(6) Actually, it is possible to bypass relativistic quantum theory by using Thomas’ argumentThomas (1926). It relies on the keen observation that the acceleration of the frame in which the electron is at rest, corresponds to a rotation with respect to the laboratory frame, with an angular precession velocity given by
[TABLE]
(see Appendix A.4 for details on this key result).
(7) The derivation then follows by noting that an electron spin S in a magnetic field H along the direction has an energy that makes it rotate around H with a frequency which corresponds to an angular precession velocity
[TABLE]
So, this magnetic field H gives to the spin-S electron a rotational kinetic energy that can be written as . Conversely, the angular precession velocity induced by the electron acceleration a in Eq. (6) corresponds to an effective magnetic field which gives the spin-S electron an energy
[TABLE]
This electron energy has to be added to the energy due to the magnetic field given in Eq. (5). So, the total energy of a spin-S electron moving with a velocity v in a field reads as
[TABLE]
Since , while the electrostatic energy can be written in terms of the electrostatic potential as , the second and third terms of the above equation combine to give the spin-orbit interaction as
[TABLE]
in agreement with Eqs. (1) and (2). The prefactor directly follows from given in Eq. (8). To prove this crucial result is rather lengthy; this is why we have relegated its detailed derivation to Appendix A.4.
III Spin-orbit splitting for atoms
In the case of atoms, the electrostatic potential due to the nuclear charge has spherical symmetry. So, for ,
[TABLE]
The spin-orbit interaction in Eq. (1) then reduces to
[TABLE]
where is a positive scalar that depends on as
[TABLE]
Indeed, the electrostatic potential felt by the electron is attractive, that is, negative, being minimum for small.
III.1 Derivation using the standard formulation
The standard way to derive the spin-orbit energy splitting of atomic levels is to note that is just L the electron orbital angular momentumFor a general introduction see, C. Cohen-Tannoudji et al. (1977); Baym (1974); Messiah (1995). So, Eq. (1) also reads
[TABLE]
where is the total angular momentum of the atomic electron. This spin-orbit coupling splits the ()-fold degeneracy of an electron with spin in a atomic level, into four-fold and two-fold states which correspond to with and . Since all states are made of orbital states and spin states, the above equation gives for the two different values,
[TABLE]
the bracket being equal to 1 for and to for . The above equation shows that the eigenstates correspond to the four-fold states and the two-fold states.
The normalized states read in terms of the states as (see Appendix B.2)
[TABLE]
for . By using Landau-Lifschitz phase factor for the spherical harmonicsLandau and Lifschitz (1977b), the states read in terms of the states as
[TABLE]
The orbital part of the atomic state (, where is the principal quantum number, corresponds to , which gives, for the atomic states labeled by ,
[TABLE]
Note that the state norm
[TABLE]
does not depend on ; so, this norm is also equal to the norms with .
Using Eq. (17), this gives the eigenvalues of the spin-orbit operator as
[TABLE]
with equal to for 2 and to for .
All this shows that the spin-orbit interaction, , induces a splitting between the -fold atomic states into eigenstates having an energy difference . Since is positive, the four states have an energy increase equal to , while the other two states have an energy decrease equal to , making the average energy unchanged by the spin-orbit interaction.
III.2 Pedestrian approach
The fact that depends only on readily leads to , that is, L, and ultimately J. However, for electrons in a periodic lattice, is periodic, ; so, we must find a procedure to derive spin-orbit eigenstates that does not use the orbital angular momentum L as in Eq. (14). To this end, we introduce the two vectors
[TABLE]
The spin-orbit interaction given in Eq. (1) then reads
[TABLE]
with , and similarly for .
As S_{\pm 1}\left|\eta/2\right\rangle=(\hbar/2)\big{(}1\mp\eta\big{)}\left|-\eta/2\right\rangle while , the spin part of leads to
[TABLE]
This evidences a key point of the derivation: the orbital operator associated with a spin flip is .
Next, we turn to the orbital part. The wave functions of the states for defined in Eq. (20), appear as
[TABLE]
where and . In the same way,
[TABLE]
For a spherical potential, , the components of are such that
[TABLE]
This makes all factors in equal to zero. So, we are left with , and and respectively proportional to and ; the other terms are obtained from cyclic permutations.
Next, we note that , but we cannot readily conclude that acting on gives because depends on . Actually, what we need to do is diagonalize the spin-orbit interaction in the degenerate subspace . In this subspace,
[TABLE]
Using Eq. (28), we find that is equal to zero when any two of are the same, like . The non-zero terms follow by cyclic permutations from
[TABLE]
where reads, using Eqs. (III.1) and (28),
[TABLE]
This leads, for the () states defined in Eq. (19), to
[TABLE]
which yield, with the help of Eq. (29),
[TABLE]
Using Eq. (25), we end with acting in the degenerate subspace as
[TABLE]
Equation (III.2) taken for readily shows that the states for are eigenstates of with eigenvalue , in agreement with Eq. (III.1). This equation also shows that the other four eigenstates are linear combinations of and , namely
[TABLE]
By noting that
[TABLE]
we see that is eigenstate of with the eigenvalue , provided that and . Non-zero values, solution of these two linear homogeneous equations, impose , which gives and . It is easy to check that the corresponding four states are and given in Eq. (18), with the spin-orbit shifts given in Eq. (III.1).
Obviously, this approach is not as smart as the one that uses angular momentum operators J and L. However, since J has no meaning for electrons in a periodic lattice, an approach that does not rely on J and L is mandatory. Let us now see what this second approach gives for the spin-orbit eigenstates of semiconductor crystals with cubic symmetry.
IV Spin-orbit splitting for semiconductor crystals
The first problem is to understand the consequences of having the brackets in Eqs. (III.2) and (27) different from zero, as for potentials without spherical symmetry. The second problem is to understand the consequences of the valence state parity because atomic states are odd, while for zinc-blende structures, the three-fold valence orbital states do not have a defined parity since these structures do not possess inversion symmetryLuttinger and Kohn (1955); Bir and Pikus (1974); Yu and Cardona (1996); Ivchenko and Pikus (1997).
IV.1 Periodic potential
To derive the effect of the spin-orbit interaction given in Eq. (1) for a periodic crystal, we fundamentally follow the procedure used in Sec.III.2, that is, we write the spin-orbit interaction as in Eq. (24) with defined in Eq. (23), and we handle the periodicity of the potential through its expansion
[TABLE]
on reciprocal lattice vectors Q that are such that ,* i.e.*, Q vectors quantized in , in order to fulfill . This gives
[TABLE]
IV.2 Valence and conduction states
The atomic levels are non-degenerate, with an even wave function , while the atomic levels are three-fold, with an odd wave function reading as for , or any linear combination, being arbitrary orthogonal axes due to the spherical symmetry of the problem.
In a crystal, there are two types of non-degenerate states improperly called : one type is even as atomic levels, the other type is odd. Similarly, there are two types of degenerate states improperly called ; they both are three-fold but one type is odd like atomic levels, while the other type is even. Due to the lack of defined parity, the semiconductor conduction and valence bands are linear combinations of these even and odd states.
Electrons in a periodic crystal are characterized by a momentum k and a band index . Their wave functions read
[TABLE]
for a sample volume . This wave function contains a part with momentum k quantized in , that just corresponds to a plane wave in free space, and a part that has the lattice periodicity, .
We look for the shift of the band extrema induced by the spin-orbit interaction. In a GaAs-like direct gap semiconductorKireev (1975); Klingshirn (1997), these extrema are located at , called point. To handle the periodicity, we do as for , that is, we expand it on the reciprocal lattice vectors, . Equation (40) then gives
[TABLE]
Next, we note that the valence states are characterized by an additional three-fold index that can still be labeled as , but are now the three axes of the cubic crystal at hand.
(i) Odd valence states are such that . In the reciprocal space, this implies that can be written as
[TABLE]
where .
(ii) Even valence states are such that . This implies that can be written as
[TABLE]
IV.3 in the degenerate subspace
To get the spin-orbit interaction acting on the three-fold degenerate states , we first have to calculate the matrix elements of , defined in Eq. (23), in this subspace.
By noting that
[TABLE]
we readily get, for given in Eq. (39),
[TABLE]
Next, as in Eq. (29), we look for acting in the degenerate subspace , namely
[TABLE]
The above matrix element follows from Eq. (IV.3) as
[TABLE]
with reducing to .
To go further, we note that the electrostatic potential of an electron in a cubic crystal with axes, fulfills
[TABLE]
which corresponds in real space to . We then find that, like for atoms, all the matrix elements in Eq. (IV.3) are equal to zero except
[TABLE]
and similar terms obtained from cyclic permutations.
IV.4 On the state parity
Equation (49) holds for even and odd states. To show it, let us consider for three-fold states having an odd parity, that is, for given in Eq. (42). This matrix element then appears as
[TABLE]
When , the above quantity is equal to zero for (or ), as seen by changing into and into . When , it also is equal to zero for , as seen by changing into and into .
The same argument holds for three-fold states having even parity, that is, for given in Eq. (43). Note that linear combinations of even and odd valence states also give zero because the operator is an even operator, .
The last step is to diagonalize the spin-orbit interaction for even or odd valence states. This diagonalization follows exactly the same procedure as the one for atoms in Sec.III.2. Therefore, whatever their parity, the three-fold orbital states with spin split into two degenerate states and four degenerate states, these states having exactly the same structure as the ones for atoms given in Eq. (18).
V Discussion
V.1 Spin-orbit splitting from group theory
The group theory formalism and its tables of charactersKoster (1957); Falicov (1966) commonly proposed to properly study the spin-orbit coupling in semiconductor crystalsCardona et al. (1988); Ivchenko and Pikus (1997) are, to our opinion, too cumbersome to deal with just three-fold orbital states. Still, in more complicated structures, using group theory may remain the only convenient way to tackle the problem. While this work purposely avoids using group theory, we nevertheless wish, for completeness, to recall some key results within this language.
Orbital states belong to the so-called “simple group”. The non-degenerate orbital state, which for atoms corresponds to , is called in the case of semiconductor crystals, while the three-fold orbital state which for atoms corresponds to , is called .
Including the spin degrees of freedom transforms the simple group into the “double group”. The state of the simple group gives rise to two states in the double group, called , which for an atom would correspond to the two states (see Fig. 1). In the same way, the three states of the simple group, give rise to six states in the double group, which are further split by the spin-orbit interaction into four states called , which correspond to the four states of atoms, and into two states called , which correspond to the two states of atoms. Note that despite the fact that they correspond to the same atomic quantum number , the and states are fundamentally different because they are made of orbital states that respectively are non-degenerate and three-fold degenerate. Being eigenstates of the spin-orbit interaction with different eigenvalues, these states are orthogonal.
V.2 Physical understanding
The fact that the eigenstates of the spin-orbit interaction for three-fold orbital states have the same structure regardless of the state parity, odd or even, and the potential symmetry, spherical or periodic, can be proven in a mathematically rigorous way. For a spherical potential as in the case of atoms, the state symmetry is handled in the real space, while for a periodic potential as in the case of semiconductor crystals, it is handled in the reciprocal space. Similarity in these dual spaces is largely due to the fact that the spin-orbit eigenstates are derived for electrons having two spin states only, and , as seen from Eq. (25). So, the spin can either stay the same or flip. For spin states quantized along the direction, the orbital operator associated with spin conservation is , while the orbital operator associated with spin flip is with . In a bulk cubic crystal, the physically relevant orbital states for and are and , whatever the degeneracy and symmetry of the electron states.
Taking this key point into account, the remaining task is to find the set of orthogonal combinations of spin and orbital states, built on and that fulfills Eq. (25), whatever the symmetry of the potential felt by the electron. Within group theory, orbital and spin states are mixed into the double group, which totally hides the state and potential symmetries. Indeed, the orbital state can be associated with the spin state or . Thanks to Eq. (25), we readily see that if is associated with , another orbital state has to enter the eigenstate and this state must have a spin. This is easily seen from the J eigenstates but the same argument stays valid for a periodic potential with cubic symmetry, that is, playing the same role. The proper combination of spin and orbital states just follows from the spin-conserving and spin-flipping operators and appearing in Eq. (25).
V.3 Two-dimensional materials
The proposed procedure can be extended to orbital states having a degeneracy higher than three-fold, like for materials, that have recently attracted a lot of interest. In single-layer grapheneKonschuh et al. (2010) and transition metal dichalcogenidesLiu et al. (2013), the orbital states appear to play a more important role in the spin-orbit splitting than the orbital states.
For single-layer graphene, the crystal symmetry at the K and K*′* points allow the two higher orbital states to enter into play in the bandKonschuh et al. (2010); so, the relevant orbital states of the problem at the Dirac points are and . The same equation (25) leads us to see that if is associated with , the other spin state that enters the eigenstate must have an orbital state , which is absent at the Dirac points. As a result, the spin-orbit splitting comes from the spin-conserving operator between , in addition to small second-order contribution from the state of the band and the states of the band.
For transition metal dichalcogenides, the situation is even more complex because all five orbital states from the metal atom and the orbital states from the chalcogen atom play a role in the spin-orbit splitting. The study of these complex materials is beyond the scope of the present work. Yet, in view of its simplicity for the orbital states, we expect the present procedure to be quite valuable to physically understand the spin-orbit eigenstates of these complex structures.
V.4 Misleading notations
We would like to end this work by stressing that not only it is physically incorrect to extend the spin-orbit procedure for atoms to periodic crystals but, far worse, labeling the spin-orbit eigenstates in the same way, , as if electrons in a crystal had an orbital momentum, is quite misleading. In particular, this tends to mess up the whole understanding of the exciton-photon interaction.
Indeed, what is commonly said is the following: valence holes come from valence states; they thus have an orbital momentum , which with their spin , gives them a total momentum with . Conduction electrons are also said to be in a state, with an orbital momentum ; so, they are only labeled by their spin . This would give to the conduction electron-valence hole pairs a total momentum , with . It is then claimed that the two excitons made from electron-hole pairs are dark because they cannot be coupled to photons since photons have a “spin” which correspond to polarizations and .
The correct understanding is quite different, even if in the very end only two exciton states are not coupled to photons, these excitons having the lowest energy for the very same reason that they are dark.
First, photons are known not to have spin in the proper sense but a two-fold polarization associated with a vector in the two-dimensional plane perpendicular to the photon propagation axis . This polarization vector is a linear combination of , the one associated with circular polarizations being
[TABLE]
In an electron-photon interaction, the photon does not act on the electron spin but, via its polarization, it induces a change in the electron orbital wave function. So, the fact that the photon “spin” is never equal to , cannot be linked to the fact that excitons, commonly labeled as or , are dark.
Actually, it is just because the photon does not change the electron spin that some exciton states are dark, the fact that there are two dark states only being a direct consequence of the spin-orbit interaction.
When going from the valence band to the conduction band under a photon absorption, the electron conserves its spin. As the hole spin is opposite to the spin of the missing valence electron, the electron-hole pair coupled to photon through its absorption, has a total spin equal to zero (see Fig. 2).
In the absence of spin-orbit interaction, there are hole states, labeled by and and the two electron states labeled by their spin only, for a non-degenerate conduction band; so, they would be six excitons with total spin equal to zero, namely and .
We have shown that the spin-orbit interaction couples the spin and orbital indices of the valence electron in such a way that there are two states only with a well-defined spin, namely , the other four valence states being linear combinations of states with and spins. The destruction of such valence electrons which are pure in spin, leads to hole states also pure in spin. So, when combined with a conduction electron having a spin, they form two dark excitons with total spin 1 and two bright excitons with total spin 0.
As a result, the existence of two dark exciton states only is due to the fact that: (i) the electron-photon interaction conserves the spin, and (ii) two valence states only stay pure of spin under the spin-orbit coupling.
All this shows that inadequate notations tend to lead to incorrect physical understanding. Even if done for years, we suggest to stop labeling spin-orbit valence states by as for atomic states, but by and call them valence spin-orbit indices. This would prevent considering as a naive angular momentum and adding it to the conduction spin, to end with a “total angular momentum of the exciton”, which is physically meaningless.
Finally, Coulomb interaction conserves spin, just as electron-photon interaction does. So, the electron-hole pairs that suffer interband Coulomb interaction have a total spin equal to zero, just like the pairs that are coupled to photons (see Fig. 3). This (repulsive) Coulomb interaction pushes the energy of bright excitons above the one of dark excitons. So, the excitons that have the lowest energy, are dark for the very same reason that they are not coupled to photons.
VI Conclusion
This work considers a very fundamental aspect of semiconductor crystals that is either ignored when treating the valence band as a true state, or not physically understood when using group theory.
We present a direct procedure—easy to follow by anyone with no background on group theory—to derive the spin-orbit energy shifts of three-fold orbital states in cubic semiconductor crystals, whatever the parity of these states. We show that the state degeneracy matters, but not the state parity, even or odd, nor the potential symmetry, spherical or periodic.
We show that the spin-orbit eigenstates have the same structure for semiconductor crystals and atoms with same orbital degeneracy, and we physically explain why this has to be so. Nevertheless, we urge to stop calling these valence states through () indices like atoms, and mostly to stop relating these indices to “total angular momentum” because L and J only have a meaning for problem with spherical symmetry.
The simplicity of the approach we here propose to spin-orbit interaction should appear as quite valuable in the case of complex materials having valence electrons with degeneracy higher than three-fold, as many materials of today major interest. We leave these studies to future works.
VI.1 Acknowledgments
The authors would like to thank Yia-Chung CHANG for enlightening arguments on the parity of valence and conduction states, and also Benoit EBLE for discussions at the beginning of this work.
Appendix A Spin-orbit interaction
The spin-orbit interaction is most often described in the context of atomic physics. This is why it is commonly identified with the interaction. To get the whole story straight is quite complicated, in particular the derivation of the factor which enters the prefactor of the coupling. The derivation we here present is a combination of what can be found in various textbooks, in particular Baym Baym (1974), TomonagaTomonaga (1997), and Landau-LifschitzLandau and Lifschitz (1977a), with the aim to make the presentation as simple as possible, a particular attention being paid on choosing transparent notations.
A.1 Magnetic moments
We consider an electron with mass and charge .
The electron angular momentum reads as
[TABLE]
where is the electron kinetic momentum, and is an integer, the projection on the axis taking any integer values between . The electron magnetic moment associated with its orbital angular momentum is
[TABLE]
where is called Bohr magneton. Its value is erg/Gauss.
The electron also has a magnetic moment associated with its spin where the components of the vector are the Pauli matrices . It reads
[TABLE]
the Landé factor for electron being .
So, the total magnetic moment of an electron with orbital angular momentum L and spin S is given by
[TABLE]
On the other hand, in an external magnetic field , that is, a field which has nothing to do with the charge and velocity of the particle at hand, this particle has a magnetic energy associated with the magnetic moment M equal to
[TABLE]
A.2 Magnetic field
Let us consider an electron moving with a velocity in an electrostatic potential due to an external electric field . Its Hamiltonian reads
[TABLE]
We put this electron in a strong external magnetic field . (We will see later on why this field has to be strong for the following to be true). The vector potential associated with reads, in the Coulomb gauge, as
[TABLE]
which indeed fulfills . If we neglect spin, the electron Hamiltonian would be
[TABLE]
The spin magnetic moment brings an additional energy which, according to Eq. (56), reads
[TABLE]
So, with this spin contribution, the electron Hamiltonian appears as
[TABLE]
By noting that , as fulfilled by given in Eq. (58), we can rewrite the term linear in as
[TABLE]
The Hamiltonian given in Eq. (61) then appears at first order in as
[TABLE]
The second term is just the magnetic energy given in Eq. (56) associated with the electron magnetic moment given in Eq. (55).
The above Hamiltonian is valid when the external field is large compared to the internal field felt by the moving electron. This internal field follows from the fact that, due to the Lorentz transformation, the electromagnetic fields become in a frame moving at a constant velocity v. The link between and reads
[TABLE]
So, the field components parallel to v do not change under the Lorentz transformation, but the field components perpendicular to v do. These equations are commonly referred to as Biot-Savart law. Consequently, using Eqs. (54, 64), an electron with velocity v feels an additional magnetic energy given by
[TABLE]
Actually, this is not fully correct because the electron in an electrostatic field feels a force that produces an acceleration; so, the electron velocity changes. As a result, the Lorentz transformation (64), valid for a frame moving at constant velocity, cannot correctly give the internal magnetic field felt by the accelerating electron. As first shown by ThomasThomas (1926) and confirmed by DiracDirac (1958), the changing velocity has the effect of changing into in Eq. (65). Therefore, the spin contribution to the energy of a spin-S electron having a velocity v in an external magnetic field and electrostatic field , ultimately reads as
[TABLE]
A.3 Pauli and Dirac equations
The Dirac equation for an electron in an external magnetic field reduces, up to terms in , to the Pauli equationPauli (1927), namely
[TABLE]
By writing the spin term as
[TABLE]
we readily see that the electron Landé factor, , implicitly appears in this equation.
If we go one step further and write the Dirac equationLandau and Lifschitz (1977a) up to terms in , we find
[TABLE]
with being the Laplace operator.
(i) The first bracket follows from the expansion of
[TABLE]
(ii) The second bracket of Eq. (69) corresponds to the spin contribution. The part coming from the electrostatic field can be rewritten as
[TABLE]
since . This term leads to the spin-orbit interaction, absent in the Pauli equation given in Eq. (67). So, the total spin contribution to the Dirac equation up to terms reads as
[TABLE]
The first term corresponds to the fields seen in a frame having a constant velocity v, which are Lorentz-transformed from the fields of the laboratory frame. The second term corresponds to the Thomas’ correction due to the electron acceleration induced by the electrostatic force . This force leads to effectively replacing with but in the term only.
(iii) The second term in the last bracket of Eq. (69) differs from zero when local charges are present, as can be seen from the Ohm’s law, where is the charge density.
A.4 Thomas’ understanding
The Dirac equation definitely gives the correct spin-orbit interaction, including its numerical prefactor. However, it is hard from it to physically catch why the Landé factor is changed to in one part only of the magnetic energy.
The derivation of the change from to proposed by Thomas has the great advantage to trace its physics to the fact that the electron velocity is not constant, due to the presence of the electrostatic field and the force this field induces on the electron. Thomas’ result is identical to the one obtained from the Dirac equation taken up to terms in . Actually, his result pushed Pauli to admit that relativistic quantum theory is not the only way to handle the spin properly.
Thomas’ derivation can be divided into four steps:
(1) First, we introduce a frame F that we call laboratory frame, a frame moving with a velocity v along x, and another frame moving with a velocity , with u along y.
The coordinates in the F frame and in the frame are related by a Lorentz transformation, namely
[TABLE]
As the frame has a u velocity with respect to the frame, the coordinates in the frame and in the frame are also related by a Lorentz transformation. So, in the frame read in terms of in the F frame as
[TABLE]
(2) We now consider the origin of the frame. Its coordinates in the frame are by construction whatever . Equation (74) gives its coordinates () in the F frame through
[TABLE]
which leads to
[TABLE]
So, the components of the velocity of the frame origin, , are in the F frame not equal to but to (see Fig. 4a).
In the same way, the coordinates of the O origin of the F frame are whatever . Equation (74) gives them in the frame through
[TABLE]
From these equations, we find that the components of the velocity of the F frame origin, O, are in the frame not equal to () but to (see Fig. 4b).
Before going further, let us discuss these results. The two velocities and have the same modulus . This value differs from the modulus of the velocity () of the frame with respect to the F frame when . Indeed, when or is equal to zero, and the F and frames are related by a bare translation, as seen from Fig. 5. By contrast, when , the two frames are related by a rotation around the axis. To obtain this rotation, we note that obtained from by a rotation around , are given by (see Fig. 6)
[TABLE]
When used for the velocity in the frame and the velocity in the F frame, that is, for and (see Fig. 4), these two equations give, since for ,
[TABLE]
So, the rotation angle of the frame with respect to the F frame is related to through
[TABLE]
For , the above equations give , that is no rotation, as expected.
(3) Next, we consider the origin of the frame in the F frame. When , the point has a velocity v along x. After a short time delay , its velocity is , which corresponds to a velocity change along y. The coordinates of in the F frame are then given, according to Eq. (76) and as illustrated in Fig. 4a, by
[TABLE]
When compared to the velocity in the F frame for , the velocity after a time delay corresponds to an acceleration
[TABLE]
along y. This acceleration brings a rotation angle between the F and frames, which according to Eq. (81) for small, is given by
[TABLE]
As a result, the origin of the frame rotates with respect to the F frame, with an angular precession velocity along z, which for reduces, due to Eqs. (83) and (84), to
[TABLE]
The above derivation is done by considering a velocity change orthogonal to v, i.e., an acceleration a orthogonal to v. When is parallel to v, no rotation occurs. This supports the fact that the angular precession velocity for arbitrary v and a has the following form
[TABLE]
The rotation reduces to zero when or when a is along v.
(4) The last step is to use the above results for an electron moving in an external electromagnetic field .
First, we note that in a magnetic field H, a spin-S electron with magnetic moment (see Eq. (54)) rotates with an angular precession velocity
[TABLE]
This follows from the fact that the energy of a magnetic moment M in a magnetic field H gives rise to an interaction term
[TABLE]
with for H taken along z. The eigenstates are with eigenvalues . Thus, the time evolution of a spin, which is along the direction when (see Fig. 6b), reads as
[TABLE]
This shows that does not change with time while rotates with a velocity around the z axis, parallel to H, in agreement with Eq. (87).
Next, we note that the spin-S particle located in a frame that moves with a velocity v with respect to the laboratory frame F in which the electromagnetic field is , feels a magnetic field . This magnetic field induces an angular precession velocity given, according to Eq. (86), by
[TABLE]
The electron also feels in the frame an electrostatic force which leads to an acceleration a given by . This acceleration brings an additional angular precession velocity given by Eq. (86). So, we end with
[TABLE]
According to Eq. (87), this angular velocity produces an effective magnetic field , that is, a magnetic energy
[TABLE]
in agreement with Eq. (72).
The above derivation, which essentially follows Thomas’ idea, has the great advantage to shed light on the physical origin of the spin term appearing in the Dirac equation.
Appendix B Standard derivation
B.1 Spherical harmonics
The spherical harmonics for read, according to Landau-LifschitzLandau and Lifschitz (1977b) phase factor, as
[TABLE]
Compared to more common expressions, they contain an additional phase factor that insures , as required from particle-antiparticle symmetry. This particle symmetry is necessary for the consistency of problems that deal with valence holes, like semiconductor excitons.
B.2 Derivation of the eigenstates
By applying to
[TABLE]
we find the other two states as
[TABLE]
The two states made from the same states as but orthogonal to them, are given by
[TABLE]
within a phase factor irrelevant for the problem at hand.
B.3 Orbital wave function
The orbital wave functions with defined in Eq. (20), can be written as
[TABLE]
with . The orthonormalization of these states is fulfilled by such that
[TABLE]
Replacing by or and ultimately by when the three axes play the same role as for cubic symmetry, yields
[TABLE]
and similarly for the states with .
B.4 eigenstates
The expressions of the three-fold orbital states in terms of the states (see Eq. (19)), give the two-fold eigenstates , given in Eq. (18), as
[TABLE]
from which we get
[TABLE]
As the even part of is equal to , the above quantity reduces to
[TABLE]
In the same way, the four-fold eigenstates associated with read as
[TABLE]
Their eigenvalue is given by . Therefore, the spin-orbit interaction brings a splitting between the orbital states with : four states have an energy shift and two states have an energy shift .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1For a general introduction see, C. Cohen-Tannoudji et al. (1977) For a general introduction see, C. Cohen-Tannoudji, B. Diu, and F. Laloë, Quantum Mechanics (Wiley, New York, 1977).
- 2Yu and Cardona (1996) P. Yu and M. Cardona, Fundamentals of Semiconductors (Springer, 1996).
- 3Lew-Yan-Voon and Willatzen (2009) L. C. Lew-Yan-Voon and M. Willatzen, The k.p method (Springer, 2009).
- 4Fishman (2010) G. Fishman, Semiconducteurs: les bases de la théorie k.p (Ecole Polytechnique, 2010).
- 5Wenckebach (1999) T. Wenckebach, Essentials of Semiconductor Physics (Wiley, New York, 1999).
- 6Ivchenko and Pikus (1997) E. Ivchenko and G. E. Pikus, Superlattices and Other Heterostructures, Springer Series in Solid-State Sciences, vol 110 (Springer, 1997).
- 7Dresselhaus et al. (1955) G. Dresselhaus, A. F. Kip, and C. Kittel, Phys. Rev. 98 , 368 (1955).
- 8Thomas (1926) L. H. Thomas, Nature 117 , 514 (1926).
