Why the effective-mass approximation works so well for nano-structures
Pedro Pereyra

TL;DR
This paper re-derives the effective-mass approximation within the theory of finite periodic systems, providing a theoretical explanation for its success in nanostructures and demonstrating its validity through optical-response calculations.
Contribution
It offers a new derivation of the effective-mass approximation based on finite periodic systems theory, clarifying why it works well for nano-structures.
Findings
The derivation justifies the effective-mass approximation for nanostructures.
Explicit calculations show rapidly varying eigenfunctions can be neglected in inter-band transition matrix elements.
The approach explains the approximation's success in optical properties of nano-structures.
Abstract
The reason why the effective-mass approximation, derived for wave packets constructed from infinite-periodic-systems' wave functions, works so well with nanoscopic structures, has been an enigma and a challenge for theorists. To explain and clarify this issue, we re-derive the effective-mass approximation in the framework of the theory of finite periodic systems, i.e., using energy eigenvalues and fast-varying eigenfunctions, obtained with analytical methods where the finiteness of the number of primitive cells per layer, in the direction of growth, is a prerequisite and an essential condition. This derivation justifies and explains why the effective-mass approximation works so well for nano-structures. We show also with explicit optical-response calculations that the rapidly varying eigenfunctions of the one-band wave functions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Why the effective-mass approximation works so well for nano-structures
Pedro Pereyra
Física Teórica y Materia Condensada, UAM-Azcapotzalco, C.P. 02200, Ciudad de México, México
Abstract
The reason why the effective-mass approximation works so well with nanoscopic structures has been an enigma and a challenge for theorists. To explain this issue, we re-derive the effective-mass approximation using, instead of the wave functions for infinite-periodic-systems and the ensuing continuous bands, the eigenfunctions and eigenvalues obtained in the theory of finite periodic systems, where the finiteness of the number of primitive cells in the nanoscopic layers, is a prerequisite and an essential condition. This derivation justifies and shows why this approximation works so well for nano-structures. We show also with explicit optical-response calculations that the rapidly varying eigenfunctions of the one-band wave functions , can be safely dropped out for the calculation of inter-band transition matrix elements.
I Introduction
The effective-mass approximation (EMA) is, without a doubt, the most recurrent and widely used approximation in theoretical calculations involving semiconductor structures. The formal justification of why this approximation, where the wave packets are constructed in terms of infinite periodic system wave functions,Wannier ; Slater ; Luttinger ; Altarelli ; AltarelliLesHuches ; Pollak ; Dingle works so well for finite micro and nano-structures, has been an enigma and a challenge for theorists. Despite the various guises of the EMA, the correct explanation has remained elusive. M. G. Burt in a number of papersBurt analysed critically the drawbacks of the “conventional” EMA, and tried to overcome these attempts providing a “new” envelope-function method, using again wave functions of infinite periodic systems. Now that the theory of finite periodic systems (TFPS) has evolved and has shown the ability to obtain the true, bona fide, energy eigenvalues and eigenfunctions of finite periodic structures with a finite number of unit cells,Abeles ; Erdos ; Claro1982 ; Ricco ; Vezzetti ; Kolatas ; Griffiths ; Peisakovich ; PereyraPRL ; PereyraJPA ; PereyraCastillo it is worth reviewing and re-deriving the EMA within the TFPS to understand why it works so well. The purpose of this letter is to re-derive the effective mass approximation taking into account the system and layers finiteness as the fundamental requisite.
Superlattices and layered structures are characterized by the simultaneous presence of two length scales: the crystalline unit cells in the semiconductor layers of atomic size and the layers widths. While the primitive cells lengths are of the order of 0.5nm, depending on the the semiconductor, the layers widths are of the order of 5nm, depending on the number of atomic cells per layer. This important difference in size is behind the factorization of the heterostructure wave function (HWF) in terms of rapid and slowly varying functions. The finiteness of the number of primitive cells, in the direction of growth, of layer (=A,B,…), and the finiteness of the number of layers in the heterostructure or number of superlattice (SL) unit cells , is not only an obvious characteristic, but also an essential requisite in the TFPS.
II Finiteness of periodic layers. An outline of the TFPS
Soon after the semiconductor SLs were introduced,Keldysh1962 ; EsakiTsu1970 and the subbands (or minibands) structures of direct and indirect band gap semiconductors were experimentally and theoretically confirmed,Esaki1972 ; Dingle1974 ; Mukherji1975 ; Miller1976 ; Chang1977 ; SaiHalaszChang1978 ; Capasso1986 ; LuoFurdyna1990 ; Rauch1997 ; Petrov1997 ; Heer1998 Leo Esaki noticed that whereas in reality SLs contain a finite number of layers, with a finite number of atomic cells each, the standard theoretical approaches tacitly assume that SLs are infinite-periodic structures with alternating layers containing also an infinite number of atomic cells.EsakiLesHuches In fact, the HWF and SL wave functions are generallyLuttinger ; Altarelli ; Dresselhauss ; Breitenecker ; Sanders ; Bastard1987NATO ; Smith ; Baraff written as , with the periodic part of the host-semiconductor Bloch’s function at band , and the envelope wave function, with the perpendicular wave number assumed, generally, a constant of motion.Bastard1987NATO At the end, it is common to assume wave functions set up from wave functions of only one band, evaluated at the center of the Brillouin zone or at the subband edge . For SLs the envelope function is, again, written in terms of Bloch-type functions , characterized by a subband index and a continuous wave number that is then artificially discretized, via the cyclic boundary condition.
On the other side, the theory of finite periodic systems has grown, and has been generalized to include periodic structures with arbitrary potential profiles, arbitrary but finite number of unit cells and arbitrary but finite number of propagating modes for open, bounded and quasi-bounded periodic structures.PereyraPRL ; PereyraJPA ; PereyraCastillo ; Pereyra2005 The TFPS is based on the transfer matrix properties and the rigorous fulfillment of continuity conditions, that make possible to express the -cells transfer matrix as , where , for time reversal invariant systems, is the single-cell transfer matrix of dimention 2N$$\times$$2N
[TABLE]
The accurate calculation of this matrix is crucial in this approach. The complex matrix functions and depend strongly on the atomic or heterostructure potential profiles. The relation
[TABLE]
that was the source of errors in numerical calculations,Luque has been rigorously transformed, after defining the matrix function , into the matrix-recurrence relationPereyraPRL ; PereyraJPA
[TABLE]
with analytic solutions. In the single mode approximation, of interest here, this relation becomes the recurrence relation of Chebyshev polynomials of the second kind , evaluated at the real part of . The -cell transfer matrix elements, and , can straightforwardly be determined, through the simple relations
[TABLE]
The eigenvalues of any quasi-bounded (qb) periodic system defined between and , see figure 1, with , can be obtained by solving the equationPereyra2005
[TABLE]
and are the wave numbers at the left (right) and right (left) of the discontinuity point () and the imaginary part of . The eigenfunctions of the quasi-bounded superlattice are given byPereyra2005
[TABLE]
with a normalization constant and any point in the cell. , ,… are matrix elements of the transfer matrix that connects the state vectors at points separated by exactly unit cells. , … , where stands for part of a unit cell, are the matrix elements of the transfer matrix that connects the state vectors at and , for .
Our purpose here is to derive the effective mass approximation for the Schrödinger equation of a layered semiconductor heterostructure , using the eigenvalues and eigenfunctions obtained in the TFPS. We will assume, without loss of generality, that our system is a binary structure , where the periodic semiconductor layers and contain and unit cells and , respectively, in the growing direction . We will show that the effective-mass approximation (EMA) can be derived when the heterostructure wave function is written as the product , where is the envelope function and is the fast-varying function obtained in the TFPS, evaluated at the band-edges defined by the energy band index and the intra-band (or wave number) index . In the particular case of periodic heterostructures, i.e. of SLs , the envelope functions are straightforwardly obtained in the EMA and the TFPS. It is worth emphasizing that since the transfer matrices are the matrix representation of the continuity and boundary conditions and the phase evolution of the quantum states, it is clear that the fast-varying and envelope wave functions, obtained in the TFPS, fulfill the continuity and boundary conditions. We will show also, for a specific example, that the optical response calculated with the matrix elements is practically the same as the optical response obtained with the matrix elements , were the fast-varying wave functions are ignored.
III An alternative derivation of the effective-mass approximation
Suppose now that for each layer (with equal or ) we can write the one-particle Schrödinger equation
[TABLE]
where the potential is periodic, at least in the growing direction . To simplify this problem we can follow the confined geometry method in Ref. [Bagwell, ] and the multichannel transfer matrix method in Refs. [PereyraPRL, ] and [PereyraJPA, ]. If we assume that the transverse widths are and and we write the potential as the sum of a confining potential , which is infinite for and , and the function periodic in , the orthonormal wave functions , which are solutions of
[TABLE]
can be used to express the wave function as
[TABLE]
If we replace this function in the Schrödinger equation (11), multiply from the left by and integrate upon and , we obtain the set of coupled equations
[TABLE]
Here is the number of propagating modes in layer , or the number of open channels (defined by the condition ), and
[TABLE]
are the coupling-channels matrix elements. In this way the 3D multichannel problem is reduced into the 1D multichannel problem. It was shown in Refs. [PereyraPRL, ] and [PereyraJPA, ], and mentioned before, that a general solution for the 1D multichannel periodic system can be obtained in terms of the matrix polynomials , when the single-cell transfer matrix is known. In actual semiconductor layers, the number of propagating modes depends on the Fermi energy and the cross section . When the multichannel problem for a specific semiconductor , with unit cells is solved, one obtains the energy eigenvalues (which determine the conduction and valence bands) and the corresponding eigenfuntions . In the widely used 1D one channel approximation, with , and , equation (14) becomes
[TABLE]
In this limit and given the periodic atomic potentials and , in the semiconductor layers and , one can obtain the unit-cell transfer matrices and and determine, applying the TFPS, the band structures and , and using the Eq. (II), the eigenfunctions and . A very good approximation for the atomic potentials and , are the effective potentials in the Hartree-Fock approximation. The quantum numbers denote the bands, and the quantum numbers the intra-band energy levels. We will denote the valence and the conduction bands with =c=1 and =v=2, respectively. The intra-band energy levels correspond to 1, 2, … , +1. In terms of these energies the fundamental energy gap in layer is given by
[TABLE]
with the first energy eigenvalue of the conduction band, i.e. the conduction band-edge denoted later as , and the last energy eigenvalue of the valence band, i.e. the upper-edge of the valence band. As is well known, the band edges of layers and do not coincide, in general (see figure 2), and their difference gives rise to the conduction and valence band split offs, as well as, to piecewise constant superlattice or heterostructure potential. Bastard1987NATO We will assume from here on that the semiconductor layers and are such that . If energies are below the barrier height (), see inset in figure 2, the eigenfunctions are propagating functions while are evanescent.Pereyra2005
For each value of the quantum number we have the corresponding wave number . To keep some analogy with conventional notation, we can represent the energy eigenvalues as or just as , that can be written also as , keeping in mind that is discrete.
It is clear that if we are able to determine the eigenvalues and eigenfunctions , we are close to obtain the full solution for the heterostructure or SL. Having the wave functions , we must still fulfill the continuity and boundary conditions at the layered structure interfaces. Although this task could, in principle, be accomplished, it is not so simple for these functions (as for the envelope functions) and it is not our purpose here. We will, instead, turn our attention into the derivation of the effective mass approximation based on the existence of the set of rapidly-varying orthogonal functions .
To derive the EMA in the TFPS we need to expand the heterostructure or SL wave functions in terms of the local wave functions and , defined inside the layers and respectively. To simplify the discussion let us assume that we have the SL . If , with the width of layer , the length of the SL unit-cell, and H(w) is the Heaviside function, we can write a rapidly-varying wave function as (see figure 3)
[TABLE]
As mentioned before, in the conventional derivations of the effective-mass approximation, the wave functions inside each layer are expanded in terms of the periodic parts of the band-edge Bloch functions, or , which are generally assumed to be equal.Enderleinpg252 ; Bastardpg67 Setting up the SL wave function , the assumptions of only one-band and small k-vectors are also made.Enderleinpg252 In the theory of finite periodic systems, the bands and wave functions and are the energy eigenvalues and the eigenfunctions of the periodic systems , . In figure 4 we show a simplified calculation in the TFPS of the energy spectrumsimplified and transmission coefficients for a specific (confined and open) semiconductor , with energy gap 2.6eV and unit-cell length = 5.15nm. On the left hand side of figure 4, we show the valence and the conduction bands (VB and CB) of the periodic sequence bounded by cladding layers , and, on the right hand side, the transmission coefficients through the same semiconductor but open. At the top of the left hand side column, we plot also the subbands (or minibands) of the SL for 2.6eV, 2.9eV, 5.15nm and =10. These graphs show that as the layer width gets thinner, the energy levels separation, , and the energy-levels widths, , increase. On the other hand, it is known that whereas the energy gap remains constant when the number of unit cells varies, the subbands of the superlattice , for a fixed barrier width , move with the band-edge energy level upwards when decreases, and downwards when , hence , increases. This behavior of the energy spectra, justifies the one-band ‘ansatz’ and strengthens the relevance of the band-edge functions as the number of unit cells gets smaller. In the specific example of figure 4, the level width is of the order of the subband widths 10meV), and the energy levels separation for a semiconductor with 5 ( 25nm) is approximately 600meV, which is much larger than the bands split off in the conduction and valence bands of layers and . Thus, in order to define the heterostructure or SL wave function in terms of the envelope and the fast-varying functions, it is justified to consider the band-edge and one-band assumptions. Therefore, we can consider the expansion
[TABLE]
Here and in the following, the quantum numbers and represent the set and , respectively. For a simple and compact notation, we will denote the expansion coefficient , known also as the envelope function, as or . If we introduce the function of Eq. (20) into the SL Schrödinger equation
[TABLE]
where
[TABLE]
multiply by and integrate, we have
[TABLE]
Since
[TABLE]
the sectionally constant periodic potential , known as the split off, appears here naturally as a consequence of the difference in the energy band structures of layers and , both in the conduction and valence bands. Therefore, we are left with
[TABLE]
We can now, as usual, multiply by and sum the Fourier series to obtain
[TABLE]
If we further approximate by a quadratic function of , near the band edge, assuming that the k-vector at the edge is small and an effective mass , defined as usual for each layer, we have
[TABLE]
with and =1 for the conduction band and and =+1 for the valence band. If we define the energy eigenvalues
[TABLE]
measured from the band edges, we can write the Schrödinger equation in the effective mass approximation
[TABLE]
that we were looking for and was used for SLs and heterostures, without a specific proof. As mentioned before, for SLs we can use the TFPS to solve this equation and to determine the eigenvalues and the eigenfunctions , known as envelope functions. It is worth noting that this derivation of EMA does not require that the layered structure be periodic. Therefore, the EMA is valid for any layered heterostructure.
All the assumptions behind this derivation imply that the wave functions can be written as
[TABLE]
with the SL eigenfunction (envelope functions) and the rapid oscillating wave functions. In figure 5 we plot the functions and , in the conduction band, and the functions and of the valence band. these functions can in principle be determined within the TFPS.
Dealing with transport properties, one can neglect the function , however, for calculations involving two bands, the whole wave function should, in principle, be considered. We will show now that the fast-varying factor can effectively be ignored in optical response calculations.
IV On the redundancy of the fast-varying functions
To determine the effect of the rapidly-oscillating factor on the optical response, let us consider the blue emitting superlattice studied in Refs. [NakamuraPaper, ] and [PereyraEPL, ]. We will calculate the optical response
[TABLE]
taking into account the fast-varying functions , which means = = and = = . These results are compared in figure (6) with the optical response
[TABLE]
calculated in Ref. [PereyraEPL, ], ignoring the fast-varying functions. As was shown in this reference and can be seen in figure 6, this optical response agrees extremely well with the experimental results in panel (c). NakamuraPaper ; PereyraEPL In (32) and (33), is the emitted photon energy, the energy levels in the first subband of the CB, the (heavy hole) energy levels in the second subband of the VB, the exciton binding energy, the occupation probabilities and the level broadening energy.
Besides the overall amplification, by a factor of 2.4, our calculations show that the rapidly-varying functions have no effect on the optical spectrum.
According with the mean value theorem for definite integrals, the optical response in equation (32) can be written as
[TABLE]
with a number, which in principle depends on the quantum numbers and . Specific calculations show that this factor is almost constant (see figure 7), and consistent with the differences in the numerical values of the optical responses and in figure 6.
V Conclusions
We have derived the effective mass approximation for the Schrödiger equation of layered hetrostructures, based on the energy eigenvalues and rapidly- oscillating eigenfunctions obtained, for each layer, in the theory of finite periodic systems. This derivation that is based on physical quantities of finite structures explains why the EMA works so well when applied to this kind of systems. We have shown also that, in order to calculate interband transition matrix elements, the rapidly-oscillating wave functions , that should be multiplied by the envelope functions, , can safely be ignored.
VI Acknowledgement
I acknowledge the useful comments of Herbert P. Simanjuntak.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) G. H. Wannier Phys. Rev. 52 191 (1937).
- 2(2) J. M. Luttinger and W. Kohn Phys. Rev. 97 869 (1955)
- 3(3) M. Allarelli and F. Bassani Handbook on Semiconductom vol I Band Theory and Transport Properties vol 1 , ed W. Paul (Amsterdam: North-Holland 1982) p 269.
- 4(4) M. Altarelli in Heterojunctions and Semiconductor Superlattices: Proceedings of the Winter School Les Houches Ed. by Guy Allan and Gerald Bastard, France, March 12-21, 1985.
- 5(5) F. H. Pollak and M. Cardona J. Phys. Chem. 27 423 (1966)
- 6(6) R. Dingle, W. Wiegmann and C. H. Henry Phys. Rev. Lett. 33 , 827 (1974).
- 7(7) J. C. Slater, Phys. Rev. 76 , 452 (1949).
- 8(8) M. G. Burt J. Phys: Condens. Matter 4 6551490(1992), and references therein.
