Why the effective-mass approximation works so well for nano-structures

Pedro Pereyra

arXiv:1706.08673·cond-mat.mtrl-sci·February 21, 2018

TL;DR

This paper re-derives the effective-mass approximation within the theory of finite periodic systems, providing a theoretical explanation for its success in nanostructures and demonstrating its validity through optical-response calculations.

Contribution

It offers a new derivation of the effective-mass approximation based on finite periodic systems theory, clarifying why it works well for nano-structures.

Findings

01

The derivation justifies the effective-mass approximation for nanostructures.

02

Explicit calculations show rapidly varying eigenfunctions can be neglected in inter-band transition matrix elements.

03

The approach explains the approximation's success in optical properties of nano-structures.

Abstract

The reason why the effective-mass approximation, derived for wave packets constructed from infinite-periodic-systems' wave functions, works so well with nanoscopic structures, has been an enigma and a challenge for theorists. To explain and clarify this issue, we re-derive the effective-mass approximation in the framework of the theory of finite periodic systems, i.e., using energy eigenvalues and fast-varying eigenfunctions, obtained with analytical methods where the finiteness of the number of primitive cells per layer, in the direction of growth, is a prerequisite and an essential condition. This derivation justifies and explains why the effective-mass approximation works so well for nano-structures. We show also with explicit optical-response calculations that the rapidly varying eigenfunctions $Φ_{ϵ_{0}, η_{0}} (z)$ of the one-band wave functions…

Equations

$$M(z_{i+1},z_{i})=\begin{pmatrix}\alpha&\beta\\ \beta^{*}&\alpha^{*}\end{pmatrix}.$$

$$M_{n}=M^{n}=\begin{pmatrix}\alpha_{n}&\beta_{n}\\ \beta_{n}^{*}&\alpha_{n}^{*}\end{pmatrix},$$

$$p_{n}-(\beta^{-1}\alpha\beta+\alpha^{*})\,p_{n-1}+p_{n-2}=0,$$

$$\alpha_{n}=U_{n}-\alpha^{*}U_{n-1},\qquad \beta_{n}=\beta\,U_{n-1}.$$

$$\mathrm{Re}\,(\alpha_{n}e^{ikd})-\frac{k^{2}-q_{w}^{2}}{2q_{w}k}\,\mathrm{Im}\,(\alpha_{n}e^{ikd})-\frac{k^{2}+q_{w}^{2}}{2q_{w}k}\,\beta_{nI}=0.$$

$$\Psi_{\mu,\nu}^{qb}$$

$$\left(\frac{p^{2}}{2m}+V_{X}({\bf r})\right)\Phi^{X}({\bf r})=E\,\Phi^{X}({\bf r}),$$

$$\left(-\frac{\hbar^{2}}{2m}\Bigl(\frac{\partial^{2}}{\partial x^{2}}+\frac{\partial^{2}}{\partial y^{2}}\Bigr)+V_{X}^{C}(x,y)\right)\chi_{j}^{X}(x,y)=\varepsilon_{j}^{X}\chi_{j}^{X}(x,y),$$

$$\Phi^{X}({\bf r})=\sum_{i}\chi_{i}^{X}(x,y)\,\phi_{i}^{X}(z).$$

$$-\frac{\hbar^{2}}{2m}\frac{\partial^{2}}{\partial z^{2}}\phi_{j}^{X}(z)+\sum_{i=1}^{N_{X}}V_{ij}^{X}(z)\,\phi_{i}^{X}(z)=(E-\varepsilon_{j}^{X})\,\phi_{j}^{X}(z).$$

$$V_{ij}^{X}(z)=\int_{0}^{w_{x}}\!\!\int_{0}^{w_{y}}dx\,dy\,\chi_{j}^{X*}(x,y)\,V_{X}^{L}(x,y,z)\,\chi_{i}^{X}(x,y),$$

$$\left(\frac{p_{z}^{2}}{2m}+V_{X}(z)\right)\phi^{X}(z)=E^{X}\phi^{X}(z).$$

$$E_{g}^{X}=E_{1,1}^{X}-E_{2,n_{X}+1}^{X}\equiv E_{{\rm c},1}^{X}-E_{{\rm v},n_{X}+1}^{X},\qquad X=A,B.$$

$$\Phi_{\epsilon_{A},\kappa_{A}}^{\epsilon_{B},\kappa_{B}}(z)$$

$$\psi(z)=\sum_{\kappa_{0}^{A},\kappa_{0}^{B}}\langle\epsilon_{0},\kappa_{0}|\psi\rangle\,\Phi_{\epsilon_{0},\kappa_{0}}(z).$$

$$\left(\frac{p_{z}^{2}}{2m}+V_{SL}(z)\right)\psi(z)=E\,\psi(z),$$

$$V_{SL}(z)=H(-\zeta)\,V_{A}(z\,{\rm mod}\,[l_{c}])+H(\zeta)\,V_{B}(z\,{\rm mod}\,[l_{c}]),$$

$$\sum_{\kappa_{0}^{A},\kappa_{0}^{B}}\Bigl[H(-\zeta)\,E_{\epsilon_{0}^{A},\kappa_{0}^{A}}\,\delta_{\kappa_{0}^{A},\kappa_{0}^{A\prime}}+H(\zeta)\,E_{\epsilon_{0}^{B},\kappa_{0}^{B}}\,\delta_{\kappa_{0}^{B},\kappa_{0}^{B\prime}}\Bigr]\langle\epsilon_{0},\kappa_{0}|\psi\rangle=E\,\langle\epsilon_{0},\kappa_{0}|\psi\rangle.$$

$$E_{\epsilon_{0}^{B},\kappa_{0}^{B}}=E_{\epsilon_{0}^{A},\kappa_{0}^{A}}+V_{\kappa_{0}^{A},\kappa_{0}^{B}}=E_{\epsilon_{0}^{A},\kappa_{0}^{A}}+\langle\kappa_{0}^{A}|V_{\epsilon P}|\kappa_{0}^{B}\rangle,$$

$$E_{\epsilon_{0}^{A}}^{A}(\kappa_{0}^{A})\,\varphi^{\epsilon_{0}}(\kappa_{0}^{A})+\sum_{\kappa_{0}^{B}}\langle\kappa_{0}^{A}|V_{\epsilon P}|\kappa_{0}^{B}\rangle\,\varphi^{\epsilon_{0}}(\kappa_{0}^{B})=E\,\varphi^{\epsilon_{0}}(\kappa_{0}^{A}).$$

$$E_{\epsilon_{0}}^{A}\Bigl(-i\frac{\partial}{\partial z}\Bigr){\it\Psi}^{\epsilon_{0}}(z)+V_{P}(z)\,{\it\Psi}^{\epsilon_{0}}(z)=E\,{\it\Psi}^{\epsilon_{0}}(z).$$

$$\left[\frac{p_{z}^{2}}{2m_{\epsilon_{0}}^{*}}+V_{P}(z)\right]{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)=(E-E_{\epsilon_{0},\eta_{0}}^{A})_{\mu,\nu}\,{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z),$$

$$E_{\mu,\nu}=(E-E_{\epsilon_{0},\eta_{0}}^{A})_{\mu,\nu},$$

$$\left[\frac{p_{z}^{2}}{2m_{\epsilon_{0}}^{*}}+V_{P}(z)\right]{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)=E_{\mu,\nu}\,{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z),$$

$$\psi(z)\to{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)\,\Phi_{\epsilon_{0},\eta_{0}}(z)$$

$$\chi_{{}_{\Phi{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}f_{eh}\frac{\bigl|\langle\psi^{v}_{\rm f}|H_{\rm int}|\psi^{c}_{\rm i}\rangle\bigr|^{2}}{(\hbar\omega-E_{1,\nu}^{c}+E_{2^{\prime},\nu^{\prime}}^{v}+E_{B})^{2}+\Gamma^{2}}$$

$$\chi_{{}_{{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}f_{eh}\frac{\bigl|\langle{\it\Psi}^{v}_{2^{\prime},\nu^{\prime}}|H_{\rm int}|{\it\Psi}^{c}_{1,\nu}\rangle\bigr|^{2}}{(\hbar\omega-E_{1,\nu}^{c}+E_{2^{\prime},\nu^{\prime}}^{v}+E_{B})^{2}+\Gamma^{2}}$$

$$\chi_{{}_{\Phi{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}\phi_{v,\nu^{\prime}}^{c,\nu}(z_{0})\,\chi_{{}_{{\it\Psi}}}$$

Full text

Pedro Pereyra

Física Teórica y Materia Condensada, UAM-Azcapotzalco, C.P. 02200, Ciudad de México, México

Abstract

The reason why the effective-mass approximation works so well for nanoscopic structures has been an enigma and a challenge for theorists. To explain this issue, we re-derive the effective-mass approximation using, instead of the wave functions of infinite periodic systems and the ensuing continuous bands, the eigenfunctions and eigenvalues obtained in the theory of finite periodic systems, where the finiteness of the number of primitive cells in the nanoscopic layers is a prerequisite and an essential condition. This derivation justifies and shows why this approximation works so well for nano-structures. We also show, with explicit optical-response calculations, that the rapidly varying eigenfunctions $\Phi_{\epsilon_{0},\eta_{0}}(z)$ of the one-band wave functions $\Psi^{\epsilon_{0},\eta_{0}}_{\mu,\nu}(z)={\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)\Phi_{\epsilon_{0},\eta_{0}}(z)$ can be safely dropped in the calculation of inter-band transition matrix elements.

I Introduction

The effective-mass approximation (EMA) is, without a doubt, the most widely used approximation in theoretical calculations involving semiconductor structures. The formal justification of why this approximation, where the wave packets are constructed from infinite-periodic-system wave functions [Wannier; Slater; Luttinger; Altarelli; AltarelliLesHuches; Pollak; Dingle], works so well for finite micro- and nano-structures has been an enigma and a challenge for theorists. Despite the various guises of the EMA, the correct explanation has remained elusive. M. G. Burt, in a number of papers [Burt], critically analysed the drawbacks of the “conventional” EMA and tried to overcome them by providing a “new” envelope-function method, using again wave functions of infinite periodic systems. Now that the theory of finite periodic systems (TFPS) has evolved and has shown the ability to obtain the true, bona fide energy eigenvalues and eigenfunctions of periodic structures with a finite number of unit cells [Abeles; Erdos; Claro1982; Ricco; Vezzetti; Kolatas; Griffiths; Peisakovich; PereyraPRL; PereyraJPA; PereyraCastillo], it is worth reviewing and re-deriving the EMA within the TFPS to understand why it works so well. The purpose of this letter is to re-derive the effective-mass approximation taking the finiteness of the system and of its layers as the fundamental requisite.

Superlattices and layered structures are characterized by the simultaneous presence of two length scales: the atomic-size crystalline unit cells of the semiconductor layers and the layer widths. While the primitive-cell lengths are of the order of 0.5 nm, depending on the semiconductor, the layer widths are of the order of 5 nm, depending on the number of atomic cells per layer. This important difference in size is behind the factorization of the heterostructure wave function (HWF) into rapidly and slowly varying functions. The finiteness of the number $n_{X}$ of primitive cells, in the direction of growth, of layer $X$ (=A,B,…), and the finiteness of the number of layers in the heterostructure, or of the number of superlattice (SL) unit cells $n_{S}$, are not only obvious characteristics but also essential requisites in the TFPS.

II Finiteness of periodic layers. An outline of the TFPS

Soon after the semiconductor SLs were introduced [Keldysh1962; EsakiTsu1970] and the subband (or miniband) structures of direct and indirect band-gap semiconductors were experimentally and theoretically confirmed [Esaki1972; Dingle1974; Mukherji1975; Miller1976; Chang1977; SaiHalaszChang1978; Capasso1986; LuoFurdyna1990; Rauch1997; Petrov1997; Heer1998], Leo Esaki noticed that whereas in reality SLs contain a finite number of layers, with a finite number of atomic cells each, the standard theoretical approaches tacitly assume that SLs are infinite periodic structures with alternating layers that also contain an infinite number of atomic cells [EsakiLesHuches]. In fact, the HWF and SL wave functions are generally [Luttinger; Altarelli; Dresselhauss; Breitenecker; Sanders; Bastard1987NATO; Smith; Baraff] written as $\psi({\bf r})=\sum_{l}u_{n_{l}}({\bf r})f_{l}({\bf r})$, with $u_{n_{l}}({\bf r})$ the periodic part of the host-semiconductor Bloch function at band $n_{l}$, and $f_{l}({\bf r})\propto\exp[{i{\bf k}_{\perp}\cdot{\bf r}_{\perp}}]\chi_{l}(z)$ the envelope wave function, with ${\bf k}_{\perp}={\bf k}_{x}+{\bf k}_{y}$ the perpendicular wave number, generally assumed to be a constant of motion [Bastard1987NATO]. In the end, it is common to assume wave functions $\psi({\bf r})$ set up from wave functions $u_{n_{0}}$ of only one band, evaluated at the center of the Brillouin zone or at the subband edge ${\bf k}=0$. For SLs the envelope function is, again, written in terms of Bloch-type functions $\chi_{\mu}(z)=\exp(iqz)u_{\mu}(z)$, characterized by a subband index $\mu$ and a continuous wave number $q$ that is then artificially discretized via the cyclic boundary condition.

On the other hand, the theory of finite periodic systems has grown and has been generalized to include periodic structures with arbitrary potential profiles, an arbitrary but finite number $n$ of unit cells, and an arbitrary but finite number $N$ of propagating modes, for open, bounded and quasi-bounded periodic structures [PereyraPRL; PereyraJPA; PereyraCastillo; Pereyra2005]. The TFPS is based on the transfer-matrix properties and the rigorous fulfillment of continuity conditions, which make it possible to express the $n$-cell transfer matrix $M_{n}$ as $M^{n}$, where $M$, for time-reversal-invariant systems, is the single-cell transfer matrix of dimension $2N\times 2N$

$$M(z_{i+1},z_{i})=\begin{pmatrix}\alpha&\beta\\ \beta^{*}&\alpha^{*}\end{pmatrix}.$$

The accurate calculation of this matrix is crucial in this approach. The complex matrix functions $\alpha$ and $\beta$ depend strongly on the atomic or heterostructure potential profiles. The relation

$$M_{n}=M^{n}=\begin{pmatrix}\alpha_{n}&\beta_{n}\\ \beta_{n}^{*}&\alpha_{n}^{*}\end{pmatrix},$$

which was a source of errors in numerical calculations [Luque], has been rigorously transformed, after defining the matrix function $p_{n-1}=\beta^{-1}\beta_{n}$, into the matrix recurrence relation [PereyraPRL; PereyraJPA]

$$p_{n}-(\beta^{-1}\alpha\beta+\alpha^{*})\,p_{n-1}+p_{n-2}=0,$$

with analytic solutions. In the single mode approximation, of interest here, this relation becomes the recurrence relation of Chebyshev polynomials of the second kind $U_{n}$ , evaluated at the real part of $\alpha=\alpha_{R}+i\alpha_{I}$ . The $n$ -cell transfer matrix elements, $\alpha_{n}$ and $\beta_{n}$ , can straightforwardly be determined, through the simple relations

$$\alpha_{n}=U_{n}-\alpha^{*}U_{n-1},\qquad \beta_{n}=\beta\,U_{n-1}.$$
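The single-mode relations $\alpha_{n}=U_{n}-\alpha^{*}U_{n-1}$ and $\beta_{n}=\beta U_{n-1}$, with the Chebyshev polynomials evaluated at $\alpha_{R}$, can be checked numerically against the explicit product $M^{n}$. The following is a minimal sketch, not the author's code: the function names and the sample values of $\alpha$ and $\beta$ are hypothetical, and the check assumes a unimodular single-cell matrix, $|\alpha|^{2}-|\beta|^{2}=1$.

```python
import cmath
import math


def chebyshev_u(n, x):
    """Chebyshev polynomial of the second kind U_n(x), with U_{-1} = 0 and U_0 = 1."""
    if n < 0:
        return 0.0
    u_prev, u = 0.0, 1.0
    for _ in range(n):
        u_prev, u = u, 2.0 * x * u - u_prev
    return u


def n_cell_elements(alpha, beta, n):
    """alpha_n and beta_n from the closed-form Chebyshev relations."""
    x = alpha.real  # the polynomials are evaluated at Re(alpha)
    u_n, u_nm1 = chebyshev_u(n, x), chebyshev_u(n - 1, x)
    return u_n - alpha.conjugate() * u_nm1, beta * u_nm1


def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def n_cell_by_multiplication(alpha, beta, n):
    """alpha_n and beta_n from the explicit product M^n (reference calculation)."""
    m = [[alpha, beta], [beta.conjugate(), alpha.conjugate()]]
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n):
        result = matmul2(result, m)
    return result[0][0], result[0][1]


# Hypothetical single-cell elements chosen so that |alpha|^2 - |beta|^2 = 1.
beta = 0.7 + 0.3j
alpha = math.sqrt(1.0 + abs(beta) ** 2) * cmath.exp(0.4j)
closed = n_cell_elements(alpha, beta, 12)
direct = n_cell_by_multiplication(alpha, beta, 12)
```

The two results agree to rounding error because a unimodular 2x2 matrix with trace $2\alpha_{R}$ satisfies $M^{2}=2\alpha_{R}M-I$, which is the Chebyshev recurrence.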

The eigenvalues of any quasi-bounded (qb) periodic system defined between $z_{L}$ and $z_{R}$ (see figure 1), with $z_{0}-z_{L}=z_{R}-z_{n}=d/2$, can be obtained by solving the equation [Pereyra2005]

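$$\mathrm{Re}\,(\alpha_{n}e^{ikd})-\frac{k^{2}-q_{w}^{2}}{2q_{w}k}\,\mathrm{Im}\,(\alpha_{n}e^{ikd})-\frac{k^{2}+q_{w}^{2}}{2q_{w}k}\,\beta_{nI}=0.$$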

Here $q_{w}$ and $k$ are the wave numbers at the left (right) and right (left) of the discontinuity point $z_{L}$ ($z_{R}$), and $\beta_{nI}$ is the imaginary part of $\beta_{n}$. The eigenfunctions of the quasi-bounded superlattice are given by [Pereyra2005]

[TABLE]

with $a_{o}$ a normalization constant and $z$ any point in the $j+1$ cell. $\alpha_{j}$ , $\beta_{j}$ ,… are matrix elements of the transfer matrix $M_{j}(z_{j},z_{0})$ that connects the state vectors at points separated by exactly $j$ unit cells. $\alpha_{p}$ , $\beta_{p}$ … , where $p$ stands for part of a unit cell, are the matrix elements of the transfer matrix $M_{p}(z,z_{j})$ that connects the state vectors at $z_{j}$ and $z$ , for $z_{j}\leq z\leq z_{j+1}$ .

Our purpose here is to derive the effective mass approximation for the Schrödinger equation of a layered semiconductor heterostructure $A/B/C...$ , using the eigenvalues and eigenfunctions obtained in the TFPS. We will assume, without loss of generality, that our system is a binary structure $A/B/A...B/A$ , where the periodic semiconductor layers $A=(a_{A})^{n_{A}}$ and $B=(b_{B})^{n_{B}}$ contain $n_{A}$ and $n_{B}$ unit cells $a_{A}$ and $b_{B}$ , respectively, in the growing direction $z$ . We will show that the effective-mass approximation (EMA) can be derived when the heterostructure wave function $\psi(z)$ is written as the product $\Phi_{\epsilon_{0},\kappa_{0}}(z)$ ${\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)$ , where ${\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)$ is the envelope function and $\Phi_{\epsilon_{0},\kappa_{0}}(z)$ is the fast-varying function obtained in the TFPS, evaluated at the band-edges defined by the energy band index $\epsilon_{0}$ and the intra-band (or wave number) index $\kappa_{0}$ . In the particular case of periodic heterostructures, i.e. of SLs $(AB)^{n}=((a_{A})^{n_{A}}(b_{B})^{n_{B}})^{n}$ , the envelope functions are straightforwardly obtained in the EMA and the TFPS. It is worth emphasizing that since the transfer matrices are the matrix representation of the continuity and boundary conditions and the phase evolution of the quantum states, it is clear that the fast-varying and envelope wave functions, obtained in the TFPS, fulfill the continuity and boundary conditions. 
We will also show, for a specific example, that the optical response calculated with the matrix elements $\langle{\it\Psi}^{\epsilon^{\prime}_{0}}_{\mu^{\prime},\nu^{\prime}}\Phi^{A}_{\epsilon^{\prime},\kappa^{\prime}}(z)|H_{\rm int}|{\it\Psi}^{\epsilon_{0}}_{\mu,\nu}\Phi^{A}_{\epsilon,\kappa}(z)\rangle$ is practically the same as the optical response obtained with the matrix elements $\langle{\it\Psi}^{\epsilon^{\prime}_{0}}_{\mu^{\prime},\nu^{\prime}}|H_{\rm int}|{\it\Psi}^{\epsilon_{0}}_{\mu,\nu}\rangle$, where the fast-varying wave functions $\Phi^{A}_{\epsilon,\kappa}(z)$ are ignored.

III An alternative derivation of the effective-mass approximation

Suppose now that for each layer $X$ (with $X$ equal $A$ or $B$ ) we can write the one-particle Schrödinger equation

$$\left(\frac{p^{2}}{2m}+V_{X}({\bf r})\right)\Phi^{X}({\bf r})=E\,\Phi^{X}({\bf r}),\tag{11}$$

where the potential $V_{X}({\bf r})$ is periodic, at least in the growth direction $z$. To simplify this problem we can follow the confined-geometry method of Ref. [Bagwell] and the multichannel transfer-matrix method of Refs. [PereyraPRL] and [PereyraJPA]. If we assume that the transverse widths are $w_{x}$ and $w_{y}$, and we write the potential $V_{X}({\bf r})$ as the sum of a confining potential $V_{X}^{C}(x,y)$, which is infinite for $|x|>w_{x}/2$ or $|y|>w_{y}/2$, and a function $V_{X}^{L}(x,y,z)$ periodic in $z$, the orthonormal wave functions $\chi_{j}^{X}(x,y)$, which are solutions of

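$$\left(-\frac{\hbar^{2}}{2m}\Bigl(\frac{\partial^{2}}{\partial x^{2}}+\frac{\partial^{2}}{\partial y^{2}}\Bigr)+V_{X}^{C}(x,y)\right)\chi_{j}^{X}(x,y)=\varepsilon_{j}^{X}\chi_{j}^{X}(x,y),$$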

can be used to express the wave function $\Phi^{X}({\bf r})$ as

$$\Phi^{X}({\bf r})=\sum_{i}\chi_{i}^{X}(x,y)\,\phi_{i}^{X}(z).$$

If we substitute this function into the Schrödinger equation (11), multiply from the left by $\chi_{j}^{X*}(x,y)$ and integrate over $x$ and $y$, we obtain the set of coupled equations

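$$-\frac{\hbar^{2}}{2m}\frac{\partial^{2}}{\partial z^{2}}\phi_{j}^{X}(z)+\sum_{i=1}^{N_{X}}V_{ij}^{X}(z)\,\phi_{i}^{X}(z)=(E-\varepsilon_{j}^{X})\,\phi_{j}^{X}(z).\tag{14}$$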

Here $N_{X}$ is the number of propagating modes in layer $X$ , or the number of open channels (defined by the condition $E>\varepsilon_{j}^{X}$ ), and

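$$V_{ij}^{X}(z)=\int_{0}^{w_{x}}\!\!\int_{0}^{w_{y}}dx\,dy\,\chi_{j}^{X*}(x,y)\,V_{X}^{L}(x,y,z)\,\chi_{i}^{X}(x,y),$$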

are the channel-coupling matrix elements. In this way the 3D multichannel problem is reduced to a 1D multichannel problem. It was shown in Refs. [PereyraPRL] and [PereyraJPA], and mentioned before, that a general solution for the 1D multichannel periodic system can be obtained in terms of the matrix polynomials $p_{n}$ when the single-cell transfer matrix $M(z_{i+1},z_{i})$ is known. In actual semiconductor layers, the number of propagating modes depends on the Fermi energy and the cross section $w_{x}w_{y}$. When the multichannel problem for a specific semiconductor $X$ with $n_{X}$ unit cells is solved, one obtains the $N_{X}n_{X}$ energy eigenvalues $E^{X}_{\epsilon,\eta}$ (which determine the conduction and valence bands) and the corresponding eigenfunctions $\phi^{X}_{\epsilon,\eta}(z)$. In the widely used 1D one-channel approximation, with $V_{X}(z)=V_{11}^{X}(z)$, $E^{X}=E-\varepsilon_{1}^{X}$ and $\phi^{X}(z)=\phi_{1}^{X}(z)$, equation (14) becomes

$$\left(\frac{p_{z}^{2}}{2m}+V_{X}(z)\right)\phi^{X}(z)=E^{X}\phi^{X}(z).$$
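The open-channel condition $E>\varepsilon_{j}^{X}$ used above can be illustrated for the hard-wall confinement, whose transverse modes are those of a two-dimensional box, $\varepsilon_{n_x,n_y}=(\hbar^{2}\pi^{2}/2m)(n_x^{2}/w_x^{2}+n_y^{2}/w_y^{2})$. The sketch below counts the propagating modes for a given energy. It is only an illustration: the function names, the cutoff `n_max`, and the sample widths are hypothetical, and the bare electron mass is assumed unless a mass ratio is passed.

```python
import math

HBAR2_OVER_2M0 = 0.0380998  # hbar^2 / (2 m_e) in eV nm^2


def transverse_thresholds(w_x, w_y, mass_ratio=1.0, n_max=20):
    """Sorted hard-wall box energies eps_j (eV) for transverse widths given in nm."""
    prefactor = HBAR2_OVER_2M0 * math.pi ** 2 / mass_ratio
    return sorted(prefactor * ((nx / w_x) ** 2 + (ny / w_y) ** 2)
                  for nx in range(1, n_max + 1)
                  for ny in range(1, n_max + 1))


def open_channels(energy, w_x, w_y, mass_ratio=1.0, n_max=20):
    """Number of transverse modes with eps_j < energy (the open channels).

    Only modes with quantum numbers up to n_max are considered.
    """
    return sum(1 for eps in transverse_thresholds(w_x, w_y, mass_ratio, n_max)
               if eps < energy)
```

For a narrow cross section and low energy only the lowest mode is open, which is the regime where the one-channel approximation applies.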

In this limit, and given the periodic atomic potentials $V_{A}(z)$ and $V_{B}(z)$ in the semiconductor layers $A=(a_{A})^{n_{A}}$ and $B=(b_{B})^{n_{B}}$, one can obtain the unit-cell transfer matrices $M_{a}$ and $M_{b}$ and determine, applying the TFPS, the band structures $E^{A}_{\epsilon,\eta}$ and $E^{B}_{\epsilon,\eta}$ and, using Eq. (II), the eigenfunctions $\phi^{A}_{\epsilon,\eta}(z)$ and $\phi^{B}_{\epsilon,\eta}(z)$. A very good approximation for the atomic potentials $V_{A}(z)$ and $V_{B}(z)$ is given by the effective potentials of the Hartree-Fock approximation. The quantum numbers $\epsilon$ denote the bands, and the quantum numbers $\eta$ the intra-band energy levels. We will denote the conduction and the valence bands with $\epsilon$ =c=1 and $\epsilon$ =v=2, respectively. The intra-band energy levels correspond to $\eta=$ 1, 2, … , $n_{X}$ +1. In terms of these energies the fundamental energy gap in layer $X$ is given by

$$E_{g}^{X}=E_{1,1}^{X}-E_{2,n_{X}+1}^{X}\equiv E_{{\rm c},1}^{X}-E_{{\rm v},n_{X}+1}^{X},\qquad X=A,B.$$

with $E^{X}_{{\rm c},1}$ the first energy eigenvalue of the conduction band, i.e. the conduction band edge, denoted later as $E^{X}_{\epsilon_{0X}}$, and $E^{X}_{{\rm v},n_{X}+1}$ the last energy eigenvalue of the valence band, i.e. the upper edge of the valence band. As is well known, the band edges of layers $A$ and $B$ do not coincide in general (see figure 2), and their difference gives rise to the conduction- and valence-band split-offs, as well as to the piecewise-constant superlattice or heterostructure potential [Bastard1987NATO]. We will assume from here on that the semiconductor layers $A$ and $B$ are such that $E^{A}_{g}<E^{B}_{g}$. If the energies are below the barrier height ($E<V_{b}$), see the inset in figure 2, the eigenfunctions $\phi^{A}_{\epsilon,\kappa}(z)=\phi^{A}(z,E)|_{E=E^{A}_{\epsilon,\eta}}$ are propagating functions, while $\phi^{B}(z,E)|_{E=E^{A}_{\epsilon,\eta}}$ are evanescent [Pereyra2005].

For each value of the quantum number $\eta$ we have the corresponding wave number $\kappa_{\eta}$ . To keep some analogy with conventional notation, we can represent the energy eigenvalues $E_{\epsilon,\eta}$ as $E_{\epsilon,\kappa_{\eta}}$ or just as $E_{\epsilon,\kappa}$ , that can be written also as $E_{\epsilon}(\kappa)$ , keeping in mind that $\kappa$ is discrete.
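Because $\kappa$ is discrete, the band-edge curvature of $E_{\epsilon}(\kappa)$, and with it an effective mass in the usual sense $E\simeq E_{0}+\hbar^{2}\kappa^{2}/2m^{*}$, can be estimated from three neighbouring levels by a central finite difference. The sketch below assumes equally spaced $\kappa$ values around the band edge; the function name and the numbers used with it are illustrative only and do not come from the paper.

```python
HBAR2_OVER_2M0 = 0.0380998  # hbar^2 / (2 m_e) in eV nm^2


def effective_mass_ratio(e_minus, e_zero, e_plus, dk):
    """m*/m_e from three energies (eV) at kappa = -dk, 0, +dk (dk in 1/nm).

    Uses E'' = hbar^2 / m*, i.e. m*/m_e = 2 (hbar^2 / 2 m_e) / E''.
    """
    curvature = (e_plus - 2.0 * e_zero + e_minus) / (dk * dk)  # eV nm^2
    return 2.0 * HBAR2_OVER_2M0 / curvature
```

A negative result simply signals a band that curves downward, as at the upper edge of a valence band.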

It is clear that if we are able to determine the eigenvalues $E^{A,B}_{\epsilon,\kappa}$ and eigenfunctions $\phi^{A,B}_{\epsilon,\kappa}$, we are close to obtaining the full solution for the heterostructure or SL. Having the wave functions $\phi^{A,B}_{\epsilon,\kappa}$, we must still fulfill the continuity and boundary conditions at the interfaces of the layered structure. Although this task could, in principle, be accomplished, it is not so simple for these functions (as it is for the envelope functions), and it is not our purpose here. We will, instead, turn our attention to the derivation of the effective-mass approximation, based on the existence of the set of rapidly varying orthogonal functions $\phi^{A,B}_{\epsilon,\kappa}$.

To derive the EMA in the TFPS we need to expand the heterostructure or SL wave functions $\psi(z)$ in terms of the local wave functions $\phi^{A}_{\epsilon,\kappa}(z)$ and $\phi^{B}_{\epsilon,\kappa}(z)$, defined inside the layers $A$ and $B$, respectively. To simplify the discussion let us assume that we have the SL $(AB)^{n}A$. If $\zeta=z\,{\rm mod}\,[l_{c}]-a$, with $a$ the width of layer $A$ and $l_{c}=a+b$ the length of the SL unit cell, and $H(w)$ is the Heaviside function, we can write a rapidly varying wave function as (see figure 3)

[TABLE]
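The role of $\zeta$ and of the Heaviside factors is to select, at each position $z$, the layer-$A$ or the layer-$B$ quantity: $H(-\zeta)$ is nonzero inside $A$ ($z\,{\rm mod}\,l_{c}<a$) and $H(\zeta)$ inside $B$. A minimal sketch of this bookkeeping follows. The function names and the layer widths used with it are hypothetical, and the value of the step function exactly at the interface is an arbitrary convention chosen here.

```python
def zeta(z, a, b):
    """zeta = (z mod l_c) - a, with l_c = a + b the SL unit-cell length."""
    return (z % (a + b)) - a


def heaviside(w):
    """Heaviside step; the value at w = 0 is set to 0 here by convention."""
    return 1.0 if w > 0 else 0.0


def layered(z, f_a, f_b, a, b):
    """Evaluate H(-zeta) f_A(z mod l_c) + H(zeta) f_B(z mod l_c)."""
    z_cell = z % (a + b)
    zt = zeta(z, a, b)
    return heaviside(-zt) * f_a(z_cell) + heaviside(zt) * f_b(z_cell)
```

Passing two constant functions for `f_a` and `f_b` gives a sectionally constant periodic profile, while passing layer eigenfunctions gives a piecewise fast-varying function of the kind described above.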

As mentioned before, in the conventional derivations of the effective-mass approximation, the wave functions inside each layer are expanded in terms of the periodic parts of the band-edge Bloch functions, $u_{l,k_{0}}^{A}$ or $u_{l,k_{0}}^{B}$, which are generally assumed to be equal [Enderleinpg252; Bastardpg67]. In setting up the SL wave function $\psi(z)$, the assumptions of only one band and small k-vectors are also made [Enderleinpg252]. In the theory of finite periodic systems, the bands and wave functions $\phi^{A}_{\epsilon,\kappa}(z)$ and $\phi^{B}_{\epsilon,\kappa}(z)$ are the energy eigenvalues and the eigenfunctions of the periodic systems $(a_{A})^{n_{A}}$ and $(b_{B})^{n_{B}}$. In figure 4 we show a simplified calculation in the TFPS of the energy spectrum [simplified] and transmission coefficients for a specific (confined and open) semiconductor $A=(a_{A})^{n_{A}}$, with energy gap $E_{gA}\simeq$ 2.6 eV and unit-cell length $l_{A}$ = 5.15 nm. On the left-hand side of figure 4, we show the valence and the conduction bands (VB and CB) of the periodic sequence $(a_{A})^{n_{A}}$ bounded by cladding layers $C$, and, on the right-hand side, the transmission coefficients through the same semiconductor but open. At the top of the left-hand column, we also plot the subbands (or minibands) of the SL $(AB)^{n}$ for $E_{gA}\simeq$ 2.6 eV, $E_{gB}\simeq$ 2.9 eV, $l_{A}\sim l_{B}\sim$ 5.15 nm and $n$ =10. These graphs show that as the layer width $w_{A}=l_{A}n_{A}$ gets thinner, the energy-level separation, $\Delta E_{c}$, and the energy-level widths, $\Gamma E_{\mu}$, increase. On the other hand, it is known that whereas the energy gap $E_{gA}$ remains constant when the number of unit cells $n_{A}$ varies, the subbands of the superlattice $(AB)^{n}$, for a fixed barrier width $w_{B}$, move upwards with the band-edge energy level when $n_{A}$ decreases, and downwards when $n_{A}$, hence $w_{A}$, increases.
This behavior of the energy spectra justifies the one-band ‘ansatz’ and strengthens the relevance of the band-edge functions as the number of unit cells $n_{A}$ gets smaller. In the specific example of figure 4, the level width $\Delta\Gamma_{1}$ is of the order of the subband widths ($\sim$ 10 meV), and the energy-level separation for a semiconductor with $n_{A}\sim$ 5 ($w_{A}\sim$ 25 nm) is approximately 600 meV, which is much larger than the band split-offs in the conduction and valence bands of layers $A$ and $B$. Thus, in order to define the heterostructure or SL wave function $\psi(z)$ in terms of the envelope and the fast-varying functions, it is justified to adopt the band-edge and one-band assumptions. Therefore, we can consider the expansion

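$$\psi(z)=\sum_{\kappa_{0}^{A},\kappa_{0}^{B}}\langle\epsilon_{0},\kappa_{0}|\psi\rangle\,\Phi_{\epsilon_{0},\kappa_{0}}(z).\tag{20}$$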

Here and in the following, the quantum numbers $\epsilon_{0}$ and $\kappa_{0}$ represent the set $\epsilon^{A}_{0},\epsilon^{B}_{0}$ and $\kappa^{A}_{0},\kappa^{B}_{0}$ , respectively. For a simple and compact notation, we will denote the expansion coefficient $\langle\epsilon_{0},\kappa_{0}|\psi\rangle$ , known also as the envelope function, as $\varphi^{\epsilon_{0}}_{\kappa_{0}}(z)$ or $\varphi^{\epsilon_{0}}(\kappa_{0},z)$ . If we introduce the function $\psi(z)$ of Eq. (20) into the SL Schrödinger equation

$$\left(\frac{p_{z}^{2}}{2m}+V_{SL}(z)\right)\psi(z)=E\,\psi(z),$$

where

$$V_{SL}(z)=H(-\zeta)\,V_{A}(z\,{\rm mod}\,[l_{c}])+H(\zeta)\,V_{B}(z\,{\rm mod}\,[l_{c}]),$$

multiply by $\Phi_{\epsilon_{0},\kappa^{\prime}_{0}}(z)$ and integrate, we have

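$$\sum_{\kappa_{0}^{A},\kappa_{0}^{B}}\Bigl[H(-\zeta)\,E_{\epsilon_{0}^{A},\kappa_{0}^{A}}\,\delta_{\kappa_{0}^{A},\kappa_{0}^{A\prime}}+H(\zeta)\,E_{\epsilon_{0}^{B},\kappa_{0}^{B}}\,\delta_{\kappa_{0}^{B},\kappa_{0}^{B\prime}}\Bigr]\langle\epsilon_{0},\kappa_{0}|\psi\rangle=E\,\langle\epsilon_{0},\kappa_{0}|\psi\rangle.$$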

Since

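$$E_{\epsilon_{0}^{B},\kappa_{0}^{B}}=E_{\epsilon_{0}^{A},\kappa_{0}^{A}}+V_{\kappa_{0}^{A},\kappa_{0}^{B}}=E_{\epsilon_{0}^{A},\kappa_{0}^{A}}+\langle\kappa_{0}^{A}|V_{\epsilon P}|\kappa_{0}^{B}\rangle,$$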

the sectionally constant periodic potential $V_{P}(z)$ , known as the split off, appears here naturally as a consequence of the difference in the energy band structures of layers $A$ and $B$ , both in the conduction and valence bands. Therefore, we are left with

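$$E_{\epsilon_{0}^{A}}^{A}(\kappa_{0}^{A})\,\varphi^{\epsilon_{0}}(\kappa_{0}^{A})+\sum_{\kappa_{0}^{B}}\langle\kappa_{0}^{A}|V_{\epsilon P}|\kappa_{0}^{B}\rangle\,\varphi^{\epsilon_{0}}(\kappa_{0}^{B})=E\,\varphi^{\epsilon_{0}}(\kappa_{0}^{A}).$$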

We can now, as usual, multiply by $(1/\Omega)e^{i\kappa z}$ and sum the Fourier series to obtain

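$$E_{\epsilon_{0}}^{A}\Bigl(-i\frac{\partial}{\partial z}\Bigr){\it\Psi}^{\epsilon_{0}}(z)+V_{P}(z)\,{\it\Psi}^{\epsilon_{0}}(z)=E\,{\it\Psi}^{\epsilon_{0}}(z).$$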

If we further approximate $E_{\epsilon_{0}}(-i\partial/\partial z)$ by a quadratic function of $-i\partial/\partial z$ near the band edge, assuming that the k-vector at the edge is small, and introduce an effective mass $m^{*}_{\epsilon_{0}}$, defined as usual for each layer, we have

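$$\left[\frac{p_{z}^{2}}{2m_{\epsilon_{0}}^{*}}+V_{P}(z)\right]{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)=(E-E_{\epsilon_{0},\eta_{0}}^{A})_{\mu,\nu}\,{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z),$$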

with $\epsilon_{0}=c$ and $\eta_{0}$ =1 for the conduction band and $\epsilon_{0}=v$ and $\eta_{0}$ = $n_{A}$ +1 for the valence band. If we define the energy eigenvalues

$$E_{\mu,\nu}=(E-E_{\epsilon_{0},\eta_{0}}^{A})_{\mu,\nu},$$

measured from the band edges, we can write the Schrödinger equation in the effective mass approximation

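$$\left[\frac{p_{z}^{2}}{2m_{\epsilon_{0}}^{*}}+V_{P}(z)\right]{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)=E_{\mu,\nu}\,{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z),$$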

which is the equation we were looking for, and which has been used for SLs and heterostructures without a specific proof. As mentioned before, for SLs we can use the TFPS to solve this equation and to determine the eigenvalues $E_{\mu,\nu}$ and the eigenfunctions ${\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)$, known as envelope functions. It is worth noting that this derivation of the EMA does not require the layered structure to be periodic. Therefore, the EMA is valid for any layered heterostructure.
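As an illustration of how the effective-mass equation $[p_{z}^{2}/2m^{*}_{\epsilon_{0}}+V_{P}(z)]{\it\Psi}=E_{\mu,\nu}{\it\Psi}$ can be solved for an arbitrary layered profile, the sketch below uses a plain three-point finite-difference discretization with a Sturm-sequence bisection. This is a generic numerical technique swapped in for brevity, not the transfer-matrix method of the TFPS. It assumes hard walls at both ends of the structure and a single effective mass throughout (the text defines one per layer), and the function names and parameters are hypothetical.

```python
import math

HBAR2_OVER_2M0 = 0.0380998  # hbar^2 / (2 m_e) in eV nm^2


def count_below(diag, off_sq, x):
    """Sturm count: eigenvalues of a symmetric tridiagonal matrix that lie below x."""
    count, q = 0, 1.0
    for i, d in enumerate(diag):
        q = d - x - (off_sq / q if i > 0 else 0.0)
        if q == 0.0:
            q = 1e-30  # avoid a division by zero at the next step
        if q < 0.0:
            count += 1
    return count


def envelope_levels(potential, width, mass_ratio, n_grid=400, n_levels=3):
    """Lowest eigenvalues (eV) of p_z^2/2m* + V_P(z) on 0 < z < width (nm).

    potential is a callable V_P(z) in eV; hard walls are assumed at z = 0 and z = width.
    """
    h = width / (n_grid + 1)
    t = HBAR2_OVER_2M0 / (mass_ratio * h * h)  # hopping energy of the 3-point stencil
    diag = [2.0 * t + potential((i + 1) * h) for i in range(n_grid)]
    levels = []
    for k in range(1, n_levels + 1):
        lo, hi = min(diag) - 2.0 * t, max(diag) + 2.0 * t  # Gershgorin bounds
        while hi - lo > 1e-10:
            mid = 0.5 * (lo + hi)
            if count_below(diag, t * t, mid) >= k:
                hi = mid  # at least k eigenvalues lie below mid
            else:
                lo = mid
        levels.append(0.5 * (lo + hi))
    return levels
```

With a vanishing potential the routine reproduces the infinite-well levels $\hbar^{2}\pi^{2}n^{2}/2m^{*}L^{2}$ up to discretization error, which is a convenient sanity check before a sectionally constant $V_{P}(z)$ is supplied.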

All the assumptions behind this derivation imply that the wave functions $\psi(z)$ can be written as

$$\psi(z)\to{\it\Psi}_{\mu,\nu}^{\epsilon_{0}}(z)\,\Phi_{\epsilon_{0},\eta_{0}}(z)$$

with ${\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)$ the SL eigenfunctions (envelope functions) and $\Phi_{\epsilon_{0},\eta_{0}}(z)$ the rapidly oscillating wave functions. In figure 5 we plot the functions ${\it\Psi}^{c}_{1,1}(z)$ and $\Phi_{c,1}(z)$ of the conduction band, and the functions ${\it\Psi}^{v}_{2^{\prime},1^{\prime}}(z)$ and $\Phi_{v,1}(z)$ of the valence band. These functions can, in principle, be determined within the TFPS.

When dealing with transport properties, one can neglect the function $\Phi_{\epsilon_{0},\eta_{0}}(z)$; for calculations involving two bands, however, the whole wave function $\psi(z)$ should, in principle, be considered. We will now show that the fast-varying factor $\Phi_{\epsilon_{0},\eta_{0}}(z)$ can effectively be ignored in optical-response calculations.

IV On the redundancy of the fast-varying functions

To determine the effect of the rapidly oscillating factor $\Phi_{\epsilon_{0},\eta_{0}}(z)$ on the optical response, let us consider the blue-emitting $(In_{0.2}Ga_{0.8}N\backslash In_{0.05}Ga_{0.95}N)^{10}\backslash In_{0.2}Ga_{0.8}N$ superlattice studied in Refs. [NakamuraPaper] and [PereyraEPL]. We will calculate the optical response

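$$\chi_{{}_{\Phi{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}f_{eh}\frac{\bigl|\langle\psi^{v}_{\rm f}|H_{\rm int}|\psi^{c}_{\rm i}\rangle\bigr|^{2}}{(\hbar\omega-E_{1,\nu}^{c}+E_{2^{\prime},\nu^{\prime}}^{v}+E_{B})^{2}+\Gamma^{2}}\tag{32}$$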

taking into account the fast-varying functions $\Phi_{\epsilon_{0},\eta_{0}}(z)$, which means $\psi^{c}_{\rm i}$ = ${\it\psi}^{c,1}_{1,\nu}(z)$ = $\Phi_{c,1}(z){\it\Psi}^{c}_{1,\nu}(z)$ and $\psi^{v}_{\rm f}$ = ${\it\psi}^{v,n_{A}+1}_{2^{\prime},\nu^{\prime}}(z)$ = $\Phi_{v,n_{A}+1}(z){\it\Psi}^{v}_{2^{\prime},\nu^{\prime}}(z)$. These results are compared in figure 6 with the optical response

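$$\chi_{{}_{{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}f_{eh}\frac{\bigl|\langle{\it\Psi}^{v}_{2^{\prime},\nu^{\prime}}|H_{\rm int}|{\it\Psi}^{c}_{1,\nu}\rangle\bigr|^{2}}{(\hbar\omega-E_{1,\nu}^{c}+E_{2^{\prime},\nu^{\prime}}^{v}+E_{B})^{2}+\Gamma^{2}}\tag{33}$$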

calculated in Ref. [PereyraEPL], ignoring the fast-varying functions. As was shown in this reference and can be seen in figure 6, this optical response agrees extremely well with the experimental results in panel (c) [NakamuraPaper; PereyraEPL]. In (32) and (33), $\hbar\omega$ is the emitted photon energy, $E^{c}_{1,\nu}$ are the energy levels in the first subband of the CB, $E^{v}_{2^{\prime},\nu^{\prime}}$ the (heavy hole) energy levels in the second subband of the VB, $E_{B}$ the exciton binding energy, $f_{eh}$ the occupation probabilities and $\Gamma$ the level-broadening energy.

Besides the overall amplification, by a factor of $\simeq$ 2.4, our calculations show that the rapidly-varying functions have no effect on the optical spectrum.
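The structure of the two optical responses, a sum of Lorentzian terms weighted by $f_{eh}$ and by squared transition matrix elements, can be sketched as follows. The example only illustrates the statement above: if the fast-varying factor multiplies every matrix element by the same constant, the spectrum is rescaled as a whole and its peak positions are unchanged. All numerical values (levels, matrix elements, broadening) are hypothetical placeholders, not data from the paper, and the function name is illustrative.

```python
def optical_response(photon_energy, transitions, e_binding, gamma):
    """Sum over (nu, nu') of f_eh |M|^2 / ((hw - E_c + E_v + E_B)^2 + Gamma^2)."""
    total = 0.0
    for f_eh, matrix_element, e_c, e_v in transitions:
        detuning = photon_energy - e_c + e_v + e_binding
        total += f_eh * abs(matrix_element) ** 2 / (detuning ** 2 + gamma ** 2)
    return total


# Hypothetical transitions: (f_eh, matrix element, E_c, E_v), energies in eV.
envelope_only = [(1.0, 0.9, 1.60, -1.10), (0.6, 0.5, 1.65, -1.12)]
scale = 2.4 ** 0.5  # constant amplitude factor, so every intensity scales by 2.4
with_fast_factor = [(f, scale * m, ec, ev) for f, m, ec, ev in envelope_only]
```

Evaluating `optical_response` for both transition lists at any photon energy gives a ratio equal to the squared amplitude factor, independent of energy.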

According to the mean value theorem for definite integrals, the optical response $\chi_{{}_{\Phi{\it\Psi}}}$ in equation (32) can be written as

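$$\chi_{{}_{\Phi{\it\Psi}}}=\sum_{\nu,\nu^{\prime}}\phi_{v,\nu^{\prime}}^{c,\nu}(z_{0})\,\chi_{{}_{{\it\Psi}}}$$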

with $\phi^{c,\nu}_{v,\nu^{\prime}}(z_{0})$ a number, which in principle depends on the quantum numbers $\nu$ and $\nu^{\prime}$ . Specific calculations show that this factor is almost constant (see figure 7), and consistent with the differences in the numerical values of the optical responses $\chi_{{}_{\Phi{\it\Psi}}}$ and $\chi_{{}_{{\it\Psi}}}$ in figure 6.

V Conclusions

We have derived the effective-mass approximation for the Schrödinger equation of layered heterostructures, based on the energy eigenvalues and rapidly oscillating eigenfunctions obtained, for each layer, in the theory of finite periodic systems. This derivation, which is based on physical quantities of finite structures, explains why the EMA works so well when applied to this kind of system. We have also shown that, in order to calculate interband transition matrix elements, the rapidly oscillating wave functions $\Phi_{\epsilon_{0},\eta_{0}}(z)$, which should multiply the envelope functions ${\it\Psi}^{\epsilon_{0}}_{\mu,\nu}(z)$, can safely be ignored.

VI Acknowledgement

I acknowledge the useful comments of Herbert P. Simanjuntak.

Bibliography

(1) G. H. Wannier, Phys. Rev. 52, 191 (1937).
(2) J. M. Luttinger and W. Kohn, Phys. Rev. 97, 869 (1955).
(3) M. Altarelli and F. Bassani, in Handbook on Semiconductors, Vol. 1: Band Theory and Transport Properties, ed. W. Paul (North-Holland, Amsterdam, 1982), p. 269.
(4) M. Altarelli, in Heterojunctions and Semiconductor Superlattices: Proceedings of the Winter School, Les Houches, France, March 12-21, 1985, ed. Guy Allan and Gerald Bastard.
(5) F. H. Pollak and M. Cardona, J. Phys. Chem. 27, 423 (1966).
(6) R. Dingle, W. Wiegmann and C. H. Henry, Phys. Rev. Lett. 33, 827 (1974).
(7) J. C. Slater, Phys. Rev. 76, 452 (1949).
(8) M. G. Burt, J. Phys.: Condens. Matter 4, 6551490 (1992), and references therein.