Adiabatic corrections for velocity-gauge simulations of electron dynamics in periodic potentials
Vladislav S. Yakovlev, Michael S. Wismer

TL;DR
This paper introduces analytical corrections for velocity-gauge simulations that reduce computational complexity and resolve divergence issues in modeling electron dynamics in crystalline solids.
Contribution
The authors derive band-energy-dependent corrections that improve convergence and eliminate divergences in velocity-gauge electron dynamics simulations.
Findings
Significantly reduces the number of energy bands needed.
Overcomes divergence problems in susceptibility calculations.
Enhances accuracy and efficiency of light-matter interaction modeling.
Abstract
We show how to significantly reduce the number of energy bands required to model the interaction of light with crystalline solids in the velocity gauge. We achieve this by deriving analytical corrections to the electric current density. These corrections depend only on band energies, the matrix elements of the momentum operator, and the macroscopic vector potential. Thus, the corrections can be evaluated independently from modeling the interaction with light. In addition to improving the convergence of velocity-gauge calculations, our analytical approach overcomes the long-standing problem of divergences in expressions for linear and nonlinear susceptibilities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Adiabatic corrections for velocity-gauge simulations of electron dynamics in periodic potentials
Vladislav S. Yakovlev
Max-Planck-Institut für Quantenoptik, Hans-Kopfermann-Str. 1, 85748 Garching, Germany
Ludwig-Maximilians-Universität, Am Coulombwall 1, 85748 Garching, Germany
Michael S. Wismer
Max-Planck-Institut für Quantenoptik, Hans-Kopfermann-Str. 1, 85748 Garching, Germany
Abstract
We show how to significantly reduce the number of energy bands required to model the interaction of light with crystalline solids in the velocity gauge. We achieve this by deriving analytical corrections to the electric current density. These corrections depend only on band energies, the matrix elements of the momentum operator, and the macroscopic vector potential. Thus, the corrections can be evaluated independently from modeling the interaction with light. In addition to improving the convergence of velocity-gauge calculations, our analytical approach overcomes the long-standing problem of divergences in expressions for linear and nonlinear susceptibilities.
I Introduction
The velocity and length gauges are two most frequent choices for a theoretical description of the interaction between electromagnetic radiation and matter. Even though, in principle, all physical observables must be gauge-independent, the gauge choice is very important for numerical simulations because approximations and discretization errors violate the gauge invariance in its strict sense Scully and Zubairy (1997). In atomic and molecular physics, where quantum systems are finite, both gauges are easily implemented. The situation is different for modeling the interaction of light with bulk crystals, where an infinite periodic lattice potential and the degenerate energy states present difficulties for length-gauge implementations Aversa and Sipe (1995); Virk and Sipe (2007). These difficulties also pertain to the closely related approach of modeling quantum dynamics in the basis of Houston states Houston (1940), also known as accelerated Bloch states Krieger and Iafrate (1986). The fundamental problem here is that these methods require differentiation with respect to the crystal momentum, , while numerically evaluated Bloch states are, in general, not smooth functions of Marzari et al. (2012); Attaccalite and Grüning (2013). The problem becomes particularly severe for modeling strong-field phenomena, where electrons traverse a significant part of the Brillouin zone during a laser cycle Ghimire et al. (2011); Schubert et al. (2014); Luu et al. (2015); Wismer et al. (2016). Solutions to these problems are known Virk and Sipe (2007); Grüning et al. (2016), but they either do not ensure the periodicity with respect to Lindefelt et al. (2004) or require the evaluation of so-called covariant derivatives Nunes and Gonze (2001); Souza et al. (2004); Virk and Sipe (2007). Evaluating the covariant derivatives at each propagation step slows down computations. In contrast, the velocity gauge does not require differentiation with respect to the crystal momentum, which often makes it advantageous for modeling light-driven electron dynamics Yabana et al. (2012); Goncharov (2013); Simonsen et al. (2014); Wachter et al. (2014); Yakovlev et al. (2015); Chizhova et al. (2016). However, performing velocity-gauge simulations in a basis of Bloch states has a serious drawback: for convergence, they usually require many more energy bands than length-gauge or Houston-basis simulations Wu et al. (2015). In this paper, we propose a solution to this problem. We do so by deriving analytical corrections to the polarization response evaluated with a relatively small number of bands (section III). Although we derive our corrections in the limit of a weak external field, we show in section IV that they work surprisingly well even for strong optical fields.
II Velocity-gauge description of light-solid interaction
Let denote a Bloch state with band index and crystal momentum . The Bloch states are eigenstates of an unperturbed Hamiltonian . That is, , where is energy. Even though is a single-particle operator, it may represent an outcome of self-consistent many-body calculations. We describe the interaction with a classical electromagnetic field by adding an interaction operator, , to the Hamiltonian. Unitary transformations allow one to write in different forms. One of them is . Here, is the vector potential of the electromagnetic field, is the momentum operator, is the elementary charge, and is the electron mass. In SI units, the relation between and the electric field is . In the following, we assume the dipole approximation, where the dependence of on coordinate, , is neglected. The term can be eliminated by the unitary transformation . The results is the velocity-gauge form of the Hamiltonian:
[TABLE]
In this equation, the external field preserves the spatial periodicity of the lattice potential (which may or may not be local). Therefore, the Bloch theorem applies not only to , but also to . Consequently, transitions induced by the external field formally preserve the crystal momentum, which is one of the most important reasons why velocity-gauge simulations are so attractive for numerical calculations.
However, the convenience of preserving the crystal momentum comes at a price. This becomes apparent if we consider the interaction of a dielectric with a constant electric field () that is sufficiently weak to neglect interband tunneling. Even though, in the absence of free charge carriers, all physical observables are expected to take constant values, the velocity-gauge wave functions have a nontrivial dependence on time because the vector potential, , makes the Hamiltonian time-dependent. In this case, numerical approximations, such as truncating the basis, lead to a spurious divergence of the polarization response in the low-frequency limit Aspnes (1972); Moss et al. (1990); Sheik-Bahae et al. (1991); Sipe and Ghahramani (1993); Aversa and Sipe (1995). This is one of the underlying reasons why velocity-gauge calculations require many more bands than length-gauge or Houston-basis ones.
Another, closely related, problem with the velocity gauge can be understood by considering a particular Bloch state exposed to a weak long half-cycle of the electric field. Because the field performs only a half of the oscillation, the vector potential has a nonzero value at the end of the pulse. In the adiabatic limit, the pulse transforms an initial state into another Bloch state, , provided that the -th band is nondegenerate along the reciprocal-space path prescribed by the acceleration theorem: . This adiabatic intraband motion is guaranteed in length-gauge and Houston-basis calculations, making these approaches a good choice when only ad hoc transition matrix elements are available Apalkov and Stockman (2012); Schiffrin et al. (2013). In contrast, the correct adiabatic behavior is not explicitly encoded in the velocity-gauge equations of motion—it is implicit in the parameters of the model, demanding accurate energies and matrix elements.
The only matrix elements required in the velocity-gauge simulations are those of the momentum operator:
[TABLE]
where the integration is performed over a unit cell, is the cell volume, and are Bloch functions in the coordinate representation. Indeed, let us search for time-dependent wave functions using the following ansatz:
[TABLE]
Inserting this ansatz into the time-dependent Schrödinger equation (TDSE), , and assuming that , we obtain a system of coupled differential equations for the probability amplitudes:
[TABLE]
In these equations, the medium properties are fully defined by and . Once the probability amplitudes are evaluated, the electric current density can be obtained according to
[TABLE]
Here, is the Fermi factor of band in the initial (stationary) state. The summation in Eq. (5) is carried out over all the bands that contain electrons in the unperturbed state (these would be valence bands for a dielectric), the integration is performed over the first Brillouin zone (BZ), and the contribution from crystal momentum in initial band is given by
[TABLE]
In this equation, we have added a superscript to the probability amplitudes in order to denote the initial band: . That is, in each initial state, only one band is occupied; by adding the contributions from all the initial states, as prescribed by Eq. (5), we model a solid with occupied valence bands. Once is evaluated, the polarization response, which is the key quantity in linear and nonlinear optics, is readily given by King-Smith and Vanderbilt (1993); Resta (1998)
[TABLE]
III Velocity-gauge corrections
In the previous section, we argued that velocity-gauge simulations tend to violate adiabaticity where adiabaticity is expected, which is why they demand high-quality band energies and matrix elements. In this section, we derive analytical corrections that alleviate this issue in the case where a finite number of energy bands is the main source of discretization errors. In the next section, we will demonstrate that our corrections work well even if the optical response is far from being adiabatic (that is, the polarization (t) is not a function of the electric field at time ). In spite of that, we derive the corrections by considering an adiabatic limit. It is worth clarifying this point before we start the derivation. If velocity-gauge simulations require bands to produce physically meaningful results, while length-gauge simulations require only bands, then the bands that are unnecessary in the length gauge serve a rather numerical than physical purpose. They are required in the velocity gauge to ensure that the basis set is reasonably complete. After the interaction with an external field, the occupation of these highly excited states is negligibly small. Even if the polarization response due to “physically relevant” bands is very nontrivial, the contributions due to very high conduction bands (and very low valence ones) are expected to be, in some sense, “simple”. We argue that this simplicity consists in the adiabaticity with respect to the vector potential: the difference between the exact current density, , and that evaluated with a finite number of bands, is, approximately, a function of the vector potential, , at time . In this case, one may use any function that is particularly well suited for analytical calculations. We chose
[TABLE]
where is a constant vector (it is complex-valued unless is linearly polarized), is a carrier-wave frequency, and is a small parameter that controls how slowly the field is turned on. The next step will be the evaluation of the current density using time-dependent perturbation theory, where we only consider a time interval where the external field may be viewed as a perturbation. Once this step is accomplished, we will take the adiabatic limit: . Here comes our key idea. When the amplitude of the vector potential, , is fixed and its frequency is decreased, then the electric field decreases as well; consequently, we expect for a dielectric and, more generally,
[TABLE]
if there are partially occupied states in the unperturbed solid. The adiabatic intraband current density, , is produced by charge carriers that change their crystal momentum according to the acceleration theorem, without leaving their initial bands:
[TABLE]
Here, it is sufficient to carry out the sum over partially occupied bands. Indeed, the contribution from empty bands is obviously zero, while for a fully occupied band, where , the integral can be shown to be zero: .
In the velocity gauge, the equations of motion do not automatically satisfy Eq. (9). We will soon see that this equation is, in general, violated by truncating the basis set, which is an unavoidable operation for numerical calculations. However, knowing that Eq. (9) holds in the ideal case, we arrive at a simple recipe for adiabatic corrections to the electric current density evaluated with a finite number of bands:
[TABLE]
In the adiabatic limit, is a function of . In the rest of this section, we will be searching for a decomposition of into the powers of the vector potential. As a first step, we decompose the intraband adiabatic current:
[TABLE]
The first two terms in this decomposition are
[TABLE]
The other terms are straightforward to obtain by continuing the Taylor expansion.
We proceed with a detailed derivation of the first-order correction to the current density. In first-order perturbation theory, the solution of Eq. (4) with an initial condition is
[TABLE]
where we have introduced transition frequencies:
[TABLE]
From now on, we will omit, for brevity, in the arguments of the transition frequencies and momentum matrix elements. Substituting Eq. (15) into Eq. (6), neglecting the terms quadratic with respect to , reintroducing by substituting with , and taking the limit , we obtain
[TABLE]
Before we take the limit , we notice that those terms under the sum where vanish in this limit. In the rest of this paper, the notation will imply that not only the term but also all other terms where , if present, must be omitted (otherwise, a zero would appear in the denominator). With this in mind, we take the limit , use Eq. (5), and write the sum of the zero- and first-order approximations to the adiabatic current density as
[TABLE]
The first term on the right-hand side of this expression, representing the electric current that flows without external field, is identical with the zero-order expansion term of the intraband adiabatic current, Eq. (13). The two terms cancel when we evaluate according to Eq. (11). In the rest of Eq. (17), we recognize the Thomas-Reiche-Kuhn sum rule in the form applicable to solids Luttinger and Kohn (1955). For our purposes, it is convenient to write this rule as
[TABLE]
where is a Cartesian coordinate, and is a unit vector along the corresponding coordinate axis. Remembering Eq. (14), we see that if the summation in Eq. (17) is performed over the entire infinite manifold of bands, then the requirement encoded in Eq. (9) leads to the Thomas-Reiche-Kuhn sum rule. If we truncate the basis set, we violate the sum rule.
Following Eq. (11), we obtain the following first-order correction to the electric current density obtained by solving the TDSE numerically with a finite number of bands:
[TABLE]
Here and in the following, summation is performed only over those bands that was evaluated with. As a reminder that the basis is truncated, we have replaced with .
For a dielectric, where , for valence bands, and for conduction bands, we can write Eq. (19) as
[TABLE]
Here, is the number of valence bands in a simulation to which the correction is applied; we denote this set of bands as “VB”. To emphasize that we consider the case where all the valence bands are initially occupied, while the conduction bands are empty, we have replaced with . The summation over goes over all the valence and conduction bands, except band and those terms where ; however, it is easy to see that the terms with cancel.
For dielectrics with a crystal symmetry that demands that the electric current must flow along the laser polarization (e.g. cubic symmetry), Eq. (20) is equivalent to substituting the actual number of valence bands, , with an effective one, . Indeed, by multiplying both sides of Eq. (20) with and relying on , we obtain
[TABLE]
where is a unit vector pointing along . Comparing this expression with Eq. (6), we see that our correction gives the same effect as substituting, in Eq. (6), with , where
[TABLE]
The concept of an effective number of valence electrons, , is well known Philipp and Ehrenreich (1963); Smith and Shiles (1978); Wooten (2013), but it is not as widely known that can be used to adjust the polarization response obtained with an insufficient number of bands. Also, in the next section, we will demonstrate that the applicability of this concept is not limited to the linear response—Eq. (19) works surprisingly well even if the polarization response is very nonlinear.
Note that does not depend on the field magnitude; consequently, our corrections retain their importance in the weak-field limit. In other words, no matter how weak the electric field is, one has to use a sufficient number of bands to either satisfy the Thomas-Reiche-Kuhn sum rule or be able to compensate for its violation.
The derivation of higher-order corrections is similar to the one presented above, but it is much more laborious. Therefore, we only present the final results. For the second-order correction, we get
[TABLE]
This expression necessarily equals zero if is symmetric with respect to time reversal and .
Finally, for the third-order correction, we obtain
[TABLE]
Even though it is a rather lengthy expression, we will demonstrate that it may be very useful.
IV A one-dimensional demonstration of the method
We derived our adiabatic corrections in three spatial dimensions; nevertheless, we demonstrate their power by means of one-dimensional simulations. On the one hand, it is much easier to control and achieve numerical convergence in one-dimensional calculations; on the other hand, the equations derived in the previous section become more manageable once we replace all vector quantities with scalar ones. Also, we further simplify the equations by considering a dielectric and eliminating . We write our corrections to the current density as an expansion in powers of the vector potential:
[TABLE]
In Eqs. (5), (19), (22), and (23), we replace with , substitute scalar quantities for the vector ones, and, after some simplifications, obtain
[TABLE]
[TABLE]
and
[TABLE]
These coefficients depend only on the stationary electronic structure of the solid, and they do not depend on the laser pulse. Also, evaluating the coefficients, we have dropped the adiabatic intraband current density, which is supposed to be zero for dielectrics but may acquire nonzero values when the integral in Eq. (10) is evaluated numerically. We found that these values were so small that accounting for them had a negligible effect on the results presented in this section.
In the following, we use atomic units () unless specified otherwise. For our numerical demonstration, we employ a stationary Hamiltonian, , with a noncentrosymmetric lattice potential:
[TABLE]
Here, Å is the lattice constant. Evaluating the band structure, we Fourier-expanded into 81 plane waves. We regard the lowest two energy bands of this potential as the valence ones. The energy gap between the second and the third bands is equal to eV, which is close to the band gap of quartz. is symmetric with respect to time reversal; therefore, up to round-off errors, in Eq. (26) evaluates to zero.
We numerically solved the TDSE for 61 uniformly spaced crystal momenta using only those stationary states that had an energy below a certain level. We measure this level from the bottom of the lowest conduction band and refer to it as the cut-off energy, . In other words, the subset of eigenstates of that we use to solve the TDSE consists of all the Bloch states with energies .
We used the following expression for the vector potential of the external field:
[TABLE]
Here, is the amplitude of the electric field, is the central frequency of the laser pulse, is the Heaviside step function, and is related to the full width at half maximum (FWHM) of by
[TABLE]
We chose the central wavelength of the laser pulse to be 750 nm ( at. u.; eV) and set the FWHM to 4 fs ( at. u.).
In Fig. 1, we compare electric current densities evaluated in different approximations for two peak values of the electric field: V/Å for panel (a) and V/Å for panel (b). The spectra of these current densities are compared in Fig. 2. We multiplied the current density with a constant factor (0.96333) that made the refractive index at the central laser wavelength equal to 1.47, which is the refractive index of thin-film fused silica at 750 nm Gao et al. (2013). This procedure allows us to label the -axis of Fig. 1 with units of .
The current density labeled as “accurate” was obtained with eV, which corresponds to using 40 energy bands. We verified that, for this large value of , corrections are unnecessary, so we did not apply them. The other results in Fig. 1 were evaluated for eV, which corresponds to using just the lowest three conduction bands of our one-dimensional model. Without corrections, the velocity gauge fails dramatically for this low value of . However, already the first-order correction, specified by Eqs. (24) and (25), greatly improves the accuracy. In Fig. 1(a), the curves representing the accurate and corrected current densities are virtually indistinguishable. When we increase the peak electric field to V/Å, we see significant deviations between the accurate (thick gray curve) and that obtained in the simulation with eV, followed by applying the first-order correction (solid blue curve). These deviations disappear as soon as we add the third-order correction to the first-order one (red dashed curve).
In the frequency domain, the first-order correction only affects the spectral range at the central laser frequency, while the third-order correction affects the spectral components at and . This is clearly seen in Fig. 2, where we also observe that high frequencies () are much less demanding to the number of bands than the low frequencies are. Note that the current density in Figs. 1(b) and Figs. 2(b) shows signatures of highly nonlinear processes. In particular, the fast oscillations in Figs. 1(b) emerge due to the multiphoton absorption Korbman et al. (2013) (in our case, the ratio of the band gap to the photon energy is equal to 5.4). On average, electrons per unit cell are excited to conduction bands by the pulse with V/Å; this value grows up to for V/Å. For , the three curves obtained with eV coincide because the vector potential is zero, and so are the correction terms. The fact that these curves are very close to our most accurate current density means that a negligible number of electrons is excited to energies above . This is expected because bands that carry residual population are responsible for processes more complex than those that our adiabatic corrections account for.
We quantify the discrepancy between an accurate current density, , and an approximate one, , using the following measure:
[TABLE]
Figure 3 shows how depends on .
For the moderately strong peak field equal to V/Å, already the first-order correction decreases by two orders of magnitude in the range of cut-off energies between 25 eV and 200 eV. The third-order correction decreases the discrepancy even further. The improvement due to the analytical corrections is not as dramatic when the laser pulse has a peak field as large as V/Å, but we see that the third-order correction is particularly useful in this case. For example, without corrections, we get at eV; with the first-order corrections, this level of discrepancy is first reached at eV, while the third-order correction provides this value of already at eV.
Figure 4 gives more details on how the discrepancy depends on the peak electric field. In this figure, we represent using false colors.
Panel (a) shows the result of applying the first-order correction. For panel (b), we corrected the current density using both and . This figure confirms the usefulness of the third-order correction. In both panels, we observe the same general trend: a stronger laser field requires a larger value of . However, when the third-order correction is used, the dependence is not monotonous. We also notice that the local minima of in Fig. 4(b) tend to maintain their positions as is increased.
V Conclusions and outlook
We have shown how to significantly reduce the number of bands required by velocity-gauge simulations performed in a stationary basis of Bloch states. The original motivation for this work was to speed up modeling of highly nonlinear ultrafast phenomena that take place when intense few-cycle laser pulses interact with crystalline solids. However, our method is very general, and it may find other applications. In particular, it can improve real-time methods of evaluating linear and nonlinear optical susceptibilities (the results presented here are only relevant to , , and , but higher-order corrections can be derived as well).
Equations (19), (22), and (23) present the main results of this paper. The key idea of our approach was to consider the limit where the frequency of the field, , approaches zero, while the amplitude of the vector potential is fixed. In this limit, the electric field becomes infinitesimally weak, which is a significant difference between our approach and that used to analyze the polarization divergence Aspnes (1972); Sipe and Ghahramani (1993), where the low-frequency limit was applied with a fixed amplitude of the electric field, which made the vector potential grow indefinitely.
Our approach thrives on the fact that the adiabatic approximation works well for those contributions to the polarization response that involve highly excited stationary states, where the excitation energy is much larger than . Hence, our method works best when is small. We have demonstrated excellent performance for a near-infrared laser pulse interacting with a model dielectric.
Acknowledgements
The authors thank S. Yu. Kruchinin for his helpful remarks. Supported by the DFG Cluster of Excellence: Munich-Centre for Advanced Photonics. M. S. W. was supported by the International Max Planck Research School of Advanced Photon Science (IMPRS-APS).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Scully and Zubairy (1997) M. Scully and M. Zubairy, Quantum Optics (Cambridge University Press, 1997) Chap. 5.
- 2Aversa and Sipe (1995) C. Aversa and J. Sipe, Phys. Rev. B 52 , 14636 (1995) . · doi ↗
- 3Virk and Sipe (2007) K. S. Virk and J. E. Sipe, Phys. Rev. B 76 , 035213 (2007) . · doi ↗
- 4Houston (1940) W. V. Houston, Phys. Rev. 57 , 184 (1940) . · doi ↗
- 5Krieger and Iafrate (1986) J. B. Krieger and G. J. Iafrate, Phys. Rev. B 33 , 5494 (1986) . · doi ↗
- 6Marzari et al. (2012) N. Marzari, A. A. Mostofi, J. R. Yates, I. Souza, and D. Vanderbilt, Rev. Mod. Phys. 84 , 1419 (2012) . · doi ↗
- 7Attaccalite and Grüning (2013) C. Attaccalite and M. Grüning, Phys. Rev. B 88 , 235113 (2013) . · doi ↗
- 8Ghimire et al. (2011) S. Ghimire, A. D. Di Chiara, E. Sistrunk, P. Agostini, L. F. Di Mauro, and D. A. Reis, Nature Phys. 7 , 138 (2011).
