Qubit parametrization of the variational discrete action theory for the multiorbital Hubbard model
Zhengqian Cheng, Chris A. Marianetti

TL;DR
This paper develops a qubit parametrization of the variational discrete action theory (VDAT) at =3 for the multiorbital Hubbard model, enabling efficient ground state calculations and analytical insights into Mott and Hund physics.
Contribution
It introduces a qubit-based variational approach for VDAT at =3, improving computational efficiency and analytical understanding of complex multiorbital systems.
Findings
Successfully applied to models with 2-7 orbitals.
Derived analytical expressions for critical U in large N_{orb} limit.
Demonstrated equivalence to slave spin mean-field theory for certain variants.
Abstract
The variational discrete action theory (VDAT) at \mathcal{N}=3 is a potent tool for accurately capturing Mott and Hund physics at zero temperature in d=\infty at a cost comparable to the Gutzwiller approximation, which is recovered by VDAT at \mathcal{N}=2. Here we develop a qubit parametrization of the gauge constrained algorithm of VDAT at \mathcal{N}=3 for the multiorbital Hubbard model with general density-density interactions. The qubit parametrization yields an explicit variational trial energy, and the variational parameters consist of the momentum density distribution, the shape of a reference fermi surface, and the pure state of a qubit system with dimension of the local Hilbert space. To illustrate the power of the qubit parametrization, we solve for the ground state properties of the multiorbital Hubbard model with Hund coupling for local orbital number N_{orb}=2-7. A Taylor…
| (L) | |||
| (HF) |
| (L) | |||||
| (HF) |
| Functional | |
|---|---|
| HF | |
| MBB | |
| power | |
| CA | |
| CGA | |
| HF | |
| MBB | |
| power | |
| CA | |
| CGA |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgebraic structures and combinatorial models · Black Holes and Theoretical Physics · Advanced Topics in Algebra
Qubit parametrization of the variational discrete action theory for
the multiorbital Hubbard model
Zhengqian Cheng and Chris A. Marianetti
Department of Applied Physics and Applied Mathematics, Columbia University, New York, NY 10027
(August 18, 2025)
Abstract
The variational discrete action theory (VDAT) at is a potent tool for accurately capturing Mott and Hund physics at zero temperature in at a cost comparable to the Gutzwiller approximation, which is recovered by VDAT at . Here we develop a qubit parametrization of the gauge constrained algorithm of VDAT at for the multiorbital Hubbard model with general density-density interactions. The qubit parametrization yields an explicit variational trial energy, and the variational parameters consist of the momentum density distribution, the shape of a reference fermi surface, and the pure state of a qubit system with dimension of the local Hilbert space. To illustrate the power of the qubit parametrization, we solve for the ground state properties of the multiorbital Hubbard model with Hund coupling for local orbital number . A Taylor series expansion of the partially optimized trial energy is used to explain how the Hund’s coupling changes the order of the Mott transition. For the case of the Hubbard model, an explicit approach for computing the critical for the Mott transition is provided, yielding an analytical expression for in the large limit. Additionally, we provide an analytical solution for the ground state properties of the single band Hubbard model with a special density of states. Finally, we demonstrate that the qubit parametrization can also be applied to , for both G-type and B-type variants, where the G-type yields an identical expression to the slave spin mean-field theory. The qubit parametrization not only improves the efficiency and transparency of VDAT at , but also provides the key advances for the construction of a one-body reduced density matrix functional capable of capturing Mott and Hund physics.
I Introduction
The multiorbital Hubbard model is a minimal model of interacting electrons which accounts for key elements of typical strongly correlated electron systems Imada19981039 ; Fernandes2017014503 , and the solution in infinite dimensions provides a natural starting point for understanding the solution in finite dimensions Georges199613 ; Kotliar2006865 . The de facto standard for accurately solving the multiorbital Hubbard model in is the dynamical mean-field theory (DMFT) Georges199613 ; Kotliar200453 ; Kotliar2006865 ; Vollhardt20121 , which requires self-consistently solving an effective Anderson impurity model. Therefore, the proficiency of DMFT is determined by the state-of-the-art for quantum impurity solvers Gull2011349 , which are still extremely limited for the multiorbital case at zero temperature (see introduction of Ref. Cheng2022205129 ). Alternatively, the standard for efficiently solving the multi-orbital Hubbard model is the Gutzwiller approximation (GA), which semi-quantitatively describes the Fermi liquid phase and the metal-insulator transition, but only produces a crude description of the insulating phase Lu19945687 ; Okabe19972129 ; Bunemann19974011 ; Bunemann199829 ; Bunemann2007193104 ; Lanata2008155127 . A long standing goal has been to improve upon GA while not incurring the computational cost of DMFT. Fortunately, the variational discrete action theory (VDAT) provides an alternate route to the exact ground state properties in Cheng2021195138 ; Cheng2021206402 ; Cheng2022205129 ; Cheng2023035127 . The variational ansatz of VDAT is the sequential product density matrix (SPD), which is characterized by an integer and as either G-type or B-type, where G-type recovers the Hartree-Fock wave function, G-type recovers the Gutzwiller wave function, and recovers the exact solution. A remarkable fact is that the G-type SPD accurately captures Mott and Hund physics in Cheng2022205129 ; Cheng2023035127 , while maintaining a computational cost comparable to the Gutzwiller wave function.
An SPD with arbitrary can be exactly evaluated in using the self-consistent canonical discrete action theory (SCDA) Cheng2021195138 ; Cheng2021206402 . There are currently three algorithms for executing VDAT within the SCDA, with the first two being completely general and the third one having some restrictions. The first approach is the most straightforward, requiring iterations over two steps Cheng2021206402 . In the first step, given the variational parameters, the SCDA equations can be self-consistently solved, yielding the energy for the corresponding SPD. In the second step, the variational parameters must be updated to minimize the energy. This straightforward approach was successfully executed on the single band Hubbard model in . The second approach is the decoupled minimization algorithm Cheng2022205129 , where one simultaneously minimizes the variational parameters and updates the SCDA equations towards self-consistency. This decoupled minimization algorithm was executed for the G-type SPD’s in the two orbital Hubbard model with the full rotationally invariant local interactions, demonstrating that accurately describes Mott and Hund physics over all parameter space. The third approach is the gauge constrained algorithm Cheng2023035127 , which was originally applied to the case of a G-type and SPD with kinetic projectors that are diagonal in momentum space and interacting projectors that do not introduce off-diagonal terms at the single-particle level. The key feature of the gauge constrained algorithm is that the SCDA self-consistency can be automatically satisfied, greatly simplifying the task of minimizing over the variational parameters. Furthermore, the momentum density distribution is used to reparametrize the variational parameters in the kinetic projectors, and the trial energy can be straightforwardly optimized over the momentum density distribution. It is useful to note that the GA can only evaluate a G-type SPD under a Gutzwiller gauge Cheng2023035127 . The SCDA has no such restrictions, which is formally appealing, but imposing certain gauge constraints allows the SCDA self-consistency to be automatically satisfied, while not reducing the variational capacity.
One shortcoming of the original gauge constrained algorithm for the G-type SPD is that some of the variational parameters are unintuitive and there are two linear constraints on the momentum density distribution per spin-orbital Cheng2023035127 . In this paper, we introduce a new parametrization of the gauge constrained algorithm, referred to as the qubit parametrization, which is mathematically equivalent to the original gauge constrained algorithm and offers several important advantages. First, there is only one linear constraint per spin-orbital on the momentum density distribution, and therefore the number of variational parameters is reduced by one per spin-orbital. Second, the variational parameters are physically intuitive and facilitate a deeper understanding of how the SPD captures Mott and Hund physics. Therefore, the qubit parametrization achieves the long sought goal of resolving the shortcomings of the Gutzwiller approximation while maintaining the computational simplicity and physical appeal. The convenience of the qubit parametrization has allowed for the construction of a one-body reduced density matrix functional for the multi-orbital Hubbard model which exactly encapsulates the VDAT result at , and this is presented in a companion manuscript companion .
It is useful to compare to existing approaches in the literature which seek to achieve our same goal of efficiently and accurately solving the ground state properties of multiorbital Hubbard models. One approach is the ghost Gutzwiller approximation (gGA) Lanata2017195126 , which introduces the Gutzwiller wave function in an extended Hilbert space where the GA is then applied, though this comes with a substantial increase in computational cost over the usual GA given that ghost orbitals are introduced. In , the gGA exactly evaluates the Gutzwiller wave function in the extended Hilbert space, and therefore provides an upper bound on the exact energy with increasing accuracy as the extended Hilbert space grows. The gGA has not been proven to converge to the exact solution of the multiorbital Hubbard model with increasing number of ghost orbitals, though numerical evidence suggests that this might be the case Lee2023L121104 ; Lee2023245147 . Empirically, it has been demonstrated that at least two ghost orbitals per physical orbital are needed for a global improvement over the GA Lanata2017195126 ; Lee2023245147 ; Lee2023L121104 , which substantially increases the computational cost of the gGA relative to GA. Therefore, the important practical comparison between VDAT and gGA contrasts VDAT at to gGA with two ghost orbitals per physical orbital. VDAT at has been straightforwardly applied up to the eight orbital Hubbard model Cheng2023035127 , while the gGA has been applied up to the three orbital model MejutoZaera2023235150 ; Lee2023245147 ; Giuli2025020401 using exact diagonalization and the five orbital model Lee2024115126 using density matrix renormalization group (DMRG). The success of the gGA in the multiorbital Hubbard model critically depends on the ability of DMRG or related techniques to approximately obtain a ground state in an expanded local Hilbert space. In terms of accuracy, both approaches appear to be excellent over all parameter space, though it seems possible that VDAT is more adept at capturing subtle differences between competing phases (e.g. compare Fig. 4 in Ref. Cheng2022205129 with Fig. 2 in Ref. MejutoZaera2023235150 ).
Another attempt to resolve the limitations of the GA is the slave boson approach Kotliar19861362 , which formally recasts the Hubbard model as a Hamiltonian of constrained fermions and auxiliary bosons. The simplest saddle point approximation is typically referred to as slave boson mean-field theory (SBMF), which exactly recovers the GA in the multi-orbital Hubbard model Bunemann2007193104 and has extensive applications Lechermann2007155102 ; Isidori2009115120 ; Bunemann2011203 ; medici2017167003 ; Piefke2018125154 ; Isidori2019186401 ; Chatzieleftheriou2020205127 . Given that the slave boson approach is an exact formalism, there have been various results including fluctuations beyond the mean-field solution Lavagna1990142 ; Jolicoeur19912403 ; Li1991369 ; Raimondi199311453 ; Arrigoni19933178 ; Li199417837 ; Zimmermann199710097 , though none of these studies address corrections to the ground state properties in the thermodynamic limit. A related idea is the time dependent Gutzwiller approximation Seibold20012605 ; Lorenzana2003066404 ; Schiro2010076401 ; Oelsen2011076402 ; oelsen2011113031 ; Schiro2011165105 ; Bunemann2015550 ; Fabrizio2017075156 , which has mainly focussed on computing response functions. One study did demonstrate that the ground state properties of the single orbital Hubbard model can be improved by updating the double occupancy based on the density-density response function Seibold20012605 , but this would be impractical in the multi-orbital Hubbard model, and we are not aware of any such applications. Therefore, neither slave bosons nor time dependent GA has provided a path beyond the GA for ground state properties, though the idea of ghost orbitals has recently been adopted into the slave boson methodology Lanata2022045111 , where the saddle point solution recovers the gGA. Another approach related to SBMF is the slave spin approach De'medici2005205124 , which recasts the Hubbard model with density-density interactions as a Hamiltonian of constrained fermions and auxiliary spins, and the saddle point approximation also recovers the GA. As in the case of slave bosons, the slave spin approach has produced many interesting results on the multi-orbital Hubbard model Hassan2010035106 ; medici2017167003 ; Georgescu2015235117 ; Maurya2021425603 ; Maurya2022055602 ; Crispino2023155149 ; Gorni2023125166 , but has not offered a path beyond the mean-field solution. In the present work, we demonstrate that the qubit parametrization for a G-type SPD is identical to the slave spin mean field theory (see Appendix B).
Finally, it is useful to discuss the off-shell effective energy theory (OET) Cheng2020081105 , which can be reinterpreted through the G-type and B-type SPD for . OET introduces an approximation to evaluate the energy under the SPD, known as the central point expansion (CPE), and this is applied to both the G-type and B-type SPD for . Though the CPE is intrinsically different than the SCDA, it also exactly evaluates the SPD in , which is proven in the present work (see Appendix C). OET introduces corrections to the CPE for the G-type and B-type SPD, guaranteeing the limiting behaviors for the weak and strong coupling limits, respectively. Subsequently, one evaluates the total energy in both cases and selects the solution with lower energy. OET has yielded accurate results for the single band Hubbard model in , , and . The correction to the CPE becomes far more challenging in the multiorbital Hubbard model, which has not been pursued further. However, a similar idea of empirically correcting the Gutzwiller approximation has been developed in the context of the correlation matrix renormalization approximation Yao2014045131 ; Liu2021081113 ; Liu2021095902 ; Liu2022205124 .
The paper is organized in the following manner. Section II provides an overview of the SPD ansatz, the gauge symmetry of the SPD, the SCDA, and the tensor product representation for evaluating expectation values. Additionally, several methodological generalizations are provided. Section III provides a high level overview of the qubit energy form for the G-type and B-type SPD at and the G-type SPD at . Detailed derivations of the results from Section III are presented in Sections IV, V, and VI. Finally, the qubit energy form is used to examine the multi-orbital Hubbard model at half filling with in Section VII.
II Review of the variational discrete
action theory
Here we review several key ingredients of VDAT, including the sequential product density matrix (SPD), the self-consistent canonical discrete action theory (SCDA), and the tensor product representation for evaluating local expectation values within the SCDA. In addition to reviewing these concepts, we generalize the gauge symmetry of the SPD beyond the case of a diagonal transformation, and generalize the tensor product representation to arbitrary . These developments will facilitate the subsequent analysis in this work.
II.1 The sequential product density matrix
The SPD is the variational ansatz used within VDAT Cheng2021206402 ; Cheng2021195138 , and the SPD is a straightforward generalization of a variational wave function ansatz first articulated in Ref. Dzierzawa19951993 . In this work, we focus on the multiorbital Hubbard model, given as
[TABLE]
where and denote momentum and real-space site indices, respectively, is the spin-orbital index within a local site, and denotes the Coulomb interaction. In the context of the multiorbital Hubbard model, the SPD with an integer time step is given as
[TABLE]
where is the kinetic projector, is the interacting projector, is the integer time index, the diagonal Hubbard operator is defined as
[TABLE]
and is -th bit in the binary representation of , given as , and . It should be noted that in general, is a direct product of local projectors from all sites while is a general non-interacting projector. There are two schemes to ensure that the SPD is Hermitian and positive semi-definite, denoted as G-type or B-type Cheng2021195138 . In the following, we will examine both types for . For simplicity, we restrict and to be real numbers. It is also useful to reparametrize, up to a constant, the and as
[TABLE]
where the and are a reparametrization of and , and this form is used to derive the qubit parametrization (see Section III).
We first examine the SPD. The G-type SPD for is defined by , which is a non-interacting many-body density matrix. If where is a single Slater determinant, then corresponds with the Hartree-Fock wave function. Conversely, the B-type SPD for is defined as . Normally corresponds to a mixed state, but if with defined as a direct product of atomic states from all sites, then corresponds to a pure state.
The G-type SPD with is defined as
[TABLE]
If the center projector is taken as with being a single Slater determinant, then , where is the Gutzwiller wave function (GWF). The B-type SPD for is defined as
[TABLE]
If the center projector is taken as with defined as a direct product of atomic states from all sites, then , where and we have previously referred to as the Bearyswil wave-function Cheng2021206402 ; Cheng2021195138 . However, it should be noted that is distinct from the original Bearyswil wave-function Baeriswyl19870 , which is defined as , where is a variational parameter and is the fully projected GWF. Though is technically a special case of a G-type SPD, which can also be evaluated using the SCDA, it yields the same energy as for the insulating phase in for the multi-orbital Hubbard model (see Section VI.4.3 and Figure 3).
The G-type SPD for combines the variational power of and , and is defined as
[TABLE]
If corresponds to a single Slater determinant , then , where is a mild generalization of the original Gutzwiller-Baeriswyl wave function introduced by Otsuka (see Eq. (2.3) in Ref. Otsuka19921645 ), defined as , where is a variational parameter. Finally, the B-type SPD is defined as . If , then where and we have previously referred to as the Baeriswyl-Gutzwiller wave-function. However, it should be noted that is distinct from the original Baeriswyl-Gutzwiller wave-function Dzierzawa19951993 , which is defined as , where is a variational parameter and is the fully projected GWF. The is technically a special case of a G-type SPD, which can also be evaluated using the SCDA.
It is useful to understand how the SPD recovers the exact solution with increasing Cheng2021195138 . The Trotter-Suzuki decomposition Suzuki1976183 , given as
[TABLE]
can be considered as a special case of the SPD where the variational parameters are set by the parameters of the Hamiltonian and the temperature. Given that the large Trotter-Suzuki decomposition recovers the exact finite temperature density matrix, the SPD in the large limit possesses sufficient variational power to solve the Hamiltonian exactly. However, for a finite , the SPD leverages the full variational freedom while maintaining the integer time structure, leading to an observed exponential decrease of the energy error with increasing Cheng2021206402 ; Cheng2022205129 . In contrast, the energy error for the Trotter-Suzuki decomposition diminishes polynomially with increasing Suzuki1976183 . This key difference is the basis for the accuracy of the SPD for small .
Our initial efforts using VDAT have been exactly evaluating the SPD in via the SCDA, and this same approach can be used as a local approximation in finite dimensions. However, there is an existing body of literature which used variational quantum Monte-Carlo (VMC) to evaluate a simplified SPD in the Hubbard model. In the case of , it is most convenient to execute VMC in real space, which has been used to evaluate the Gutzwiller wave function in the Hubbard model Yokoyama19871490 ; Yokoyama19873582 ; Yokoyama19882482 . For in the one and two dimensional Hubbard model, a restricted form of the SPD has been evaluated using quantum Monte-Carlo to solve the single-orbital Hubbard model in two dimensions Otsuka19921645 ; Yamaji1998225 ; Yanagisawa19983867 ; Yanagisawa19993608 ; Koike200365 ; Yanagisawa2016114707 ; Yanagisawa2019054702 ; Yanagisawa20202040046 ; Yanagisawa202112 ; Yanagisawa2021127382 ; Sorella2023115133 ; Levy2024013237 , the - model Yanagisawa202127004 ; Yanagisawa2001184509 , and selected molecules Chen20234484 .
II.2 Gauge Symmetry of the SPD
A important aspect of the SPD is the corresponding gauge symmetry Cheng2022205129 ; Cheng2023035127 , which will prove to be an important tool for reducing the redundancy of the parametrization of the SPD. Consider the general SPD , where is a kinetic projector and and , and is a general interacting projector. The essence of the gauge symmetry lies in the fact that only the total product determines physical observables, while there is considerable flexibility in the decomposition of into different time steps and the separation between kinetic and interacting projectors. In order to analyze the gauge symmetry, we utilize the discrete action theory Cheng2021195138 , which is a formalism for studying integer time correlation functions of the SPD. The discrete action theory can be seen as a generalization of the many-body Green’s function formalism to the case of integer time, including an integer time Dyson equation given as
[TABLE]
where \text{\bm{g}}_{0} is the non-interacting integer time Green’s functions, is the interacting integer time Green’s functions, and is the exponential form of the integer time self-energy, where is the integer time self-energy. The integer time Dyson equation recovers the usual Dyson when the SPD is chosen as the Trotter-Suzuki decomposition in the large limit Cheng2021195138 . The \text{\bm{g}}_{0} and are naturally defined in the compound space Cheng2021195138 as \text{\bm{g}}_{0}=\left\langle\textrm{\@text@baccent{\hat{\bm{n}}}}\right\rangle_{\textrm{\@text@baccent{\hat{\spdsymb}}}_{0}} and \text{\bm{g}}=\left\langle\textrm{\@text@baccent{\hat{\bm{n}}}}\right\rangle_{\textrm{\@text@baccent{\hat{\spdsymb}}}}, where [\textrm{\@text@baccent{\hat{\bm{n}}}}]_{\ell\tau,\ell^{\prime}\tau^{\prime}}=\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger(\tau)}\textrm{\@text@baccent{\hat{a}}}_{\ell^{\prime}}^{(\tau^{\prime})}, the underbar indicates an operator in the compound space, enumerates the spin-orbitals for the entire system, and labels the integer time step. The discrete Dyson equation is a matrix equation of dimension of , and it exactly relates the interacting and noninteracting integer time Green’s function via . Given that is more convenient to work with, we refer to as the integer time self-energy for brevity. For the convenience of discussing the gauge transformation of the SPD, it is useful to define several additional quantities
[TABLE]
where and are matrices. Using these definitions, and can be defined succinctly as
[TABLE]
Generally, there are two types of gauge transformations of the SPD: intra-time-step and inter-time-step. The intra-time-step transformation occurs at the boundary of the kinetic and interacting projector for time step in the following way: and . Since the intra-time-step transformation does not alter the measurement at integer times, is invariant to this transformation. However, given that changes after the intra-time-step transformation, we have , where , while . Conversely, the inter-time-step transformation is defined at the boundary of the interacting projector at and the kinectic projector at in the following way: and , where . We first discuss how changes under this inter-time-step transformation, and it is useful to note the following identities (for details, see Eqns. (A22) and (A23) in Ref. Cheng2021195138 and Eqns. (A1)-(A4) in Ref. Cheng2022205129 )
[TABLE]
Using the above two equations, we can determine the transformation , where , and correspondingly . Using the transformation for the kinetic projector, we have , where and , and can be rewritten as . It should be noted that a general gauge transformation includes both intra-time-step and inter-time-step transformations, which can be parametrized by and , and the above results can be synthesized as
[TABLE]
Moreover, the form of the integer time Dyson equation is invariant after the transformation, which requires
[TABLE]
Using Eq. (20), we have
[TABLE]
while using Eqns. (23), (19), (21), and (22), we have
[TABLE]
which can also be directly verified by assuming is non-interacting. Correspondingly, the non-interacting integer time Green’s function changes as
[TABLE]
Notice that Eqns. (18)-(25) are exact relations. Within the SCDA Cheng2021195138 , will transform similarly to , where and are local. These gauge transformations provide a rigorous theoretical foundation for simplifying the task of achieving self-consistency within the SCDA. Moreover, choosing the appropriate gauge condition will also help to improve the numerical stability when minimizing over the variational parameters.
II.3 Review of the SCDA
The discrete action theory is a formalism designed to study integer time correlation functions of the SPD, and the self-consistent canonical discrete action theory (SCDA) is the integer time analogue of the dynamical mean-field theory Cheng2021206402 ; Cheng2021195138 . The SCDA exactly evaluates the SPD in infinite dimensions, and can be used as a local approximation for the SPD in finite dimensions. To appreciate the idea of the SCDA, we define the discrete action which encodes the integer time correlation of the SPD as
[TABLE]
where \textrm{\@text@baccent{\hat{\spdsymb}}}_{0}=\textrm{\@text@baccent{\hat{Q}}}\prod_{\tau=1}^{\mathcal{N}}\textrm{\@text@baccent{\hat{K}}}_{\tau}^{\left(\tau\right)} and \textrm{\@text@baccent{\hat{P}}}=\prod_{\tau=1}^{\mathcal{N}}\textrm{\@text@baccent{\hat{P}}}_{\tau}^{\left(\tau\right)} are the non-interacting and interacting parts of the discrete action ̱\hat{\varrho}, and the underbar denotes operators in the compound space.
The SCDA can be formulated as a scheme to evaluate the kinetic energy and local interacting energy through two self-consistent approximations of ̱\hat{\varrho} Cheng2022205129 , denoted as \textrm{\@text@baccent{\hat{\rho}}}_{K} and \textrm{\@text@baccent{\hat{\rho}}}_{loc}, respectively. Within the SCDA, the total energy is given as
[TABLE]
where \textrm{\@text@baccent{\hat{\rho}}}_{K} and \textrm{\@text@baccent{\hat{\rho}}}_{loc} are defined as
[TABLE]
where is the non-interacting integer time Green’s function for the local canonical discrete action \textrm{\@text@baccent{\hat{\rho}}}_{loc}. Both and are matrices, where is the number of spin-orbitals of the entire system. In the SCDA, the key assumption is that and are block diagonal in real space clusters, given by and where and are real space cluster indices. To determine and , we have the following two sets of conditions for each cluster index ,
The integer time Dyson equation, given as
[TABLE]
provides a way to compute by evaluating \textrm{\@text@baccent{\hat{\rho}}}_{loc;i}=\text{Tr}_{/i}\textrm{\@text@baccent{\hat{\rho}}}_{loc}, which can be expressed as
[TABLE] 2. 2.
The self-consistency condition for the local integer time Green’s function, given as
[TABLE]
where \boldsymbol{g}_{i}=\left\langle\textrm{\@text@baccent{\hat{\boldsymbol{n}}}}_{i}\right\rangle_{\textrm{\@text@baccent{\hat{\rho}}}_{loc}} and \boldsymbol{g}^{\prime}_{i}=\left\langle\textrm{\@text@baccent{\hat{\boldsymbol{n}}}}_{i}\right\rangle_{\textrm{\@text@baccent{\hat{\rho}}}_{K}}.
It is interesting to examine the sufficiency of Eq. (30) and (32) to determine both and by counting the number of unknown entries and equations. We denote the number of sites in the lattice as and the number of spin orbitals on site as . The number of unknowns in and is . For a given , the two matrix equations given by Eq. (30) and (32) provide entries. Therefore, we have a sufficient number of equations to determine and .
II.4 The tensor product representation for expectation values under \textrm{\@text@baccent{\hat{\rho}}}_{loc;i}
The key computational cost of implementing the SCDA is to compute local observables under \textrm{\@text@baccent{\hat{\rho}}}_{loc;i}, and a convenient approach was devised for using a tensor product representation Cheng2023035127 . Here we generalize the tensor product representation to arbitrary . We assume that the interacting projector has the form and is diagonal in the spin orbital index , given as , where is an matrix. For a local operator ̱\hat{O} at site , the expectation value can be written as
[TABLE]
where the non-interacting discrete action for \textrm{\@text@baccent{\hat{\rho}}}_{loc;i,0} is given by
[TABLE]
where the vector enumerates over the index with as the number of local diagonal Hubbard operators at site , the dimensional tensor \left(\textrm{\@text@baccent{\hat{O}}}\right)_{u} is defined as
[TABLE]
the direct product is defined as , and is the contraction between two -dimensional tensors as
[TABLE]
We now consider operators that can be written as a product over the spin orbitals, given as \textrm{\@text@baccent{\hat{O}}}=\prod_{\ell}\textrm{\@text@baccent{\hat{O}}}_{\ell}. Assuming that is diagonal in the spin orbital index , we have
[TABLE]
where is defined in Eq. (4). Therefore, we can write \left(\textrm{\@text@baccent{\hat{O}}}\right)_{u} as a direct product
[TABLE]
where the direct product is defined as
[TABLE]
with \left(\textrm{\@text@baccent{\hat{O}}}_{\ell}\right)_{u;\ell} defined as
[TABLE]
which only depends . For some cases, there is no interacting projector on a given time step, and the corresponding dimension of \left(\textrm{\@text@baccent{\hat{O}}}_{\ell}\right)_{u;\ell} can be traced out. Moreover, Eq. (33) can be conveniently rewritten into a matrix product for all cases we study, which will be further discussed in Sections IV, V, and VI.
III Overview of the qubit energy form
The SCDA can exactly evaluate an SPD in with arbitrary via a self-consistent solution, as outlined in Section II, and therefore the total energy can only be numerically evaluated for a given set of variational parameters in general. However, the recently developed gauge constrained SCDA algorithm Cheng2023035127 allows for an explicit evaluation of a G-type SPD with , circumventing the need for self-consistent solution (see Section VI.2 for a review). In the present work, we offer a further refinement of the gauge constrained SCDA by transforming all variational parameters into physically intuitive variables, referred to as the qubit parametrization. Moreover, the qubit parametrization analytically resolves one constraint, reducing the number of variational parameters by one per spin orbital. Given the complexity of deriving the qubit parametrization, we collect all key results in this section, providing a self-contained presentation of all details needed to implement a practical calculation. A G-type and B-type SPD at will be denoted as and , respectively. For pedagogical purposes, we first present results for and before finally considering , illustrating that can be seen as a unification of and . Detailed derivations of all results for , , and are provided in Sections IV, V, and VI, respectively, while explicit applications are presented in Section VII. It should be emphasized that all qubit parametrization results are mathematically identical to corresponding VDAT results.
For the sake of generality, we consider the multiorbital Hubbard Hamiltonian given as
[TABLE]
where is a polynomial function of the local density operators, given as
[TABLE]
where are parameters that define the n-body interactions. For a typical Hubbard model, only two-body interactions will be needed.
We begin by defining the local Hilbert space and the corresponding fermionic operators, and the corresponding map to qubit operators via the Jordan-Wigner transformation. For a given site , the atomic configurations form a complete basis for the local Hilbert space with dimension , and we use and to represent the creation and annihilation operators for spin-orbital . Then we express the fermionic operators in terms of spin operators via the Jordan-Wigner transformation defined as
[TABLE]
where and is defined as
[TABLE]
where and is a standard Pauli matrix. It should be noted that we choose a convention for the Jordan-Wigner transformation such that is associated with and is associated with . This local Jordan-Wigner transformation can be used to transform the usual Gutzwiller energy form into the qubit energy form, though spin operators will naturally emerge when using the tensor representation of local observables within the SCDA.
We proceed by evaluating the ground state energy under , where is parametrized as
[TABLE]
where the variational parameters and are real and . The variational parameters can be reparametrized as a pure state of an effective qubit system (i.e. a spin system) characterized by a many-body density matrix . The total trial energy for can then be written as
[TABLE]
with a constraint for each
[TABLE]
Here we have taken the continuum limit of the discretized and choose the convention . Equations (50)-(54) give an explicit functional form for the trial energy as a function of the variational parameters. The qubit energy form in Eqns. (50)-(54) is equivalent to a previous result obtained using the slave spin mean-field method De'medici2005205124 (see Appendix B for a detailed comparison). It should be emphasized that Eqns. (50)-(54) are simply a transformation of the SCDA algorithm, and therefore are completely equivalent to the usual Gutzwiller approximation.
We now proceed to study , which is the dual of , given as
[TABLE]
where the variational parameters and are real and . The qubit parametrization reparametrizes the and into and , where is a diagonal, positive semi-definite matrix and is the physical single particle density matrix, yielding the following total energy:
[TABLE]
with a constraint for each
[TABLE]
It should be noted that the operators and have the same expectation values under , which is evident from Eq. (57). This energy form provides a minimal description for the Mott insulating state in the multiorbital Hubbard model in , much like the result of the Gutzwiller approximation for the metallic phase.
We now proceed to study , which combines the variational capacity of both and . The key result of this paper is to recast the previously obtained explicit evaluation of the energy under into a physically intuitive form which can be viewed as a combination of the variational parameters of the qubit energy form for and . The is given as
[TABLE]
where the variational parameters , , and are real and . As demonstrated in Ref. Cheng2023035127 , when corresponds to a Slater determinant (i.e. or ), the total energy can be explicitly evaluated, and therefore we follow this condition. Under this restricted form of , it is useful to introduce the concept of a reference Fermi surface, which delineates the boundary between and throughout the Brillouin zone. We can reparametrize and in terms of and , where is matrix corresponding to a pure state of a qubit system and is the physical momentum density distribution. The total trial energy is given as
[TABLE]
and there are two constraints for each given as
[TABLE]
where the symbol indicates that the integration is over the region where for a given , and the symbol indicates the region where . For a given , the quantities , and are nontrivial analytical functions of the following five variables
[TABLE]
where and are constrained such that
[TABLE]
which ensures that , and are real numbers (see Eq. (73)). The explicit functional dependence is given by the following list of equations
[TABLE]
Several important points should be noted. First, the only enter the local interaction energy through , , , and the constraint . Second, the only enter the local interaction energy through the regions of integration (i.e. and ) and the constraint . Third, the operators and have the same expectation values under , as in the case of the . Finally, it should be clear that evaluating the has a similar computational cost as compared to and , where the largest computational cost is associated with evaluating the local interaction energy.
In the following, we briefly discuss how to numerically minimize the qubit energy form for . For a multiorbital Hubbard model with spin-orbitals, the formal variational parameters are , , and with , and there are two local density constraints per spin orbital (see Eqn. (64)). We choose corresponding to the momentum density distribution for the non-interacting Hamiltonian with local density , and therefore is determined from . It is important to realize that the local interaction energy does not depend on the full details of , but instead only on , , and . Therefore, for a given spin orbital , four Lagrange multipliers, , , , and , can be used to obtain the partially optimized as Cheng2023035127
[TABLE]
where is either or . Interestingly, the optimized yields a non-trivial dependence on , in contrast to the Gutzwiller approximation (see Figure 1). There is a local density constraint between and given by Eqn. (64), and there are two strategies to enforce this. The first strategy is to start from , which are parametrized by variational parameters , , , and , though only are independent given Eqn. (64). Subsequently, the can be parametrized by independent variational parameters Cheng2023035127 . Therefore, there are a total of independent variational parameters. This strategy motivates the construction of a one-body reduced density matrix functional, as presented in our companion paper companion . The second strategy is to start from , which can be parametrized by independent variational parameters, which determines . Subsequently, the , , and are chosen as independent variational parameters, which determines , yielding independent variational parameters. The second strategy allows for a trivial implementation of the inequality constraint among , , and (see Eq. (386)), for a given , which is very important in practice. In either case, one is left with minimizing over variational parameters, which may be achieved using a variety of standard approaches.
Finally, we summarize the applications that are studied in Section VII.2. Consider the multiorbital Hubbard model with density-density interactions given as
[TABLE]
where , , , and , with the orbital indices taking values of and . We consider the special case where is independent of and has particle-hole symmetry. In this case, we show that Eq. (62) can be optimized over and , with the assumption that the optimized is given as where is the Heaviside function and is chosen such that , yielding a trial energy purely as a functional of as
[TABLE]
where depends on and , and is independent of and has particle-hole symmetry. Here we have dropped the spin-orbital index (e. g. ) and define . An important result is that is non-analytic at , which can be used to infer the nature of the Mott transition. For , the can be numerically represented by a one dimensional spline function companion , while has an explicit expression which can be derived as
[TABLE]
The above equations finally deliver a closed form expression for a trial energy which qualitatively and quantitatively captures the Mott transition in the single band Hubbard model at half-filling in , exactly reproducing the VDAT results for a G-type SPD at .
IV Derivation of the qubit energy form for
In this section, we derive Eq. (50) using several different approaches. In Section IV.1, we review the GA using both a heuristic derivation and the central point expansion (CPE), and convert the standard energy form of the GA to the qubit energy form using the Jordan-Wigner transformation. In Section IV.2, we use the SCDA within the Gutzwiller gauge to evaluate , and the tensor product representation is used to obtain the qubit energy form.
IV.1 The derivation of the qubit energy form from the GA
In this section, we provide an elementary derivation of the qubit energy form, which consists of two steps. In the first step, we derive the standard form of the GA, using both a heuristic argument and the CPE Cheng2020081105 , where the energy is parametrized by and , where is the local reduced density matrix of the ansatz, which is a diagonal matrix in the basis of . The quasi-particle weight is constructed from and the fermionic creation and annihilation operators. In the second step, we convert the standard form of the GA to the qubit energy form which is parametrized by and , where is a pure state in a -qubit space, and the quasi-particle weight is constructed from and the Pauli spin operators.
IV.1.1 Derivation of the GA: a heuristic argument and the CPE
While the Gutzwiller approximation is well known, it is normally applied to the special case where is a pure state. Here we provide a heuristic derivation for a general . Additionally, we use the CPE to derive the same result from a different perspective. We begin by presenting the heuristic derivation of the GA. First consider the expectation values for the diagonal Hubbard operator measured under , given as
[TABLE]
where denotes the local density at a given site for the spin orbital under . The first assumption of the GA is that the atomic configuration distribution under can be approximated as
[TABLE]
which ignores off-site contributions in . Additionally, a constraint is applied to such that the local density is invariant, given as
[TABLE]
which we refer to as the Gutzwiller constraint. The denominator in Eq. (102) ensures that the sum of all expectation values for the diagonal Hubbard operators are normalized. Given that the off-diagonal Hubbard operators have zero expectation value due to the restriction of the SPD used in this study, the expectation values for all diagonal Hubbard operators allow the evaluation of any local observable. The second assumption of the GA lies within the evaluation for the hopping term between two distinct sites. The idea is to break a hopping term like into two processes: first annihilating an electron at site and then creating an electron at site , and the GA assumes the probabilities for the two steps are independent and only depend on the atomic distributions on site and . The probability of creating or destroying an electron is renormalized by the interacting projector, and the GA assumes this is obtained by counting all relevant one-particle excitation processes as
[TABLE]
To simply this expression, we introduce and , which are defined within the local Hilbert space as
[TABLE]
which are matrices of dimension , yielding
[TABLE]
The Gutzwiller constraint, Eq. (103), can be rewritten as . Given the second assumption of the GA, the single particle density matrix between two different sites and is renormalized as
[TABLE]
Combining Eq. (108) with the Gutzwiller constraint, the single particle density matrix for arbitrary and is given as
[TABLE]
The momentum density distribution can then be computed as
[TABLE]
where is the number of -points, , and . Therefore, the standard form for the total energy per site of the GA is given by
[TABLE]
with the density constraint
[TABLE]
The variational parameters satisfy and is a diagonal positive semi-definite matrix.
We now provide an overview of how to derive Eq. (112) using the CPE (see Appendix C.1). We begin by making several observations about the GA. First, when , the describes a collection of atoms, and in this case Eq. (112) yields an exact evaluation in any dimension. Second, there is a linear relation between and given by
[TABLE]
Third, is directly determined by and , and is independent of . Recall that the is defined as , where and . We now introduce , which has the same local density matrix as , and , which describes a collection of atoms. The can be reparametrized by the local reduced density matrix of , where and is constrained by Therefore, and are functionals of and . The CPE amounts to the expansion of observables in terms of about , and up to the first order, one recovers Eq. (116) and , proving that the first-order CPE recovers the GA.
IV.1.2 Converting the GA energy into the qubit form using the Jordan-Wigner
transformation
Here we discuss how to convert the standard form of the GA energy into the qubit form, which is mathematically equivalent. The qubit form provides a unified view of the energy evaluated using , , and . The qubit parametrization consists of two steps. First, we introduce a purified many-body density matrix from . Second, we perform the Jordan-Wigner transformation, which converts (Eq. (114)) from an expression involving fermionic operators and into an expression involving spin operators . We begin by defining , where , which yields
[TABLE]
Using the Jordan-Wigner transformation (see Eqns. (43) and (44)), we obtain
[TABLE]
Therefore, we can rewrite the numerator of (see Eq. (107)) as
[TABLE]
where we used the fact that is a real symmetric matrix and the following relation
[TABLE]
Moreover, the denominator of (see Eq. (107)) is given as
[TABLE]
Therefore, we have demonstrated that , and the local interaction can be written as
[TABLE]
Using the expression for and the local energy, we arrive at the energy expressions given in Eqns. (50)-(53). This proof demonstrates that the usual Gutzwiller approximation can be straightforwardly transformed into the qubit energy form, which is equivalent to the result of the slave spin mean-field theory De'medici2005205124 (see Appendix B for additional details).
IV.2 The derivation of the qubit energy form using the SCDA
In this section, we use the gauge constrained SCDA to derive the qubit energy form. The derivation consists of two steps. First, in Section IV.2.1, the Gutzwiller gauge is used to rederive the standard form of the GA energy. Second, in Section IV.2.2, we derive the qubit energy form using the tensor representation, which does not rely on the Jordan-Wigner transformation. Additionally, in Section IV.2.3, we discuss how the quantities , , and change under a general gauge transformation.
IV.2.1 The SCDA within the Gutzwiller
gauge
In this section, we demonstrate how the gauge of the SPD can be used to automatically satisfy the SCDA self-consistency condition Cheng2021195138 , and we utilize some notation from the CPE. Interestingly, the Gutzwiller constraint of the GA can be used to define an appropriate gauge for the SPD, which we refer to as the Gutzwiller gauge. Starting from with an arbitrary gauge, a gauge transformation and can always be performed to ensure that , where is chosen to satisfy the Gutzwiller constraint. Within this Gutzwiller gauge, we can choose , where the component for the spin-orbital is given by
[TABLE]
We now prove that this ensures that the SCDA self-consistency condition is automatically satisfied. The first step of the proof relies on the fact that under the Gutzwiller gauge \textrm{\@text@baccent{\hat{\rho}}}_{loc} within the SCDA is the discrete action of the central point of the SPD , which can be shown as follows. The local reduced density matrix of at site is , which yields a discrete action \textrm{\@text@baccent{\hat{\rho}}}_{G2;i}^{\star}=\textrm{\@text@baccent{\hat{\rho}}}_{G2;i;0}^{\star}\textrm{\@text@baccent{\hat{P}}}_{1;i}^{\left(1\right)}\textrm{\@text@baccent{\hat{P}}}_{1;i}^{\left(2\right)}, where \textrm{\@text@baccent{\hat{\rho}}}_{G2;i;0}^{\star}=\textrm{\@text@baccent{\hat{Q}}}\textrm{\@text@baccent{\hat{K}}}_{2;i}^{\star}. Given that , the integer time Green’s function for \textrm{\@text@baccent{\hat{\rho}}}_{G2;i;0}^{\star} is given by Eq. (127), and correspondingly \textrm{\@text@baccent{\hat{\rho}}}_{G2;i}^{\star} is equivalent to the local discrete action \textrm{\@text@baccent{\hat{\rho}}}_{loc;i} for the SCDA, proving \textrm{\@text@baccent{\hat{\rho}}}_{G2}^{\star}=\textrm{\@text@baccent{\hat{\rho}}}_{loc}. Local observables within the SCDA can then be evaluated under the central point of the SPD as
[TABLE]
consistent with the relation derived from the CPE (see Eq. (595)). Similarly, the local interacting integer time Green’s function can be evaluated as
[TABLE]
where
[TABLE]
and
[TABLE]
and the Gutzwiller gauge ensures that the diagonal part of is . To connect and with , we use the following relations
[TABLE]
which yields . The integer time self-energy can then be computed as , using , which yields
[TABLE]
Therefore, the interacting integer time Green’s function for a given -point is given by , where , which yields
[TABLE]
and the momentum density distribution is in agreement with Eq. (111). Finally, the SCDA self-consistency can be verified as
IV.2.2 Derivation of the qubit energy form
Here we show how to derive the qubit energy form, given in Eqns. (50)-(54), using the tensor product representation. We begin by evaluating the relevant observables under \textrm{\@text@baccent{\hat{\rho}}}_{loc}, where the tensor product representation in Eq. (33) simplifies to
[TABLE]
given that . The relevant components needed to evaluate the local integer time Green’s function in the Gutzwiller gauge are given by
[TABLE]
A linear transformation can be introduced such that
[TABLE]
where
[TABLE]
We choose such that the identity operator in the -representation is an identity matrix, which is given by
[TABLE]
and correspondingly, the components in the -representation are given by
[TABLE]
To connect with the qubit energy form, we define the many-body density matrix corresponding to a pure state of the qubit system as
[TABLE]
where the renormalization factor can now be rewritten as
[TABLE]
using Eqns. (167), (170), and (129). It should be emphasized that is a matrix, defined in Eq. (46). Additionally, the local expectation value of the interaction energy can be rewritten as
[TABLE]
using Eq. (173) and (161), where is defined in Eq. (45).
IV.2.3 SCDA under a general gauge transformation
Here we discuss the SCDA within an arbitrary gauge, where , as this will be important to understanding the gauge constrained algorithm for . A general gauge transformation which maintains a G-type form is defined as , , , where . Using the results of Section II.2, this gauge transformation can be decomposed in terms of an intra-time-step transformation given by , and an inter-time-step transformation given by . The corresponding transformations for the integer time Green’s functions can be found in Section II.2. In the following, we focus on the transformation with the form where is a real number. Using Eq. (18), the transformation of with the component is given as
[TABLE]
Using Eq. (24), the transformation for is given by
[TABLE]
Finally, the transformation for is given by
[TABLE]
We proceed by exploring a particular choice of gauge for the SPD, referred to as the anti-symmetric gauge, which will motivate the gauge choice for the case of . The essence of the anti-symmetric gauge is to choose such that , which can be accomplished as
[TABLE]
Under the anti-symmetric gauge, we have
[TABLE]
The anti-symmetric gauge can also automatically satisfy the SCDA self-consistency condition, and be used to derive the qubit energy form.
V Derivation of the qubit energy form for
In this section, we derive Eq. (56) using several different approaches. In Section V.1, we present two derivations: a heuristic approach and the central point expansion (CPE). In Section V.2, we use the SCDA to evaluate , and the tensor product representation is used to obtain the qubit energy form.
V.1 Derivation of the qubit energy form: a heuristic
approach and the CPE
To begin, it is useful to rewrite the local interaction Hamiltonian given in Eq. (42) as
[TABLE]
where , the density fluctuation is defined as , the index enumerates all possible subsets of the local spin orbitals, with when , and the parameters reparametrize the coefficients in Eq. (42). Using the alternative form of the local interaction Hamiltonian given in Eq. (195), the corresponding qubit form of the trial energy is given as
[TABLE]
where the variational parameters , the is a many-body density matrix for a qubit system which is diagonal in the Pauli-Z basis, , and . For a given , there is constraint given by
[TABLE]
Below, we present two approaches for deriving Eq. (196).
A heuristic approach for deriving Eq. (196) is via the formal duality between and . First, in , the center projector is constrained to have the same local density as , i.e., . Similarly, the center projector in is constrained to have the same local density as , i.e., . Second, in (see Eq. (50)), the momentum density fluctuation is renormalized from the bare momentum density fluctuation with a factor , which depends on . To the contrary, is renormalized from the reference value with a factor , where depends on . The expression for can be determined using a counting scheme similar to the one used within the GA. We begin by transforming into momentum space, which requires a summation over terms consisting of creation and annihilation operators, where is the number of spin orbitals in set . Each creation or annihilation process for will be scaled by
[TABLE]
where enumerates the empty and occupied states for a given and
[TABLE]
Using the infinite dimensional approximation where momentum conservation can be neglected, we obtain . By identifying as the local reduced density matrix of , we obtain the form given in Eq. (197).
A more rigorous way to derive the factor in Eq. (197) is to use the CPE. We outline the derivation here, and the details can be found in Appendix C.2. The CPE for is a dual version to the CPE of , where the latter is described in Section IV.1.1. We start by defining the central point as , where is a non-interacting projector chosen such that . The kinetic projector can be parametrized using with the constraint while can be parametrized using and . Therefore, the observables under are functionals of and . Performing a first order expansion in terms of about , we obtain and
[TABLE]
Finally, we explain how to obtain Eqns. (56)-(58) from Eq. (202). We begin by defining the effective density operator where , the corresponding fluctuation form , and a diagonal many-body density matrix for a qubit system . Using and Eq. (202), we have . Furthermore, implies that . Therefore, the expectation value of is given by , providing the connection to Eqns. (56)-(58).
V.2 The derivation of the qubit energy form via the SCDA
In this section, we use the gauge constrained SCDA to derive the qubit energy form. The derivation consists of two steps. First, in Section V.2.1, we use the Gutzwiller gauge to automatically satisfy the SCDA self-consistency condition. Second, in Section V.2.2, we derive the qubit energy form using the tensor representation. Additionally, in Section V.2.3, we discuss how the quantities , , and change under a general gauge transformation.
V.2.1 SCDA within the Gutzwiller gauge
In this section, we use the gauge constrained SCDA to evaluate , where and , recovering the results from Section V.1. The key observation is that for , the interacting projectors only act on the first integer time step. Consider a gauge transformation for given as and where . We choose the gauge transformation such that is given as , where
[TABLE]
and is the local density for the non-interacting SPD . Under this gauge choice, will ensure that with . The integer time self-energy \left[\bm{S}_{i}\right]$${}_{\ell\ell^{\prime}}=\delta_{\ell\ell^{\prime}}S_{\ell} can then be determined as
[TABLE]
The remaining entries of Eq. (203) should be determined from the self-consistency condition in Eq. (32). Given that is the identity matrix, the non-interacting and interacting integer time Green’s functions are given by and where
[TABLE]
and is the momentum density distribution. We can then verify that the self-consistency condition is fulfilled, given that
[TABLE]
where and the is defined as
[TABLE]
We analogously refer to this gauge as the Gutzwiller gauge given that , and therefore the diagonal elements of the two matrices are the same.
V.2.2 Derivation of the
qubit energy form
In this section, we use the tensor product representation to derive the qubit energy form, paralleling the procedure in Section IV.2.2. Given that we only have interacting projectors at the first integer time step, operators in the -representation \left(\textrm{\@text@baccent{\hat{O}}}\right)_{u} can be reduced from a matrix to a vector, given as
[TABLE]
and the expectation value of ̱\hat{O} at site is given as
[TABLE]
where and the dot denotes the normal dot product between two vectors. Given the direct product structure of \left(\textrm{\@text@baccent{\hat{O}}}\right)_{u}, we only need to compute the component for a given spin-orbital. Here, we only list the relevant matrix elements needed to derive the -representation, given as
[TABLE]
The -representation and -representation are related by such that
[TABLE]
and therefore
[TABLE]
When has a direct product form then
[TABLE]
and is chosen to obtain \left(\textrm{\@text@baccent{\hat{1}}}\right)_{w;\ell} as a vector of ones. The matrix elements are given as
[TABLE]
From the preceding equations, we have
[TABLE]
where
[TABLE]
and therefore we have
[TABLE]
We now proceed to reinterpret the local energy in terms of the qubit representation, identifying \left(\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger\left(1\right)}\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\left(1\right)}\right)_{w}=\textrm{diag}(\hat{n}_{\ell}) and \left(\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger\left(2\right)}\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\left(2\right)}\right)_{w}=\textrm{diag}(\hat{n}_{eff,\ell}), Eq. (228) becomes
[TABLE]
Furthermore, we define a diagonal many-body density matrix with , resulting in
[TABLE]
Therefore, the qubit energy form has been recovered.
V.2.3 SCDA under a general gauge transformation
Here we discuss the SCDA within an arbitrary gauge, analogous to Section IV.2.3. Given , a general gauge transformation is given as: , , , where . Therefore, we have and . Assuming , where is real, we can obtain the transformation for , , and as
[TABLE]
We can define an anti-symmetric gauge, similar to the case, where we choose , leading to , yielding
[TABLE]
The anti-symmetric gauge can also automatically satisfy the SCDA self-consistency condition, and be used to derive the qubit energy form.
VI Derivation of the qubit energy form for
In this section, we derive Eq. (62) using the gauge constrained SCDA. In section VI.1, we provide a high level comparison of the original gauge constrained algorithm and the qubit parametrization. In section VI.2, we provide a review of the original gauge constrained algorithm, which is necessary to understand the qubit parametrization in this work. In section VI.3, we propose the qubit parametrization which yields the qubit energy form. In section VI.4, we examine the case of half-filling, and explore how the qubit energy form for can recover the cases of and .
VI.1 Comparing the original gauge
constrained trial energy to the qubit energy form
In this section, we outline how the qubit parametrization improves the original gauge constrained algorithm Cheng2023035127 . In the original gauge constrained algorithm, the trial energy under is given as
[TABLE]
which is a function of , , , and , given that and are functions of and . It should be noted that and , and there are three constraints for a given
[TABLE]
where the functions and are explicitly defined in Ref. Cheng2023035127 . While this parametrization allows for an explicit evaluation of the total energy, there are several shortcomings of this parametrization. First, the function is highly non-trivial and thus the minimization under a fixed density is cumbersome. Previously, this problem was addressed by introducing a linear transformation over , known as the representation. Second, the function may yield a value outside the allowed bounds for . This problem can be addressed by imposing appropriate restrictions on . Finally, makes the physical interpretation of the total energy expression somewhat obscure.
In this paper, the aforementioned shortcomings are resolved using the qubit parametrization (see Section III). The qubit parametrization has two important differences. First, the qubit parametrization employs an effective many-body density matrix having dimension , corresponding to a pure state of a qubit system. The is constructed such that the density of is the same as the physical local density, and it can be viewed as a function of and . Second, the qubit parametrization uses to solve as a function of and , reducing the total number of constraints per spin orbital from three to two.
VI.2 Review of the gauge constrained SCDA algorithm
In this section, we use original gauge constrained SCDA to evaluate . It should be noted that there are several restrictions on the variational freedom of the SPD when using the gauge constrained SCDA. First, the kinetic projector must be diagonal in momentum space. Second, the interacting projector may not introduce off-diagonal terms at the single-particle level. These two restrictions guarantee that the local integer time self-energy and Green’s function are diagonal in the original basis. For Hamiltonians with density-density interactions and hopping parameters that are diagonal in the orbital index, such as the ones treated in this paper, the aforementioned variational restrictions do not limit the variational power of the SPD. There are two critical insights in the gauge constrained algorithm. First, the integer time self-energy only has non-trivial values within the time steps containing the interacting projector, and therefore only needs to be specified in the corresponding regions. Second, the gauge symmetry can be used to restrict the form of .
We start by examining the gauge symmetry of , where the gauge transformation is given by , , and , where and . The gauge transformation can be parametrized by
[TABLE]
as explained in Section II.2. In the following, we assume where is a real number, yielding
[TABLE]
Notice that the interacting projectors only act on the first and second time step, and therefore it is useful to split into the following block structure:
[TABLE]
A similar block structure is adopted for and .
The first step is to focus on the block, which is sufficient to determine the integer time self-energy. Similar to the case of , we only need to specify , which is sufficient to determine and and therefore . Moreover, similar to the derivation of Eq. (261), we have
[TABLE]
The gauge transformation can be used to further restrict the form of . Given that Eq. (266) involves the inverse of , it is more convenient to first use Eq. (259). Notice that given translation symmetry and the fact that the total particle number of a given spin-orbital commutes with . Therefore, Eq. (259) indicates that the diagonal elements of are invariant, while can be chosen such that . Notice that the interacting projectors are the same for the first and second time step, and therefore and . Now consider a gauge transformation with , which still preserves and , and we can choose such that . Therefore, after fully exploring the gauge symmetry of the SPD, it is sufficient to use just one parameter to parametrize as
[TABLE]
and correspondingly
[TABLE]
It is useful to define a new quantity
[TABLE]
which is different from the block of , denoted as . A similar quantity can also be defined as
[TABLE]
which is different from the block of , denoted as . Using the results of Section VI.2.2, the integer time Dyson equation within the block may be written as
[TABLE]
and can be determined as
[TABLE]
where Eqns. (267) and (268) can be used to obtain
[TABLE]
The is a matrix obtained from , yielding
[TABLE]
The second step is to examine the lattice integer time Green’s function and use the self-consistency condition to resolve the , , and blocks. The lattice integer time Green’s function can be parametrized by the physical momentum density distribution n_{k\ell}\equiv\langle\textrm{\@text@baccent{\hat{n}}}_{k\ell}^{(3)}\rangle_{\textrm{\@text@baccent{\hat{\rho}}}_{K}} and the integer time self-energy by assuming is a single Slater determinant with for and for , where the symbols or denote the occupied and unoccupied regions of . The local integer time Green’s function for the lattice is denoted as , where
[TABLE]
where , , and are , , and matrices, respectively, defined as
[TABLE]
where , , and are defined in Eqns. (68), (69), and (67).
Using Eqns. (275), (331), (332), and (333), the , , and blocks of Eq. (265) can be fully determined, completing the algorithm. Finally, the local interaction energy can then be written using the tensor product representation as
[TABLE]
while the kinetic energy is . The are subject to the two linear constraints given in Eqns. (255) and (256), which can be implemented in the -representation using
[TABLE]
while the are subject to .
It is useful to appreciate what can be gleaned from the form of the preceding equations. The block is reminiscent of the case where captures the quasi-particle renormalization, while and are reminiscent of the case where and capture the super-exchange effects. This observation helps illustrate how the simultaneously captures the physics of both and .
In the following, we further elaborate on two points which were not fully elucidated in Ref. Cheng2023035127 . First, we provide explicit expressions for the tensor product representation in the case of . Second, we explore various relations derived using the block structure of the integer time Dyson equation.
VI.2.1 Evaluating observables under \textrm{\@text@baccent{\hat{\rho}}}_{loc} using the tensor
product representation
Consider the tensor product representation of ̱\hat{O} evaluated under \textrm{\@text@baccent{\hat{\rho}}}_{loc}. Given that the interacting projector only acts on the first and second time step, the 3-dimensional tensor representation may be reduced to a 2-dimensional tensor representation as
[TABLE]
Furthermore, the tensor contraction can be simplified as (\textrm{\@text@baccent{\hat{O}}})_{u}\cdot\left(u_{1}\otimes u_{2}\right)=u^{T}(\textrm{\@text@baccent{\hat{O}}})_{u}u with . Therefore, even though the compound space of is larger than for a given original Hilbert space, the computational cost in the tensor representation for is similar to that of due to this dimension reduction. Similar to the case of the , \left(\textrm{\@text@baccent{\hat{O}}}\right)_{u} has a direct product structure given by Eq. (38). We first specify the components of the block quantities at a given spin orbital
[TABLE]
where indicates the symmetric part of the matrix, which is defined as
[TABLE]
which is useful given that u^{T}\left(\textrm{\@text@baccent{\hat{O}}}\right)_{u}u=u^{T}\left(\textrm{\@text@baccent{\hat{O}}}\right)_{u,S}u. Similarly, the -representation can be defined using , where with
[TABLE]
such that \left(\textrm{\@text@baccent{\hat{1}}}\right)_{w;\ell} is the identity matrix and \left(\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger\left(1\right)}\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\left(1\right)}\right)_{w;\ell,S} is diagonal, resulting in
[TABLE]
and correspondingly, we have
[TABLE]
Finally, we provide explicit expressions for \left(\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger\left(3\right)}\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\left(3\right)}\right)_{w;\ell}, with the components given by
[TABLE]
which can be used to compute the local interaction energy.
VI.2.2 Block structure of the integer time Dyson equation
Here we derive various useful equations using the block form of the integer time Dyson equation. To make our discussion general, we assume that the integer time Green’s functions are not diagonal in the orbital index, resulting in the following block matrix equation
[TABLE]
where the blocks for and can be explicitly expressed in terms of the blocks of and using the inverse formula for a block matrix, resulting in the following relations
[TABLE]
Using Eqns. (318), (319), (320), and (321), we obtain
[TABLE]
Similarly, using Eqns. (318), (319), and (322), we obtain
[TABLE]
Using Eqns. (326), (327), (324), and (325), we verify that and can be used to determine as
[TABLE]
Using Eqns. (327) and (328), we have
[TABLE]
Finally, using Eq. (328), we have
[TABLE]
which can be used to solve for . Additionally, it is useful to explicitly write expressions for , , and as
[TABLE]
VI.3 Derivation of the qubit energy form
In this section, we derive the qubit energy form, given in Eq. (62), in several steps. First, in Section VI.3.1, we introduce a polar representation for the block. Second, in Section VI.3.2, we introduce as a unitary transformation of , and we solve the self-consistency condition for the block using , , and . Third, in Section VI.3.3, we resolve the self-consistency of the , , and blocks using the extra information provided by and .
VI.3.1 Polar Representation of the A block
We first review some mathematical properties of the matrix group with following form
[TABLE]
where and and the group multiplication is
[TABLE]
There is an isomorphism for this matrix group to with , with the group product taken as
[TABLE]
Interpreting as the integer time self-energy, the corresponding integer time Green’s function is given as
[TABLE]
and it can be seen that
[TABLE]
It is also useful to express and in terms of and as
[TABLE]
We now use the preceding results to study the block of the integer time Green’s function. Given that and have the form given by Eq. (337), the and also have the form of Eq. (334), and therefore also has the form of Eq. (334). We can then use the polar representation to express the following quantities
[TABLE]
allowing for the integer time Dyson equation to be recast as
[TABLE]
which yields Eq. (77). Using , , and in Eqns. (340) and (341), we obtain Eqns. (75) and (76). Using , , , and , we obtain
[TABLE]
which yields Eq. (78) by assuming . Using Eq. (344), we can rewrite Eq. (275) in terms of , and as
[TABLE]
The matrix elements in representation are given by
[TABLE]
VI.3.2 Resolving the self-consistency in the -block
A key ingredient of the qubit parametrization is the introduction of the qubit representation, defined by
[TABLE]
where is a unitary matrix defined as and
[TABLE]
where is a parameter that will be determined by demanding that yields the physical density. This should be contrasted to the representation, where the local density of is generally different from the physical density.
We begin by deriving Eq. (73), which determines for a given , and . Using Eq. (362), Eqns. (356)-(361) are transformed to
[TABLE]
where
[TABLE]
Using the self-consistency condition for the 1,1 and 1,2 entries, we have
[TABLE]
Eqns. (373) and (374) reduce to two linear equations in and as
[TABLE]
and and can be determined from Eqns. (88) and (89). Using Eqns. (371) and (372), we obtain
[TABLE]
Both and can be expressed in terms of as
[TABLE]
Substituting Eqns. (88) and (89) into Eq. (380) and using Eq. (383), we obtain a sixth order equation in which can be factored into the following form
[TABLE]
Notice that the first factor in Eq. (384) is positive, implying that the second factor is zero, and may be obtained as
[TABLE]
which yields Eq. (73). Notice that , yielding a constraint on as
[TABLE]
VI.3.3 Resolving the , , and blocks
We begin by deriving a subtle symmetry between and , described by Eq. (83) and Eq. (84), which can be rewritten as
[TABLE]
where the tilde is defined by the following rules. For a matrix , we have and . For a matrix , we have and . For a matrix , we have , , , and . In order to prove Eq. (387), we use Eq. (327) to obtain
[TABLE]
Given that (see Eq. (331)) and that commutes with (see Section VI.3.2), Eq. (387) can be proven by verifying that
[TABLE]
Using Eq. (331), we obtain
[TABLE]
To simplify these two equations, we introduce , , , and (defined in Eqns. (79)-(82)), which yields Eqns. (83) and (84). To compute , Eqns. (333), (326), and (329) can be used to obtain
[TABLE]
Introducing and , defined in Eqns. (85) and (86), we can simplify Eq. (397) to (87). The matrix elements of \left(\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\dagger\left(3\right)}\textrm{\@text@baccent{\hat{a}}}_{\ell}^{\left(3\right)}\right)_{w,\ell} can then be determined as
[TABLE]
Eqns. (362) and (364) may be used to evaluate
[TABLE]
and we define
[TABLE]
where , , and are given in Eqns. (90), (92), and (91). The derivations of Eqns. (71)-(92) are now complete.
VI.4 Examining the qubit energy form in special cases
In this section, we showcase the qubit energy form for the special case of half-filled orbitals. Additionally, we examine how the qubit energy form for recovers the qubit energy forms for and .
VI.4.1 The case of half-filled orbitals
In this section, we examine the case of half-filling with particle-hole symmetry where and . Using the general algorithm given in Eqs. (71)-(92), we provide corresponding results. Starting with and , we obtain
[TABLE]
and , , and
[TABLE]
and , , , and
[TABLE]
It is also useful to define
[TABLE]
yielding
[TABLE]
Eq. (428) can be plugged into Eq. (62) to obtain the total energy. This result will be applied to the multiorbital Hubbard model in Section VII.
VI.4.2 Recovering the qubit energy form for
Given that is a special case of , it is clear that the former can be obtained by constraining the latter. We previously demonstrated that restricting the momentum density distribution to be flat in each region and taking within will recover in the case where corresponds to a pure state Cheng2023035127 . Here we illustrate this fact using the qubit parametrization. We begin by enforcing , and Eq. (78) yields . Using Eq. (383) and , we obtain
[TABLE]
Using Eqns. (73) and (429), we obtain
[TABLE]
yielding and . Using the assumption of a flat momentum density distribution, given by and , which yields a quasi-particle weight of
[TABLE]
we obtain
[TABLE]
and , , , , , , , , and
[TABLE]
yielding , recovering the qubit energy form obtained from .
VI.4.3 Recovering the qubit energy form for
Given that is a special case of , it is clear that the former can be obtained by constraining the latter. However, it should be noted that in the qubit parametrization, in is assumed to correspond to a Slater determinant, but must be the identity to recover . Nonetheless, we demonstrate that the qubit energy form of can still be constrained to recover the qubit energy form of . The solution is to restrict to be a diagonal matrix, implying that , yielding , , , , , , , , , , , , , , , , f_{\ell,0}=n_{\ell}-\frac{A_{\ell}^{2}\text{\deltan}_{\ell}}{\xi_{\ell,0}^{2}}, , and . Finally, we have
[TABLE]
recovering the qubit energy form of . The preceding result demonstrates that when all orbitals have , implying that the system is a Mott insulator, the solution can be recovered by , demonstrating the power of in the Mott phase.
VII Applications: multiorbital Hubbard model at half filling with particle-hole
symmetry in
Here we showcase how to use the qubit energy form for to study the multiorbital Hubbard model in with density-density interactions. While the qubit energy form can be applied to arbitrary densities, here we study the case of half-filling where the local interaction energy takes a simple form. In Section VII.1, we explicitly evaluate the qubit energy form for half-filling and demonstrate how to analytically minimize over the momentum density distribution, yielding a final energy form which can straightforwardly be numerically minimized. In Section VII.2, we demonstrate that , defined in Eq. (67), is a key variable for understanding the Mott transition. For a special density-of-states, the minimization can be analytically performed, yielding an analytical relation between the ground state energy and the Hubbard via . In Section VII.3, we study the effect of the Hund coupling on the nature of the Mott transition in the multiorbital Hubbard model.
VII.1 Numerical minimization of the qubit trial energy
VII.1.1 Evaluating the qubit trial energy
In order to elucidate the results of the multiorbital Hubbard model, we first begin by considering the single orbital case with density of states having particle-hole symmetry. Due to spin symmetry, we omit the spin-orbital index in \xi_{\ell},$$\Delta_{\ell}, and . The qubit energy form for is given as (see Section VI.4.1 for derivation)
[TABLE]
where particle-hole symmetry implies , and in order to ensure that is real, must satisfy the following constraint
[TABLE]
The effective many-body density matrix can be encoded as where can be parametrized using a single parameter as
[TABLE]
This parametrization ensures that and can be determined as
[TABLE]
The double occupancy is given as
[TABLE]
and to have a real , the must be real which requires to satisfy the following constraint
[TABLE]
Finally, the trial energy for the single orbital case is a function only of and , given as
[TABLE]
We now proceed to the multiorbital case, with a local interaction given as
[TABLE]
where . The qubit trial energy is given as
[TABLE]
where is a pure state that is restricted to , particle-hole symmetry implies , and must satisfy the following constraint
[TABLE]
The expression for is given in Eq. (447). The most straightforward approach to satisfying Eq. (462) is to first choose , and then Eq. (462) becomes a linear constraint on (see Section III for further discussion).
VII.1.2 Numerical minimization of the qubit trial energy
We now proceed to minimizing the qubit trial energy. As before, we first focus on the single orbital model for clarity, and then consider the multiorbital case. The general numerical minimization has been described in Section III, which consists of two steps: First, the momentum density distribution is partially optimized under the constraint of , , , and , or through four Lagrange multiplier , , and . Second, one needs to minimize over the remaining independent variational parameters subjected to the inequality constraint given by Eq. (70). For the half-filled, particle-hole symmetric case, there are several simplifications. First, the optimized momentum density distribution now only depends on and , or just two Lagrange multipliers and , and the partially optimized momentum density distribution is given as
[TABLE]
where the Lagrange multipliers and are determined by and via inverting the following relation
[TABLE]
which yields and . In order to better appreciate the flexibility of , it is useful to examine the behavior for various choices of and (see Figure 1). For and , the distribution corresponds to a Fermi liquid, given that the discontinuity of at yields a quasi-particle weight of
[TABLE]
which will be between zero and one. For the special case where and with remaining finite, the distribution recovers a flat distribution, which is obtained for an optimized . Finally, when and , the system is in the Mott phase where .
The second simplification for half-filling and particle-hole symmetry is that the number of independent parameters is , given that and . If we have further symmetry between different spin orbitals, the number of independent parameters can further be reduced. For example, for the single-orbital case with spin symmetry, we have and , and therefore the number of independent parameters is . As discussed in Section III, there are two possible strategies. The first strategy starts from , which determines , and is specified via , which will have a range given by Eq. (457). Mathematically, this qubit trial energy is given by
[TABLE]
This strategy allows the energy to be explicitly written in terms of the variational parameters, and thus it is straightforward to compute the derivatives via automatic differentiation. For the multiorbital case, the constraint between and is given by Eq. (462), which is not straightforward to implement for a given . Therefore, it is useful to pursue a second strategy.
We begin by illustrating the second strategy in the single-orbital case. This strategy begins by specifying via , then the range of can be determined from the constraint given by Eq. (451), and then and can be specified, which determines via inverting Eq. (465), denoted as . Mathematically, the resulting qubit trial energy is given by
[TABLE]
For the multiorbital case, the qubit trial energy is given as
[TABLE]
where and are the multiorbital versions of and . The second strategy allows one to automatically implement all of the constraints in the multiorbital case, and the only downside is that must be numerically evaluated, though this is a trivial task.
VII.2 Understanding the Mott transition in the single orbital model
In the preceding section, the resulting qubit trial energy for the single orbital model at half-filling is either parametrized by , , and (i.e. Eq. (467)), or , and (i.e. Eq. (468)), which both allow for straightforward numerical minimization for a given and . In this section, we explore a different perspective based on the one body reduced density matrix viewpoint (see Ref. companion ), which provides a clearer understanding of the Mott transition. The first step begins with the parametrization of the qubit trial energy using , , and , where the local interaction energy is , where is defined in Eq. (455). It should be noted that has the form , and therefore we can optimize with respect to for fixed and , yielding an optimized as a function of , denoted as . Therefore, the interaction energy is purely a function of and , which can be viewed as a one body reduced density matrix functional, yielding a simple picture of the Mott transition. Two key points should be appreciated. First, has a critical value of , denoted , and for , which corresponds to the Mott insulator. The second point is that is independent of . While the total energy can be obtained by numerically minimizing over and , further insight can be obtained by expressing and the total energy as functions of .
VII.2.1 Qubit trial energy in terms of and
As outlined in Section VII.1.2, the qubit trial energy can be parametrized as
[TABLE]
For simplicity, we study the case where . We now consider how to minimize the total energy over , which amounts to minimizing over . In order to visualize the minimization, we plot as a function of for a given and . Given that will not influence the minimization over , we choose from a flat momentum density distribution (see Figure 2). For a given curve, there are four distinct sets of points denoted as , , , and , where the points provide the optimized values for , the and points provide the values for and , respectively, and a point provides the maximum value for given by in Eq. (458). For small values of , the optimized value of is nonzero, and monotonically decreases with increasing . For larger than some critical value, the optimized value for is zero. Having obtained a graphical understanding of this function, we proceed to mathematically minimize the over for a given and , which is a constrained minimization given that . Solving yields
[TABLE]
When , then yields the optimized value for , and otherwise the optimized value is given by the minimum value for the boundary points. It is useful to solve , yielding
[TABLE]
Therefore, the optimized value for is
[TABLE]
having two distinct regimes as a function of (see inset of Figure 2). Finally, the physical double occupancy can be written as a function of and as
[TABLE]
Therefore, the total trial energy can be written purely in terms of and , given as
[TABLE]
In order to find the ground state for a given and , it is necessary to minimize over and , which cannot be performed analytically in general. However, it is important to appreciate that the optimized value of is sufficient to determine if the system is in the Mott phase. For , the ansatz corresponds to the Hartree-Fock wave function, while the maximum value of corresponds to a collection of isolated atoms, and the Mott transition occurs when , before the system becomes a collection of atoms. We now verify that the system is indeed a Mott insulator for . First, the local interaction energy is independent of for (see Eq. (476)), dictating that for the optimized . Second, Eq. (466) dictates that the quasiparticle weight is zero when , implying a Mott insulating state. Alternatively, when , the ansatz describes a metallic phase.
Following the results of Sections VI.4.2 and VI.4.3, we illustrate how the qubit energy form for and can be recovered from when properly restricting the variational parameters. We begin with , which can be viewed as a continuation of the Mott phase for , where and is given by (see Figure 2). Alternatively, the is characterized by a flat momentum density distribution determined by where and are identical, given as
[TABLE]
It is well known that for the Mott transition occurs when the system becomes a collection of isolated atoms, meaning that the transition happens for .
VII.2.2 Solution for the two-peak density of states
In Section VII.2.1, we demonstrated that the interaction energy can be written analytically in terms of and , but the kinetic energy must be numerically determined in terms of and . While the latter is a trivial numerical problem, it still precludes a completely analytic solution for the energy in terms of . Therefore, we introduce the two-peak density of states, where the kinetic energy can be analytically evaluated in terms of and , allowing for an analytical relation between the total energy and the Hubbard in all three ansatz, where is the non-interacting kinetic energy for the lattice. Specifically, the two-peak density of states is given by
[TABLE]
While this density of states is somewhat unphysical, the results retain all of the qualitative features of the Bethe lattice, in addition to being quantitatively very similar assuming the same (see Figure 3). As expected, the solution is identical for both density of states, given that only depends on , independent of the details of . Interestingly, has a very similar critical value of for both density of states, with small differences differences in the double occupancy for a given .
We now proceed to analytically evaluate the total trial energy as a function of for the two-peak model. Using Eqns. (480), (449), and (448), the kinetic energy and are determined as and , yielding the total trial energy in terms of the single variational parameter , given as
[TABLE]
where
[TABLE]
and is defined in Eq. (97). The is plotted in Figure 4. While for a given the energy cannot be analytically minimized over , it is straightforward to find for a given value of that satisfies , given as
[TABLE]
and is plotted as a function of in Figure 4. The critical value of and double occupancy at the Mott transition, where , are given as
[TABLE]
These values are the same from both the metallic and insulating sides of the Mott transition, which can be seen in Figure 4, confirming that the transition is continuous.
Corresponding equations for and as functions of can be obtained for and by substituting the corresponding relations into Eq. (483) (see Figure 4 for plots). For , we have
[TABLE]
which recovers the Gutzwiller approximation, and the Mott transition occurs at , where and . For , there are some subtleties to consider. For a given , there are several candidate values of : the value given by the saddle point Eq. (483), which yields
[TABLE]
and the boundary values of and . The total energy must be used to evaluate these candidate values of and select the global minimum. It should be noted that recovers the Hartree-Fock solution. There are two critical points to consider: the local stability of and a transition from a saddle point solution to the Hartree-Fock solution. The local stability of the is determined by the minimal value of in Eq. 489, which is given by
[TABLE]
For any , there exists a locally stable solution. However, one should compare the energy of the saddle-point solution to the Hartree-Fock solution, which yields another critical value of . For , the saddle point solution is the global minimum, while for the Hartree-Fock solution is the global minimum. We summarize all the critical points for , , and in Table 1.
VII.2.3 Solution for a general density of states
In Section VII.2.2, we demonstrated that the qubit trial energy can be written solely in terms of for the case of a two-peak density of states. Here we extend this strategy for a general density of states, allowing for the optimized value of to be evaluated as a function of , which is completely independent of . Therefore, this approach is particularly useful for analyzing the Mott transition. We begin by rewriting the qubit trial energy from Eq. (478) using the result of Eq. (482), yielding
[TABLE]
where is given in Eq. (96). We proceed by constructing the saddle point equations of and for a given , yielding the following two equations
[TABLE]
where and are the Lagrange multipliers from Eq. (463). A practical approach for solving the two preceding equations for a given is to express and in terms of and , denoted as and , and then solve for and . However, we take an alternative approach, as our goal is to determine the optimized value of for a given . Therefore, we proceed by dividing Eq. (494) by Eq. (495), which yields
[TABLE]
Moreover, and are required to yield the given value of , such that
[TABLE]
Simultaneously solving Eqns. (496) and (497) yields and as functions of , and therefore all quantities that depend on , including and , are now functions purely of . Subsequently, the double occupancy and can be determined as a function of as
[TABLE]
Alternatively, the can also be expressed as
[TABLE]
which will recover Eq. (483) when evaluating the two peak model. For the case of the Bethe lattice, we plot , , and as functions of in Figure 5, and the critical values of all quantities are listed in Table 2.
VII.3 Understanding the effect of Hund’s coupling in the multiorbital Hubbard
model
Here we generalize the treatment from Section VII.2 to the multiorbital case including the Hund coupling . The key difference is that in the multiorbital case, one cannot analytically minimize over the local variational parameters, though these parameters can easily be numerically minimized as a function of for a given . The remaining procedure closely follows the single orbital case.
Consider the multiorbital Hubbard model defined in Eq. (94). The qubit trial energy can be written as
[TABLE]
where is defined in Eq. (447), , and
[TABLE]
where are defined in Eq. (94). Having defined the qubit trial energy, we proceed to obtain the solutions for the two-peak density of states and the Bethe lattice using the parametrization. We first rewrite the local interaction energy as
[TABLE]
where
[TABLE]
which is obtained from . Given the form of Eq. (505), the optimized will only depend on , motivating the definition of the following function
[TABLE]
A convenient way to generate this function is to perform a two stage minimization. First, we perform a constrained minimization with the restriction on . Second, we minimize the expression over . In the first stage, given that is fixed, we only need to minimize , which can be mathematically expressed as
[TABLE]
To efficiently generate , we can introduce a Lagrange multiplier and determine the ground state for , yielding the optimal and corresponding and for a given . One can then perform such calculations over a grid of , and then spline the relationship between and . A plot of is provided in Figure 2 of Ref. companion . Finally, the partially optimized local energy can be written as
[TABLE]
In order to visualize and as a function of , we plot these quantities for and (see Figure 6). For , one can clearly observe that continuously goes to zero, while there is a discontinuity for . For , discontinuously goes to zero for and continuously goes to zero for . It should be noted that can be applied to solve an arbitrary particle-hole symmetric , and therefore captures the essence of the Mott transition for a given .
VII.3.1 Understanding the non-analytic behavior of
via a Taylor series
In Section VII.2, we demonstrated that there is non-analyticity in at . Here we provide a Taylor series analysis to explain how the non-analyticity emerges. Equation (510) indicates that is the minimum of within the range , and it is convenient to study the quantity
[TABLE]
Finding the minimum of will yield the minimum of given that and .
We begin by Taylor series expanding to sixth order in about , and the second order coefficient in is expanded in about such that it is zero for , yielding
[TABLE]
where , , , and are constants for a given and . Given that the optimized goes to zero with increasing (see Figure 6), this requires . Furthermore, we take , though there will be cases where is negative and a higher order expansion is necessary. Therefore, we only need to understand how the sign of influences the non-analyticity in . For , when , the minimum of is given by with , while for , the minimum of is obtained with
[TABLE]
Notice that continuously increases from [math] when decreases from . Therefore the critical value of is given by . Moreover, using
[TABLE]
we find that is continuous and only has a kink at . For , when there is a local minimum at , while for there is a local minimum given by Eq. (513). The two saddle points need to be compared to obtain the global minimum, which yields Therefore, jumps from zero to a finite value when decreases from , implying that is discontinuous at . It should be noted that for , is exact, while for , the expression is an approximation, and in this case one should use the exact form of to determine if precision is needed.
We now proceed to analytically compute the expansion for two cases: with , and with . We begin by expanding
[TABLE]
where
[TABLE]
It should be noted that , , and all monotonically decrease with increasing for . It is straightforward to show that when and when and when . The remaining task is to compute , which we separately consider for the two aforementioned cases.
For and , symmetry can be used to parametrize with , where is a variational parameter with and counts the number of electrons in state . It is convenient to reparametrize as
[TABLE]
such that , where is the Pochhammer symbol. When taking an expectation value of an operator in the qubit space, it is convenient to use a matrix representation , where and take values from , such that
[TABLE]
The non-zero entries of the effective matrices for and are given as
[TABLE]
[TABLE]
Perturbation theory can then be used to obtain
[TABLE]
where
[TABLE]
The expansion coefficients of are obtained as
[TABLE]
Given that , we have and has a kink at .
We now discuss the case of with . Using perturbation theory, we find
[TABLE]
where
[TABLE]
where . Therefore, we can compute the coefficients in the expansion of as
[TABLE]
We can see that , and therefore , indicating that there is a discontinuity in for .
Finally, we numerically explore the cases of , where and are identical to the case of . To be concrete, we consider , and we numerically compute the expansion coefficients by fitting to a sixth order polynomial. For , we have and . Notice that in this case , and thus an expansion beyond sixth order is necessary, though plotting indicates that , yielding a discontinuity in for (not shown). For , and . For , and . For , and . For , and . It should be noted that for , we have and has a kink at .
VII.3.2 Solution for the two-peak density of states
We now consider the two peak density of states
[TABLE]
where is the total non-interacting kinetic energy per site. For the two-peak density of states, , and the qubit trial energy can be written purely in terms of as
[TABLE]
Following the single orbital case, can be used to determine from which yields
[TABLE]
thus providing a succinct solution parametrized by . The relation can be used to determine the nature of the Mott transition from , given that this quantity will allow any observable to be expressed in terms of . We plot versus for various and , which demonstrates three types of non-analytical scenarios for (see Fig. 7, panel ). First, for all , the is continuous with a positive slope and has a kink at . Second, for and small , the is discontinuous at with a negative slope for . Third, for and , the is continuous and has a kink at , and the slope is negative for . The can now be used to determine , which yields the order of the Mott transition (see Fig. 7, panel ). First, for all , increases monotonically and continuously with , with a kink at , and therefore there are no metastable regions and the Mott transition is continuous. Second, for , there is an unstable region in the metal phase where , and the total energy can be used to determine the transition between the metal and insulating phase, corresponding to a horizontal line, and the transition is first-order.
In summary, we have demonstrated that the nature of the Mott transition for the two peak density of states is determined purely by , and below we demonstrate that the Bethe lattice has the same behavior. Therefore, it appears that is the essence of what determines the nature of the Mott transition in .
VII.3.3 Solution for a general density of states
We now execute a similar strategy for a general density of states. Using Eq. (509), the qubit trial energy can be written as
[TABLE]
The saddle point equations are given as
[TABLE]
Given that and are functions of and , one can solve and from Eqns. (544) and (545) for a given , and then determine all physical quantities. We now demonstrate that indicates that the system is in the Mott phase. Given that the quasiparticle weight is given as (see Eq. 466), and that for finite (see Eq. 464), the only scenario where is when . Therefore, when , the system is metallic, while the system is insulating. For a given , one must minimize over in order to determine nature of the ground state.
An alternate approach is to parametrize the solution in terms of . Equations (544) and (545), in addition to the constraint on and for a given , yield
[TABLE]
For a given , and can be determined from Eqns. (546) and (547), can be determined from , and can be determined using Eq. (545) as
[TABLE]
allowing for the evaluation of the total energy.
We now consider the Bethe lattice in for . Recall that for a given , the yields a which divides the metallic and insulating states, where indicates an insulating phase. For the case of a continuous transition, the will be determined by , while for a first-order transition, one must explicitly determine the where the insulating and metallic states cross in energy. The algorithm is executed by evaluating , , the interaction energy, and the total energy as functions of . We begin by plotting the as a function of (see Figure 8, panel ). For , is a monotonic and continuous function of , implying a continuous phase transition at , which can be identified as a kink. Alternatively, for , the is not a monotonic nor a continuous function of , implying that the there are regions of phase coexistence and unstable regions. The metallic curve exists for , and the solution is only stable for , where is determined from , and therefore the metallic solution is only stable for . The insulating curve exists for without any unstable regions, and therefore the insulating phase exists for . Given that , there exists a region of coexistence for the metallic and insulating solutions, and the total energy dictates the lowest energy solution. We now proceed to present the total energy and the interaction energy as functions of for various and (see Figure 9), and the results are very similar to the two-peak case. Consistent with previous Gutzwiller Bunemann19974011 , slave boson Hasegawa19971391 , and DMFT Ono2003035119 studies, the Mott transition is continuous for and first-order for .
Finally, we evaluate for , which is explicitly given by , where is determined from and , where and are defined in Eqns. (465) and (464), respectively. The critical values for the Bethe lattice are listed in Table 3. For the case of large , we have
[TABLE]
consistent with our numerical results in Ref. Cheng2023035127 . For a systematic exploration of how depends on and , see Fig. 5 of Ref. companion .
VIII Summary and Conclusions
We begin by providing a high level overview of the variational discrete action theory (VDAT), such that the developments of the present work can be properly understood. VDAT is a variational approach to the many-body body problem that consists of two main components: a variational ansatz for the many-body wave function or density-matrix, known as the sequential product density matrix (SPD), and a formalism for evaluating expectation values under the SPD, known as the discrete action theory Cheng2021195138 ; Cheng2021206402 . The SPD has a natural mechanism to trade off between efficiency and accuracy, where the integer monotonically increases the variational power of the SPD and guarantees the ability to recover the ground state solution for . Moreover, there are two distinct types of SPD which satisfy the properties of a many-body density matrix, denoted as G-type and B-type. The G-type , , and SPD encapsulate the Hartree-Fock wave function, the Gutzwiller wave function, and the Gutzwiller-Baeriswyl wave function, respectively. The key breakthrough using VDAT was the demonstration that the SPD can be exactly evaluated for multiorbital Hubbard models in . We demonstrated that the G-type SPD accurately solves the Anderson impurity model on a ring Cheng2021206402 , the single band Hubbard model over all parameter space Cheng2021206402 , the two orbital Hubbard model including a crystal field and the full rotationally invariant Hund’s coupling Cheng2022205129 , and the Hubbard model for Cheng2023035127 . Moreover, we demonstrated that the computational cost of solving a G-type SPD is comparable to a G-type SPD, meaning that VDAT can provide a sufficiently accurate solution at a cost not far beyond the Gutzwiller approximation. The success of VDAT at motivated a search for the best possible algorithm for executing calculations using a G-type SPD Cheng2023035127 , which is essential for detailed exploration of the multiorbital Hubbard model and merging VDAT with realistic electronic structure methods.
The VDAT algorithm in consists of two steps: the exact evaluation of the SPD via the self-consistent canonical discrete action theory (SCDA) and the optimization of the energy with respect to the variational parameters. The SCDA requires the numerical solution of a set of self-consistency conditions, and therefore can be inconvenient when minimizing over the variational parameters. For the case of a G-type SPD with certain restrictions (see Sections I, VI.1, and VI.2 for further details), the SCDA self-consistency condition can be automatically satisfied, which we refer to as the gauge constrained SCDA algorithm Cheng2023035127 . In the present work, we introduce the so-called qubit parametrization of the gauge constrained SCDA algorithm, which is mathematically equivalent to the original gauge constrained SCDA algorithm. The qubit parametrization offers several key improvements. The qubit parametrization analytically resolves some constraints over the variational parameters, thus reducing the number of variational parameters by one per spin orbital. Additionally, the variational parameters are physically intuitive and facilitate a deeper understanding of how the SPD captures Mott and Hund physics. Therefore, the qubit parametrization achieves the long sought goal of resolving the shortcomings of the Gutzwiller approximation while maintaining the computational simplicity and physical appeal.
The variational parameters of the qubit parametrization consist of the momentum density distribution, the non-interacting reference momentum density distribution, and the pure state of a qubit system with a dimension of the local Hilbert space. The qubit system naturally arises from reparametrizing the variational parameters of the interacting projector, and the renormalized correlations within the qubit space yield the physical local correlations. The variational parameters are restricted by two constraints per spin orbital, requiring that the local density computed from the momentum density distribution is the same as that computed from the non-interacting reference momentum density distribution, and the same as the density computed from the qubit system. The qubit trial energy has a very intuitive form: the kinetic energy is determined by the momentum density distributions, while the local interaction energy is the expectation value of an effective Hamiltonian within the qubit system. Interestingly, the effective Hamiltonian has the same form as the local interacting Hamiltonian where the local density operator is substituted by an effective density operator of the qubit system. The effective density operator for a given spin-orbital depends on five parameters: the density , the magnetization in the direction for the -th qubit, denoted , and three quantities determined from the momentum density distribution, denoted as , , and . The quantity characterizes the number of electrons promoted across the reference Fermi surface, while and characterize the momentum density distribution below and above the reference Fermi surface, respectively. The quantity is an important variable which differentiates between a zero and non-zero quasiparticle weight when all variables are fully optimized, where indicates zero quasiparticle weight for spin-orbital . The main computational cost for evaluating the ansatz is dictated by computing expectation values within the qubit system.
While evaluating the trial qubit energy ansatz is a straightforward task, optimizing over all the variational parameters remains nontrivial. In general, the qubit trial energy can be partially optimized over the momentum density distribution by introducing four Lagrange multipliers per spin-orbital, replacing the continuous momentum density distribution with four variables. For a system with spin-orbitals per site, there remain variational parameters which must be optimized in general. However, this number may be greatly reduced by symmetry, and it is likely possible to compress these variables into a smaller number of parameters without a serious loss of fidelity. For the special case of half-filled orbitals with particle hole symmetry, we demonstrate that one can efficiently minimize over all variational parameters, with a computational cost proportional to computing the ground state of a Hamiltonian defined within the qubit system.
In order to demonstrate the power of the qubit parametrization, we studied the ground state properties of the multiorbital Hubbard model at half-filling with particle-hole symmetry for various and . For a given , the majority of the energy minimization can be encapsulated into the computation of a single variable function , which can then be used to obtain the solution at a negligible cost for an arbitrary and density-of-states. The entire function is evaluated by solving a collection of qubit systems, which has a relatively small computational cost. For example, for , the ground state for a given qubit system can be solved in several seconds on a typical single desktop computer core, and taking on the order of 100 samples, the entire function can be accurately obtained on the order of hundreds of seconds. The extreme computational efficiency of the qubit parametrization in this case allows one to easily map out all of parameter space, which is not possible with DMFT given the lack of efficient impurity solvers for the zero temperature multiorbital problem. We find that for , the Mott transition is continuous, while it is first-order for , consistent with previous Gutzwiller Bunemann19974011 , slave boson Hasegawa19971391 , and DMFT Ono2003035119 studies.
While the key result of this paper is formulating the qubit parametrization of the gauge constrained SCDA algorithm for a G-type SPD, we also demonstrate that the qubit parametrization can be applied to the G-type and B-type SPD. Moreover, we demonstrate that properly restricting the variational parameters of the qubit trial energy for the G-type SPD can recover the corresponding qubit trial energy for the G-type and B-type SPD. Interestingly, the qubit trial energy for the G-type SPD has an identical form to the slave spin mean-field theory (see Appendix B), and thus the qubit trial energy may provide insights for proceeding beyond mean-field theory in the slave spin formalism.
The qubit parametrization of the gauge constrained SCDA algorithm at is likely the optimal form when evaluating an SPD with a kinetic projector that is diagonal in both the momentum and spin-orbital indices and an interacting projector that consists of diagonal Hubbard operators. For the Hamiltonians treated in the present study, which have density-density interactions and hopping parameters that are diagonal in the spin-orbital index, the aforementioned restrictions on the SPD do not limit the variational power. When solving a general Hamiltonian which includes the full rotationally invariant form of the Hund exchange or non-diagonal hopping terms, the qubit parametrization can still be applied and it will still yield an upper bound on the energy in , but it will not contain the full variational power of . Ongoing research is addressing how to generalize the qubit parametrization to handle an arbitrary G-type SPD, with aspirations of completely superseding our general decoupled minimization algorithm for Cheng2022205129 .
IX Acknowledgments
This work was a supported by a RISE-LDRD grant from Columbia University and Brookhaven National Laboratory. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.
Appendix A One-body reduced density matrix functional for the Hubbard model
An important application of the qubit energy form for is the construction of a one body reduced density matrix functional (1RDMF) for the multi-orbital Hubbard model, which is the focus of our companion manuscript companion . Here we derive the corresponding results for and . Additionally, we evaluate existing 1RDMF’s from the literature for the single band Hubbard model at half-filling in .
A.1 The 1RDMF’s for the multi-orbital Hubbard model from
and
We begin by presenting the 1RDMF from using the qubit parametrization. The interaction energy is given as
[TABLE]
where , the effective density operator is with and , and both and are diagonal in the Pauli-Z basis of the qubit system. For the case of half-filling, the minimization yields
[TABLE]
where .
We now present the 1RDMF for . Unlike or , the variational parameters do not explicitly contain but instead , where . Therefore, for a given , we have , where , where and are the maximum and minimum values of for a given . It should be noted that the gauge symmetry can be used to restrict . The interaction energy can then be written as
[TABLE]
where . Notice that the interaction energy will decrease with decreasing , and therefore the restriction in Eq. (555) can be replaced with . For the case of half-filling, we have
[TABLE]
where is defined in Eq. (508).
Using the above interaction energy functionals will yield results identical to the corresponding VDAT results, such as the ones provided in Fig. 3.
A.2 Results using published 1RDMF’s for the single-orbital Hubbard model
in
We are not aware of any applications of existing 1RDMF’s to the Hubbard model in , though there are various studies of Hubbard clusters Kamil2016085141 ; Mitxelena2017425602 and Lopezsandoval20001764 ; Mitxelena20201701 and Hubbard models Saubanere2016045102 ; Mitxelena2020064108 . It is important to emphasize that is the most relevant test of local electronic correlations, and is emblematic of typical three dimensional strongly correlated materials in that hosts a standard Fermi liquid in the metal phase and exhibits a Mott transition at a finite value of . While is exactly solvable via the Bethe ansatz, the Mott transition occurs at an infinitesimal Lieb19681445 . Alternatively, is a testbed for the most advanced and expensive computational approaches, and the properties at half-filling and low temperatures are still actively studied Tanaka2019205133 ; Schafer2021011058 ; Chatzieleftheriou2024236504 ; Geng2025115143 . While it is still very interesting to compare total energies from 1RDMF’s in and , if the goal is to determine whether or not a 1RDMF can describe the Mott transition, then it is most critical to first benchmark in .
In this section, we discuss how to use various 1RDMF’s to solve the one band Hubbard model at half-filling, including the MBB Muller1984446 , CA Csanyi20007348 , CGA Csanyi2002032510 , power Sharma2008201103 , PNOF5 Piris2011164102 , PNOF7 Piris2017063002 , and dimer Saubanere2016045102 functionals. The interaction energy is given by , where is the double occupancy and is a functional of . If the interaction Hamiltonian is rewritten as , the interaction energy for the MBB, CA, CGA, and power functionals can be viewed as a modification of the Hartree-Fock (HF) energy where the Fock term is altered Mitxelena2017425602 . We begin by writing the local interaction as , which can be written in momentum space as
[TABLE]
where is the number of -points. The MBB, CA, CGA, and power functionals are all defined using the following approximation
[TABLE]
where is defined through . The and are given for each functional in Table 4, and our values of are identical to those provided in Ref. Mitxelena2017425602 . It is worth noting that the MBB, CA, and CGA recover the atomic limit where when , while the power functional does not if .
Now we discuss how to minimize the total energy over . Given that MBB is a special case of the power functional, we first consider the power functional. Given that the interaction energy for the power functional is fixed for a given and , the kinetic energy can be minimized by introducing two Lagrange multipliers and , yielding a target functional to be minimized, and the partially optimized are obtained as
[TABLE]
The ground state energy for a given is then obtained by optimizing the total energy over and .
We now consider the CA and CGA, which can be viewed in a unified way by writing the interaction energy functional as
[TABLE]
where corresponds to CA and corresponds to CGA. We can similarly introduce two Lagrange multipliers, yielding a target functional
[TABLE]
where and the partially optimized value of is given as
[TABLE]
For a given , one can then optimize the Lagrange multipliers to obtain the ground-state energy and other observables. Plots of the double occupancy as a function of are provided in Ref. companion .
An important drawback of the MBB, power, CA, and CGA functionals is that the anti-symmetry of the two-body reduced density matrix (2RDM) is violated when deviates from the HF value Mitxelena2017425602 . The Piris natural orbital functionals (PNOF) Piris2013620 resolve this issue by approximating the form of the 2RDM as Mitxelena2017425602
[TABLE]
where and are explicitly defined in PNOF5 and PNOF7. In the previous calculations Mitxelena2017425602 ; Piris20131298 , it was found that the optimized natural orbital often breaks symmetry in order to further minimize the energy, and translational symmetry breaking in the Hubbard model allows PNOF5 and PNOF7 to recover the correct atomic limit Mitxelena2017425602 . In our context, we require translational symmetry to be respected, so we will examine PNOF5 and PNOF7 in this case. PNOF5 only includes intra-pair correlation (i.e. when belong to different pairs), and therefore the deviation of the summation in Eq. (557) from the corresponding Hartree-Fock value scales like in the thermodynamic limit while there is a prefactor of , and thus PNOF5 will yield the same energy form as HF in the thermodynamic limit. PNOF7 accounts for interpair correlation, which will provide a thermodynamic contribution, and for half-filling we find , which is similar to the CA but with different prefactor for .
Finally, we discuss the interaction energy of the dimer functional at half-filling, given as
[TABLE]
where and is the non-interacting kinetic energy. Minimizing the total energy over yields the double occupancy as a function of as
[TABLE]
Appendix B Equivalence between the Qubit energy form for
and the Slave Spin Mean Field Theory
Here we prove that the slave spin mean-field theory (SSMF) is identical to the qubit energy form for . We begin by describing the SSMF using the conventions in our work, as there are trivial differences. We associate the spin state with the fermionic state , while the standard SSMF associates with . To restore the enlarged Hilbert space of the SSMF back to the physical space, we require that holds for every local site and spin-orbital index . Then, the local Hamiltonian at site can be written as , while the hopping term can be written as , where , , is an arbitrary complex number, and is a fermionic annihilation operator. While the preceding transformations are exact for any , an appropriate choice of is indeed critical when making mean-field approximations. In Ref. De'medici2005205124 , which only addressed half-filling, the and , while later work generalized this result Hassan2010035106 , choosing such that the non-interacting limit can always be correctly recovered within the mean field approximation. We follow the latter choice. Finally, the Hamiltonian in the slave spin representation is given as
[TABLE]
with the operator constraint for every site and spin-orbital. There are two ways to derive the SSMF. The first approach introduces a mean-field decoupling for the hopping term and treats the operator constraint in the mean field level, yielding decoupled Hamiltonians for electrons and spins that must be solved self-consistently. The second approach Georgescu2017165135 ; Maurya2021425603 ; Maurya2022055602 ; Crispino2023155149 uses the variational principle, assuming a trial wave-function which is a direct product of a fermionic part and a spin part . Furthermore, the is assumed to be Slater determinant and the is assumed to be a direct product in real space, such that . Therefore, the trial energy of Eq. (567) is given as
[TABLE]
subject to the constraint
[TABLE]
Given translational symmetry and the correspondence and , we can prove that trial energy in Eq. (568) is identical to the qubit trial energy for given in Eq. (50) and the constraint in Eq. (569) is identical to Eq. (54). The key step is to evaluate
[TABLE]
where gauge symmetry allows . To ensure in the non-interacting limit, we must choose such that , where is the value for in the non-interacting limit. Plugging into Eq. (570), we have , completing the proof.
The preceding proof demonstrates the equivalence of the qubit energy form for and the trial energy for the SSMF. Now we demonstrate how to obtain a Hamiltonian form for the SSMF using the saddle point equations for the trial energy. The total energy under a fixed can be minimized by introducing Lagrange multipliers for the electron and spin systems, yielding
[TABLE]
Taking the derivative with respect to yields the mean-field Hamiltonian for the spin system as
[TABLE]
where , , , is a constant that does not influence the results, and we assume . Now consider the derivative of the electron components as
[TABLE]
where , which can be connected to a non-interacting fermionic Hamiltonian as
[TABLE]
The ground states for Eqns. (572) and (574) will yield an updated and , yielding new Hamiltonians for Eqns. (572) and (574), and this procedure is iterated until self-consistency is achieved.
Appendix C The Central Point Expansion (CPE)
The central point expansion (CPE) can be viewed as an approach to evaluate both a G-type and B-type SPD at by expanding about a reference SPD referred to as the central point. The CPE can be applied in arbitrary dimensions, and it is formally exact if all orders are summed, though it has only ever been applied at first-order. Interestingly, for the G-type SPD, the first-order CPE yields the same result as the Gutzwiller approximation (GA) and the SCDA in any dimension, and the derivation offers an alternative perspective from the GA and the SCDA. For the B-type SPD, the first-order CPE yields the same result as the SCDA in , while providing a different approximation in finite dimensions. The CPE was originally developed in the context of the off-shell effective energy theory (OET) Cheng2020081105 , where the CPE was renormalized using both weak coupling and strong coupling perturbation theory to ensure the correct limiting behavior, yielding excellent results for the Hubbard model in .
C.1 The CPE for
In this section, we use the first-order CPE to evaluate , which will be shown to be equivalent to the GA and the SCDA. The first-order CPE can be motivated by the fact that the GA relation between and is a linear form (see Eq. (111)) given by
[TABLE]
which is valid for an arbitrary with a constraint , where is the number of sites in the lattice. This remarkably simple relation motivates the use of as the expansion parameters, where can be determined using a with slightly deviating from the uniform distribution at first order. Alternatively, the relation between and is highly nonlinear, suggesting that a first-order approximation in cannot be applied to the Gutzwiller wavefunction.
The CPE begins by choosing an expansion point for as , where and is chosen to reproduce the non-interacting local density through . Considering a kinetic projector that deviates slightly from as , where , we compute the response of the expectation value to to the first order about the central point, given as
[TABLE]
where we have utilized that fact that commutes with Cheng2020081105 , , and the notation is defined as
[TABLE]
The response coefficient of to at a general can be conveniently expressed via the integer time correlation function in the compound space as
[TABLE]
While Eq. (580) is difficult to evaluate in general, the CPE circumvents this problem by observing that Eq. (580) becomes trivial to evaluate at the central point given that and are direct product states in real space. Though it is not yet clear, it will prove critical to exploit the gauge symmetry of the SPD to restrict such that . The response for the momentum density distribution to can be obtained as
[TABLE]
where the correlation function can be computed by transforming to real space as
[TABLE]
Utilizing the fact that , where is the local reduced density matrix for , then is only non-zero in the following cases: (1) when , and , the value is ; (2) when , , , and , the value is , where ; (3) when and , the value is , where ; (4) when , and , the value is . Combining these four cases, we have
[TABLE]
where is defined as
[TABLE]
Therefore, the response coefficient of to is given as
[TABLE]
which consists of two contributions: (1) the coherent contribution , which reflects the hopping renormalization captured in the GA, and (2) the incoherent contribution , which has no contribution to the momentum density distribution given that the constraint requires
[TABLE]
When applying Eq. (585) with such that , the response for the bare momentum density distribution to is given by
[TABLE]
The next stage is to find the relation between and . We first compute the response of and for a given with the constraint given by Eq. (586), and then solve from and express in terms of , given as
[TABLE]
The constraint given by Eq. (586) for naturally yields the following relation
[TABLE]
which is the constraint imposed in the GA.
Finally, to connect Eq. (588) with Eq. (107), we need to show , which can be accomplished by proving that and have the same local reduced density matrix. To prove this, we need to consider the expectation of a given Hubbard operator under , which can be accomplished by considering the linear response of a general diagonal Hubbard operator at site to as
[TABLE]
Given the constraint on (see Eq. 586), the first order change in is zero and therefore we have
[TABLE]
Similarly, with and we have
[TABLE]
Therefore, we have recovered Eq. (102). Moreover, we have
[TABLE]
where we have used the fact that the local reduced density matrix of is same as the local reduced density matrix of , as indicated in Eq. (595), and represented as in Eq. (105). Similarly, the local reduced density matrix of is the same as the local reduced density matrix of , as indicated in Eq. (596), and represented as in Eq. (106).
In summary, we have demonstrated that the first order CPE is equivalent to the GA and the SCDA. The key steps in the proof include: (1) the local reduced density matrix for is the same as for , with no dependency on the details of except its average value and (2) the momentum density distribution is uniformly shrunk towards through , which is uniquely determined by . Finally, it would be interesting to explore the behavior of the CPE beyond first order in finite dimensions, which provides insight beyond the GA for evaluating .
C.2 The CPE for
In this section, we use the first-order CPE to evaluate , where and , demonstrating the equivalence to the SCDA in . The CPE for can be viewed as a dual version of the CPE for the , which has various correspondences Cheng2020081105 . First, in the CPE for the , the central projector determines the local density, and is invariant after applying the interacting projector . Correspondingly, in the CPE for , the central projector determines the local density, and is invariant after applying . Second, in the CPE for the , the physical momentum density distribution is linearly related to the bare momentum density distribution, and the local reduced density matrix is independent of the details of the momentum density distribution. Correspondingly, in the CPE for the , the local reduced density matrix is linearly related to the reference local reduced density matrix, and the momentum density distribution is independent of the details of the local reduced density matrix.
We proceed in evaluating via the CPE by choosing the central point for as , where and is chosen such that . We can also rewrite by expressing as a linear combination of Hubbard operators, where . Considering the linear response of about , similar to Eqns. (576)-(578), we have
[TABLE]
where we similarly utilize the fact that commutes with and . We proceed by computing the response
[TABLE]
Directly computing Eq. (601) is cumbersome, and this can be avoided by decomposing into an alternate form using Eq. (3), resulting in
[TABLE]
where
[TABLE]
We introduce the density fluctuation operator , where , and when , and we use as a new basis for the CPE expansion. The diagonal Hubbard operator can be written as
[TABLE]
where enumerates over all subsets of and . The interacting projector can be written as , and at the central point , with absorbing the constant contribution, while if . Similar to Eq. (600), for we have
[TABLE]
given that in this case. It should be noted that has no effect on the expectation values so we implicitly only consider cases with . The response of to is given as
[TABLE]
Since is diagonal in , the correlation function on the right side of Eq. (607) is only non-zero when , and it can be written as the product of contributions from each relevant spin-orbital as
[TABLE]
requiring the evaluation of
[TABLE]
and can be evaluated similarly to Eq. (582) as
[TABLE]
Given that is a direct product state in momentum space, is only non-zero in the following cases. (1) , , and results in , where , (2) , , and results in , where and , (3) results in . In summary, we have
[TABLE]
which can be used to obtain
[TABLE]
where is the number of sites in the lattice. A constraint is imposed on such that
[TABLE]
resulting in and
[TABLE]
In infinite dimensions, only local contributions need to be accounted for, which is used in the following steps. Considering the response and to first order in and solving for as a function of , the response in infinite dimensions is
[TABLE]
where the renormalization factor is given by
[TABLE]
where and .
Finally, we need to consider how the momentum density distribution is influenced by using the response
[TABLE]
which is only non-zero when . To first order, and the contribution from where yields
[TABLE]
and
[TABLE]
In conclusion, if we write the local Hamiltonian as , the total energy per site for is given as
[TABLE]
where with for , and is the physical momentum density distribution and is constrained by . The can be viewed as variational parameters determined from the within . Combining Eq. (615) and Eq. (618), we can explicitly express as
[TABLE]
To evaluate the Hamiltonian in the usual Hubbard operator representation for a given while respecting the density constraint, we can compute from , and then use Eq. (614) to evaluate and then use Eq. (605) to compute Finally, we remark that Eq. (614) is derived by ignoring non-local contributions, which is equivalent to taking the limit of infinite dimensions. One could straightforwardly account for non-local contributions using Eq. (613), which will be explored in the future work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Z. Cheng and C. A. Marianetti, companion paper, Phys. Rev. Let XX.
- 2(2) E. Arrigoni and G. C. Strinati. Beyond the gutzwiller approximation in the slave-boson approach - inclusion of fluctuations with the correct continuum-limit of the functional integral. Phys. Rev. Lett. , 71:3178, 1993.
- 3(3) D. Baeriswyl. Variational schemes for many-electron systems. In Alan R Bishop, David K Campbell, and Steven E Trullinger, editors, Nonlinearity in Condensed Matter , pages 183–193. Springer-Verlag, Berlin, 1 edition, 1987.
- 4(4) J. Bunemann. The gutzwiller approximation for degenerate bands: a formal derivation. European Physical Journal B , 4:29, 1998.
- 5(5) J. Bunemann. A slave-boson mean-field theory for general multi-band hubbard models. Physica Status Solidi B-basic Solid State Physics , 248:203, 2011.
- 6(6) J. Bunemann and F. Gebhard. Equivalence of gutzwiller and slave-boson mean-field theories for multiband hubbard models. Phys. Rev. B , 76:193104, 2007.
- 7(7) J. Bunemann and W. Weber. Generalized gutzwiller method for n>=2 correlated bands: first-order metal-insulator transitions. Phys. Rev. B , 55:4011, 1997.
- 8(8) J. BÃŒnemann, S. Wasner, E. Von oelsen, and G. Seibold. Exact response functions within the time-dependent gutzwiller approach. Philosophical Magazine , 95:550, 2015.
