Enhancement of axion decay constants in type IIA theory?
Gaurav Goswami

TL;DR
This paper explores the potential for enhancing axion decay constants within type IIA string theory compactifications, but finds limitations in achieving super-Planckian values through known mechanisms.
Contribution
It provides a detailed analysis of axion decay constant enhancement attempts in string theory, including flux adjustments and the KNP alignment mechanism, revealing fundamental constraints.
Findings
No super-Planckian decay constants found
Alignment angle cannot be arbitrarily small
Flux adjustments do not enable large decay constants
Abstract
We investigate the possibility of enhancement of effective axion decay constant in well controlled constructions in string theory. To this end, we study the dynamics of axions arising in the compactifications of type IIA string theory on Calabi-Yau orientifolds with background fluxes (with non-perturbative effects included to ensure stabilization of all moduli). In this setup, we attempt to obtain large effective axion decay constant in two different ways: by searching for a direction in field space in which the potential is sufficiently flat and by arriving at a very explicit stringy embedding of the Kim-Nilles-Peloso (KNP) alignment mechanism. We do not find super-Planckian effective decay constants by either of the approaches. Furthermore, we find that the alignment angle of KNP mechanism can not be made arbitrarily small by adjusting the fluxes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Enhancement of axion decay constants in type IIA theory?
Gaurav [email protected]
School of Engineering and Applied Science, Ahmedabad University, Ahmedabad 380009, India
Abstract
We investigate the possibility of enhancement of effective axion decay constant in well controlled constructions in string theory. To this end, we study the dynamics of axions arising in the compactifications of type IIA string theory on Calabi-Yau orientifolds with background fluxes (with non-perturbative effects included to ensure stabilization of all moduli). In this setup, we attempt to obtain large effective axion decay constant in two different ways: by searching for a direction in field space in which the potential is sufficiently flat and by arriving at a very explicit stringy embedding of the Kim-Nilles-Peloso (KNP) alignment mechanism. We do not find super-Planckian effective decay constants by either of the approaches. Furthermore, we find that the alignment angle of KNP mechanism can not be made arbitrarily small by adjusting the fluxes.
I Introduction
Recently, there have been a number of speculations regarding the possible constraints which any self-consistent theory of Quantum Gravity imposes on the low energy effective theory describing the Universe at longer distances. This has lead to various ideas such as the Weak Gravity Conjecture ArkaniHamed:2006dz (and its various manifestations), Swampland conjecture Vafa:2005ui ; Ooguri:2006in (and refined swampland distance conjecture Klaewer:2016kiy ), de-Sitter swampland conjecture Obied:2018sgi and its refinements (see e.g. Garg:2018reu ; Ooguri:2018wrx for early papers and Palti:2019pca for a recent review).
Some of the arguments on which these conjectures rest are based on our general expectations from any self consistent theory of Quantum Gravity while others are based on diligent studies of well known string compactifications Font:2005td ; Ibanez:2012zz ; Douglas:2006es ; Baumann:2014nda at leading order in and with well-controlled sub-leading corrections (see e.g. Danielsson:2018ztv , Cicoli:2018kdo and Kachru:2018aqn ). In many such studies, an important role has been played by compactifications of type IIA theory mostly because in such a setting, all geometric moduli get fixed at the classical level (in this context, see e.g. Grimm:2004ua ; DeWolfe:2005uu ; Palti:2008mg for some early papers on moduli stablization in type IIA theory, Hertzberg:2007ke ; Hertzberg:2007wc ; Caviezel:2008tf ; Flauger:2008ad ; Blaback:2013fca ; Blaback:2018hdo for some attempts to obtain cosmologically interesting solutions and Acharya:2006ne ; Banks:2006hg ; Saracco:2012wc ; McOrist:2012yc ; Blaback:2010sj ; Gautason:2015tig ; Andriot:2018ept ; Junghans:2018gdb ; Banlaki:2018ayh ; Cordova:2018dbb ; Escobar:2018rna for some possible concerns about the validity of these solutions).
Another related subject, which has received a lot of attention is the possible non-existence of super-Planckian axion decay constants in well-controlled regimes of string theory Banks:2003sx . This subject is of paramount importance given the possible connection of this to large field cosmic inflation: the only version of inflationary scenarios which can be observationally tested in the foreseeable future. One must recall that at the level of field theory, one could imagine mechanisms which could boost the axion decay constant to super-Planckian values (such as in Kim:2004rp , see also Choi:2014rja ), the same has not been convincingly established in well-controlled regimes of string theory (see e.g. Blumenhagen:2017cxt for a recent work and references). Moreover, it is worth noting that many recent attempts to search large axion decay constants Palti:2015xra and large field excursions Baume:2016psm in well understood regimes of type IIA theory have motivated the refined swampland distance conjecture Klaewer:2016kiy . All this is consistent with the literature on Weak Gravity Conjecture ArkaniHamed:2006dz which mentions what is referred to as the axionic version of the Weak Gravity Conjecture.
Keeping this interesting literature in mind, we will attempt to come up with different ways of obtaining large (potentially super-Planckian) effective decay constants in the context of well studied type IIA flux vacua Palti:2015xra ; Baume:2016psm (see also Hebecker:2018fln for a recent work). First, we will try to do this by finding directions in axion field space such that the scalar potential along the direction is sufficiently flat by making sure that in such directions, the effective decay constant due to one of the axions is large and the vevs of the saxions corresponding to the rest of axions are so large that their contributions to the scalar potential are negligible. We will find that this can always be done for any fixed choice of fluxes but eventually, we shall argue that this approach for finding enhanced effective decay constant is not going to give desired results.
There have been many attempts in the past to embedd the well known Kim-Nilles-Peloso mechanism Kim:2004rp (also called alignment mechanism Choi:2014rja ) in string constructions (see e.g. Long:2014dta ; Gao:2014uha and Hebecker:2015rya ; Palti:2015xra ; Hebecker:2018fln ). In the context of IIA theory, this was recently tried in ref Palti:2015xra using some of the ideas first presented in Hebecker:2015rya . We will use the idea presented in Palti:2015xra and Hebecker:2015rya to obtain a very explicit realisation of the KNP alignment mechanism in which the effective axion decay constant can be explicitly deduced from the various fluxes. We will use this explicit construction to attempt to enhance the effective decay constant by inducing alignment by scanning over various flux values. We shall find that there is a lower limit on the alignment angle as well as a sub-Planckian upper limit on the value of effective decay constant for all the flux values we consider.
In §II.1, we remind the reader the basic equations relevant for understanding the dynamics of axions in IIA theory. Then, in §III, we attempt to enhance the effective decay constant, in particular, in §III.2.1, we present a method of doing so and study various consequences of this method in the later subsections. In §III.3, we find that this method does not lead to the desired results. Thus, in §IV, we attempt to obtain an enhanced effective decay constant by an explicit realisation of KNP alignment mechanism in type IIA theory (the details of this construction are given in §IV.0.2. We find that this approach also fails to provide a super-Planckian effective decay constant. Finally, in §V, we conclude with a discussion of various related issues.
II A quick reminder of key concepts
II.1 Type IIA flux vacua
In this work, we shall follow the notations and conventions used in ref Palti:2015xra . It is well known that compactifying type IIA string theory on orientifolds of Calabi-Yau threefolds () gives rise to supergravity theory in 1+3 spacetime dimensions Grimm:2004ua . We note that in the following, we focus attention to four dimensional effective field theory obtained from massive type IIA supergravity theory in ten dimensions Romans:1985tz .
II.1.1 supergravity
The dynamics of the scalar sector of supergravity theory in 1+3 spacetime dimensions is determined by a Kahler potential and a superpotential (along with other quantities which won’t play any role in what follows). Recall that if the complex scalar fields in the theory are denoted as , then, the Kahler potential is a real function of these fields and has mass dimension . Similarly, the superpotential is a holomorphic function of the fields and it has a mass dimension . The Lagrangian for the scalar sector (for those scalars which are not gauged i.e. in the absence of a D-term potential) is given by the expression
[TABLE]
where, the F-term scalar potential is given by
[TABLE]
note that here, the Kahler covariant derivative is given by and , while is the inverse of .
II.1.2 supergravity from IIA: fundamentals
The supersymmetry of the four-dimensional effective theory would ensure that all scalar fields are complex, and therefore pseudoscalar axions exist along with scalar moduli. For type IIA supergravity on orientifolds of a , we have three sets of complex scalars: (called complexified Kahler moduli, here, runs from 1 to ), , where (here, are the complex structure moduli while are axions) and, finally, (here, is the dilaton and is one of the axions). Recall that the vacuum expectation values of the moduli fields determine the shape or size of certain topological cycles in the extra dimensions.
The Kahler potential for the resulting theory is given by a sum of three contributions
[TABLE]
here, depends on the Kahler moduli alone (see Palti:2015xra for its exact expression) while depends on complex structure moduli alone. Note that the Kahler potential does not depend on the axions i.e. , and due to a perturbative shift symmetry. In this work, we focus our attention on two simple and quiet similar cases:
(a) which is mirror of the quintic, for which , (we will call this the two axion case), and,
(b) which is mirror to manifold for which while is given by Denef:2004dm
[TABLE]
which we will call the three-axion case. Note that one could obtain of two-axion case from the of three-axion case by setting .
The moduli and axions correspond to flat directions of the scalar potential (evaluated at the leading order) and they are stabilized by fluxes or instantons, to which we now turn.
II.1.3 Fluxes, superpotential and perturbative moduli stabilization
In ten dimensions, Type IIA supergravity has the following form gauge potentials: (and its magnetic dual ) in NS sector as well as , (and their duals , ) in RR sector Grimm:2004ua . Thus, the possible field strengths are as well as and . In addition, we have two more kinds of fields strength available: (a) since we are working with massive type IIA supergravity, the mass parameter of the theory acts as form flux Romans:1985tz , and (b) there is a form flux , which is basically the volume form on the compact manifold. If one compactifies on a manifold with a non-trivial cycle , one could consider configurations in which the flux of any of the field strengths could be non-zero on one of these cycles in the extra dimensions. When such fluxes are turned on, a superpotential is induced in the four dimensional effective theory (Grimm:2004ua ; DeWolfe:2005uu ; Acharya:2006ne ; Palti:2008mg , also, see Palti:2015xra for the details relevant in our context) and the compactification manifold is no longer a Calabi-Yau manifold, but for small enough fluxes, the backreaction of fluxes on the compactification manifold can be ignored.
One could then find the scalar potential using Eq (2) and look for supersymmetric critical points, by using the condition Wess:1992cp . Notice that since we are looking for supersymmetric critical points, they can not be de-Sitter. By following this procedure, one finds that all the geometric moduli (Kahler moduli and complex structure moduli) get fixed while only one linear combination of RR axions (, ) gets fixed, thus, unless , this leaves some axions unfixed.
For our purpose, the flux values themselves can be thought of as “free parameters” which we can adjust to obtain different solutions. When we focus attention on the sector of the scalar field space, the corresponding “free parameters” are the fluxes denoted as , , and (the dilaton flux parameter) as well as the volume (see Palti:2015xra for details). In the three-axion case, moduli stabilization at the classical level fixes the vevs of the fields , and to the values determined by the fluxes as follows Palti:2015xra
[TABLE]
On the other hand, only the following linear combination of the axions is fixed by this procedure.
[TABLE]
Here, the two-axion case can be arrived at by setting and to zero.
II.1.4 Non-perturbative effects
In the saddle point or semiclassical approximation, the amplitude of a process (e.g. tunneling) goes as , where is the Euclidean action of the process. There are non-perturbative contributions to the vacuum in string theory Dorey:2002ik ; Blumenhagen:2009qh ; Ibanez:2012zz , whose strength can be found by the following consideration: if the compactification manifold has a topologically non-trivial spacelike p-cycle, , this completely spacelike cycle could be wrapped by an Euclidean D(p-1)-brane (called an instanton, since the worldvolume of such a brane is localized in the time-direction of target space just like a gauge instanton is localized in spacetime rather than space). The contribution of the Euclidean D(p-1)-brane wrapping a p-cycle to the path integral (and hence the strength of the instanton) is then given by (see Eq (13.15) of Ibanez:2012zz ):
[TABLE]
here, is the string length, is the string coupling, is the volume of the p-cycle charcterised by the index , is the RR p-form gauge potential, is the axion and the factor comes from the Euclidean signature of the brane. The last term generates a cosine potential for the RR axion corresponding to . In the case of our interest, the superpotential generated by fluxes could receive non-perturbative corrections from Euclidean D2-brane instantons, which is of the form Palti:2015xra
[TABLE]
Given this generic form of the superpotential and the Kahler potential given in Eq (3), one can easily find the scalar potential by using Eq (2). For the correct choices of and , the potential would be of the form shown in Eq (27), Eq (57) or Eq (79) below. Note that the quantities and could in principle be deduced from string theory (they would be the intersection numbers between the 3-cycle wrapped by the instanton and the 3-cycles associated to the Ibanez:2012zz ). These choices are thus determined by the choice of the compactification manifold, and since there there is a lot of freedom to choose the compactification manifold, there is a wide range of possibilities for the values of and too. Let us note that, in this simple type IIA setting, the values of the moduli and can be explicitly determined in terms of fluxes using Eq (5), Eq (6) and Eq (7).
II.2 Kim-Nilles-Peloso mechanism
Before proceeding, we’d like to quickly remind the reader of the celebrated Kim-Nilles-Peloso mechanism Kim:2004rp , which is often also referred to as lattice alignment mechanism and which we will call KNP mechanism from here onwards (see also Choi:2014rja ).
Consider a two dimensional scalar field space , if the potential is of the form
[TABLE]
then, obviously, the direction in field space is periodic (with period ) while the direction orthogonal to it, the direction, is a flat direction (this is equivalent to saying that the corresponding decay constant is infinity).
Now consider a situation in which the potential is given by
[TABLE]
If one defines a new field by the relation
[TABLE]
with and , then one can interpret the direction in field space as the direction which makes an angle w.r.t. the direction such that etc. Thus, the potential in Eq (12) becomes
[TABLE]
so that the direction orthogonal to the direction must be a flat direction. Similarly, if the potential is
[TABLE]
then there is a direction , which makes an angle of with axis and for which
[TABLE]
with and and . The potential in Eq (12) becomes
[TABLE]
and the direction orthogonal to direction is a flat direction.
Finally, consider a system whose scalar potential is of the form , i.e.
[TABLE]
In this case, we have two directions in field space, the direction, Eq (13), and direction, Eq (16), and the angle between these directions can be found by evaluating the cross product of the unit vectors in those directions. We find that
[TABLE]
It is clear that, when i.e. when , i.e. the directions and are aligned, then, the direction orthogonal to (and ) will be a flat direction in field space. Now, implies that , i.e.
[TABLE]
We may wish to avoid having an exactly flat direction in field space (because that would correspond to having a massless scalar field in the spectrum), so, we may be interested in the situations in which there is a sufficiently light scalar whose mass is small in Planck units but big enough to avoid any detection. The most important insight is that under certain conditions, even when the above condition does not get satisfied exactly, there exists a direction in field space along which is large.
Consider a field redefinition such that the first field direction, is still given by Eq (13) and the second field direction, is orthogonal to . One can rewrite the potential Eq (II.2) in terms of these new fields and obtain
[TABLE]
where,
[TABLE]
If one could freely choose such that becomes too small, then, , and one can set , one would then get
[TABLE]
with chosen to be as large as desired. Thus, if one sits at the origin in field space, there exists a direction (the heavier, direction) in which the potential rises very steeply and there exists a direction orthogonal to this (the lighter, direction) in which the potential rises very slowly.
This KNP alignment mechanism has been studied in the context of string theory also (see e.g. Long:2014dta ; Gao:2014uha and Hebecker:2018fln ). In ref Palti:2015xra , the author, inspired by Hebecker:2015rya , studied a way to realise a rough version of KNP mechanism for type IIA string vacua. Recall that in this case, by simply turning the fluxes on, one can generate a potential which would fix all Kahler moduli, complex structure moduli and a linear combination of axions. In Palti:2015xra , the most explicitly studied case is the one with two-dimensional axion field space: there is one heavy direction and in the direction orthogonal to it, which is flat at the perturbative level, the potential is generated by non-perturbative effects (and hence, is a cosine). It was then argued that, unlike Eq (24), the decay constant in this stringy set up can not be enhanced to arbitrarily large values. It is this feature of the model studied in Palti:2015xra that we wish to study in greater detail. 222 Before proceeding, we note that in ref Palti:2015xra , the author focusses on a case in which , the superpartner of dilaton and , the superpartner of largest modulus are the only ones being mixed. Even in the case in which there are three axions, the author adjusts fluxes to ensure that the axion which is the superpartner of the smaller modulus becomes sufficiently heavy that it can be ignored from the low energy dynamics.
We will do so in two different ways, in particular, in §IV.0.2, we will arrive at a very explicit realisation of the KNP alignment mechanism.
III First attempt to obtain large effective decay constant
In this section, we shall attempt to enhance axion decay constant in the set up of IIA theory presented in the last section. This problem was studied in ref Palti:2015xra which we closely follow. As we shall see, there are important new lessons to be learnt even in the simple case of a which is mirror to manifold Denef:2004dm , for which and there are three scalar fields.
To begin with, however, we remind ourselves of how to deal with a slightly different situation, the two-axion case mentioned in the last section i.e. mirror of the quintic for which and . This case has already been studied in Palti:2015xra , but as we shall see, there are important observations to be made in order to study the more interesting case of mirror to with i.e. the three axion case. In Palti:2015xra , the author only briefly mentions the three axion case (in particular, in Palti:2015xra only the two-axion limit of the three axion case is mentioned). In the upcoming subsection, we revisit the two axion case while in the sub section after that, we analyse the three axion case.
III.1 The two-axion (i.e. ) case
Recall that in type IIA theory, moduli stabilization at leading order in and ensures that only a single linear combination of axions is fixed Grimm:2004ua ; DeWolfe:2005uu . When we have only two axions, this defines a unique straight line in field space which is a flat direction at perturbative level. If distance along this direction is thought of as a field, this field is perturbatively massless. As we saw, non-perturbative effects such as Euclidean brane instantons will then lift this flat direction and generate a potential for the perturbatively massless field.
In this case, the fluxes we could vary (to understand axion dynamics) are and we could think of the volume of the compactification manifold as another “free parameter.” The values of moduli and can be found in terms of these variables.
III.1.1 Basics
Here, can be obtained from Eq (4) by setting and so, using the equations presented in §II.1.1, we can show that
[TABLE]
This is the metric in field space and this will determine the kinetic terms of and . Recall that fluxes generate a potential and hence fix the values of the moduli and while for the axions and , a linear combination viz. , gets fixed. Thus, in the field space, there exists a direction which is flat at perturbative level. At low energies, we can think of and as essentially fixed quantities.
The Euclidean D2-brane instantons could generate a potential for the axions. The actual form of the potential would depend on the details such as which cycles the brane wraps, this determines the instanton. There must be solutions in which the instantons happen to be such that the Lagrangian determining the dynamics of the remaining low energy fields (i.e. ) is given by (see e.g. Palti:2015xra )
[TABLE]
where, the potential for the axions is generated by non-perturbative effects. On comparing Eq (1), Eq (26) and Eq (27), we find that is dependent on while is dependent on i.e.
[TABLE]
Needless to say, the canonically normalised axions are and . Before proceeding, we note the following:
The advantage of restricting our attention to instantons which lead to the potential shown in Eq (27) is that the periods as well as the amplitudes of the cosines are known in terms of moduli. Finally, note that, in this model, we also explicitly know the values of moduli themselves in terms of fluxes. 2. 2.
When one ignores the non-perturbative effects, at leading order in and , the linear combination is fixed to some value. By redefining the fields, one could ensure that
[TABLE]
Notice that if we keep the flux fixed and increase the flux , then, the straight line representing perturbative flat direction tends to align with the axion . 3. 3.
The moduli and have geometrical interpretation, they are related to volumes of certain topological cycles (in units of string length). Thus, if we wish to stay in the trustworthy regime of effective field theory, we must have and . From the expressions for the decay constants and in Eq (28), it is then clear that these decay constants have to be smaller than one in Planck units.
In the plane of the canonically normalised fields, this describes a straight line passing through the origin i.e.
[TABLE]
The slope of this line is given by and the two direction cosines of the line are
[TABLE]
where, . At this stage, it is worth recalling that in dimensional Euclidean space with Cartesian coordinates , the distance along any straight line passing through the origin (and with direction cosines ) is . If we now call to be the distance along the direction described by line Eq (30), one finds that
[TABLE]
If we go along the straight line direction Eq (30), non-perturbative effects shall generate a potential which can be found by substituting for and from Eq (33) into Eq (27), one thus obtains,
[TABLE]
where, and . Since in the low energy theory we can think of and as fixed quantities, the potential experienced if one moves along the straight line direction Eq (30) is a function of only one field, the distance along this direction. Then, Eq (III.1.1) suggests that the potential of is a sum of two cosines with different amplitudes and periods.
III.1.2 Flux independence of the slope of fixed direction
The coefficients of and in Eq (29) are clearly flux dependent, hence, by changing the fluxes, we could change the slope of the line in plane. Now consider the line in plane of the canonically normalised fields, the slope of the straight line in Eq (30) is . Using Eq (28), eq (5) and eq (6) (with set to 0), we find that this slope is equal to . I.e the fixed direction in the space of canonically normalised fields makes an angle w.r.t. the positive axis. Thus, by changing the fluxes, we can not change the orientation of the straight line in the plane of canonically normalised fields.
III.1.3 Obstruction to flat potential
Now, in this context one could think about the potential along the direction and its possible flatness. One of the things we mean when we say that the potential along the straight line direction Eq (30) is pretty flat is that it is a cosine with very large period. Suppose that one of decay constants among and , say the latter, is very large and that is large as compared to , then, the amplitude of the first cosine in Eq (III.1.1) is exponentially suppressed as compared to the second cosine while the period of this second cosine is also large, thus, we could get a direction in which the potential is quite flat. Note that,
[TABLE]
Thus, it appears that if one keeps , and fixed and increases , then for large enough , stays put while decreases and stays constant while increases. Thus, one might conclude that the first cosine in Eq (III.1.1) shall become suppressed over the second one while the period of the second one could be made large, thus, flattening the potential. Furthermore, using eq (28), eq (5) and eq (6) (with set to 0), we conclude that
[TABLE]
so that increasing with fixed (by holding , and fixed) will cause to increase as much as we like without any consequences.
This happens to be not true, since it turns out that
[TABLE]
Since and are simply proportional to the fundamental axion decay constants and , and since these fundamental decay constants can not be super-Planckian, we conclude that and shall also remain sub-Planckian. even though we could increase and , we can not make them so large that and become too small. Since and are geometric moduli which determine the sizes and shapes of compactification manifold, they can not be made too small without leaving the regime of validity of low energy effective field theory.
For our purpose, we note that the factors relating to and to are numbers. We shall see that in two-axion limit of three-axion case, there is additional freedom which can cause these factors to be very large numbers.
III.2 The three axion case
Going beyond the work of ref Palti:2015xra , in this subsection, we shall analyse the case with three axions and recover the two-axion case as a limiting case. It would appear that the three axion case offers new features and there is scope for enhancement of decay constant. But as we shall see at the end of this subsection, this is not so.
For the case with three axions, the Lagrangian of the low energy effective theory contains three scalar fields and . In the absence of non-perturbative corrections, the condition given by Eq (8) is satisfied, this defines a plane in space, which will call the perturbatively flat plane. In the absence of non-perturbative effects, the scalar potential along this plane is constant. But, for small enough and large enough compactification volume, the potential due to the non-perturbative effects would be quite small compared to the one generated by fluxes, thus, the energy required to leave the plane will be far larger than the potential generated by non-perturbative effects. Thus, at low energies, one would be “stuck” in this plane.
In plane , there are multiple straight line directions one could go to and each of these can be thought of as the field of interest. The question we ask is, can we get the potential experienced along any such direction as a cosine with a large period i.e. is sufficiently flat? As we shall see, it appears that this can be done. Notice that we have no guarantee that the field will actually go along such a straight line trajectory (though its dynamics is determined by, among other things, its potential). In any case, we’d argue at the end of this subsection that this approach shall not work.
Given the Kahler potential, the metric in the scalar field space can be found from
[TABLE]
which in the subspace of the scalar field space turns out to be
[TABLE]
where, . We now restrict our attention to the subspace of which is spanned by . In this two dimensional subspace, notice that the metric still depends on the vev of the moduli and .
Given the Kahler metric , we could find its eigenvalues (which we call and ) and eigenvectors. If one performs a change of basis such that the eigenvectors are used as the basis vectors, then the metric in the new basis is diagonal. One can then make an additional anisotropic scaling transformation to turn the metric into an identity matrix. Let the normalised eigenvectors of the metric be denoted by and and let be the matrix of change of basis from to , i.e.
[TABLE]
since the metric is real-symmetric, must be an orthogonal transformation. One must note that all these quantities depend on the moduli which themselves depend on the fluxes.
III.2.1 Search directions and enhancement
Perturbative moduli stabilisation ensures that at low energies we stay stuck in a plane, plane , in the space.
[TABLE]
Using Eq (44) and after scaling, this implies that
[TABLE]
where, we have simply rewritten the previous equation in terms of normalised eigenvectors of the Kahler metric. This normalisation off-course also canonically normalises the axions we work with i.e. the fields , and are the canonically normalised axions. Needless to say, in the above equation is a function of while and are functions of .
Now, Eq (46) describes a plane in the space of canonically normalised fields and from its defining equation, one can easily read off the components of the unit vector normal to the plane. Consider the line common between the plane Eq (46) and the plane . Obviously, the equation of this line is given by
[TABLE]
This is a direction in plane and one could go along this direction and ask whether the potential generated by non-perturbative effects could be sufficiently flat. In order to explore the other search directions, one could begin with a unit vector along the line given by the above equation and make a rotation by an angle about the axis which is normal to the plane, see fig (1). For any choice of this angle , there will be a new search direction. Let the direction cosines of this new search direction be , notice that these direction cosines depend on in addition to depending on the fluxes. Since the search direction lies in the plane described by Eq (46), its direction cosines must satisfy the equation of the plane (since the plane passes through the origin)
[TABLE]
Now, let be the distance along the search direction, then, since are direction cosines (recall, discussion just before eq (33)),
[TABLE]
Now, re-expressing the above relations in terms of the original axions , tells us that
[TABLE]
the above two equations can be used to solve for and in terms of , thus one gets (using the orthogonality of )
[TABLE]
One could assume that for some Euclidean D-brane instantons, after the diagonalisation of Kahler metric, the low energy effective theory is given by
[TABLE]
which, when expressed in terms of the field , will give the potential
[TABLE]
Comparing Eq (57) with Eq (III.2.1) and using Eqs (54), (55), (56), one can thus read-off the effective decay constants,
[TABLE]
This set of equations tell us that in space of canonically normalised fields, if we go along a direction with direction cosines and if the distance travelled is the field , the potential experienced is given by Eq (III.2.1), where, the three effective axion decay constants and are given by the above equation. It is worth noting that in the above Eq, the matrix elements of depend on the fluxes while, as mentioned above, the direction cosines depend on fluxes as well as . An important questions worth answering is could there be choices of fluxes and which enhance the effective decay constants?
III.2.2 A few useful remarks
When our search direction is perpendicular to axis, we are in the region of field space where and the scalar potential does not depend on . Perpendicularity to axis also implies that is zero. So, in Eq (49), on LHS, and on RHS, (as we are in the plane perpendicular to axis. In such a case, Eq (49) becomes indeterminate and we do not expect to find from Eq (59). Similarly, it is possible that for a fixed choice of fluxes, we happen to be exploring a direction such that the denominator in Eq (60) or Eq (61) becomes zero. Leaving such special cases where the denominator vanishes exactly, one could still ask whether there can be an enhancement of the effective decay constants.
Following the discussion at the beginning of §III.1.3, an important point worth noting is that in Eq (III.2.1), even if one of the decay constants, say is large enough, in order to have a flat potential, we must also ask whether and are large enough that the contribution of their potentials (which will be relatively more oscillatory since their decay constants are smaller) in the complete potential would be unimportant. If this can not be ensured, then, even if one of the decay constants, say is large, we won’t get a flat potential. Suppose we choose the flux values such that e.g. and are sufficiently large as compared to , then the potential will be mostly dominated by the axion . For such a fixed choice of fluxes, one could go along any direction in field space (starting from the origin). If the direction cosines of the search direction happen to be such that the denominator in Eq (60) becomes small, then, we could have an enhancement of as well as get an actual flat potential. From Eq (57), it is easy to see that the mass of each axion would be given by
[TABLE]
and typically, , thus, , thus, large vev shall make the axions light (because of the exponential factor). Thus, it is conceivable that the potential can be flattened by this procedure. In fig (2), we have shown an example of this phenomenon.
Starting from the formalism of three-axion case, one should be able to recover the two-axion case in some limit. This limiting case was briefly mentioned in Palti:2015xra but we will find new effects not studied there. When one chooses , one finds that and hence . If the other fluxes are also adjusted to also ensure that , then, the axion becomes too heavy. We then expect that we should be able to integrate out this heavy axion and recover the two-axion case in a limit. As we shall see, though this is true, there exist interesting subtleties. To take limit, we define
[TABLE]
the desired limit is then limit. Then, in terms of , the Kahler metric in Eq (78) takes the form
[TABLE]
retaining only the leading powers of , we can find the eigenvalues and hence see that
[TABLE]
where, the does not become too large. At leading order, the matrix of change of basis is
[TABLE]
where, we follow the convention that the first column of is the eigenvector corresponding to smaller eigenvalue. Let us suppose that when we try to retain the two-axion limit, the search direction we explore is the intersection of the plane of perturbatively unfixed axions and plane, this makes sense since this is equivalent to . Using the above form of the matrix in Eq (46), it is easy to see that, in the limit , the line which is common to this plane (in the space of canonically normalised scalar fields) and the plane has along the vector . Since this is the direction along which is defined in the two-axion limit, this indicates that we have recovered the flux independence of the slope of this line (mentioned in §III.1.2) in the two-axion limit. Moreover, if we keep the leading order terms in and follow the procedure described in §III.2.1, we can see that
[TABLE]
This result was also mentioned in Palti:2015xra . Now, having recovered the results in the two-axion case, let us apply the ideas presented in §III.2.1.
To this end, we begin to explore other directions in the three axion field space. In generating fig (3), we have fixed the fluxes to the following values and the volume . This gives and the initial direction of exploration (the intersection of the plane of perturbatively unfixed axions and plane) makes an angle of -29.25 deg w.r.t. direction, which is pretty close to the angle obtained in §III.1.2.
As we explore other directions in the plane by changing the angle (see discussion below eq (47)), we experience enhancement of the decay constants. Furthermore, the case gives results approximately in agreement with the two-axion case. In fig (3) the minimum of the green curve (the variation of against ) lies very close to the dotted green horizontal line (which specifies ). But the the minimum of the red curve (the variation of against ) is far above the dashed orange horizontal line (which specifies ). This is a manifestation of eq (38) and Eq (39). In particular, in fig (3), at , the extreme left region, . Similarly, is is easy to see from fig (3) that, as mentioned in §III.2.2, as we vary , the effective decay constants blow up and this happens for for .
While it was useful to recover the two-axion model in an appropriate limit of the three-axion model, one must understand that one could vary fluxes such that is no longer small. For every choice of fluxes, we could vary to look for directions to enhance the effective decay constants. The results presented here suggest that the decay constants can be enhanced this way. In particular, no matter what choice of fluxes on begins with, one could always vary and find directions in field space along which the potential seems quite flat.
III.3 Failure of the approach
The results presented till now basically imply that there are some radial directions in the perturbatively flat plane in field space in which the potential appears to be a cosine with arbitrarily large effective decay constant. For certain directions, this effective decay constant could even be infinite! The question is: is this the correct way to look for large effective decay constants? We are of course free to explore any direction in field space, but there is no reason that the field would actually evolve in the direction which we choose to explore. Will the field roll in the direction along which the effective decay constant is large?
In the language of plane polar coordinates introduced in the perturbatively flat plane, the scalar field is the radial coordinate in this plane while will be the angular coordinate and the potential for in Eq (III.2.1) depends on the choice of . In order to better understand what’s happening, we could find the potential in Eq (III.2.1) for various values of and then plot the potential for all these values of . When we do that, we obtain the contour plot of the potential as shown in fig (4). It now becomes clear the Lagrangian in Eq (57) happens to be such that one direction in the plane is still a flat direction. This is the reason because of which we get the effective decay constant which diverges for certain choices of and this is what explains the results shown in fig (3). Thus, the apparent enhancement in effective decay constant is just an artefact of the fact that the Lagrangian of Eq (57) does not lead to stabilization of all the moduli. Finally, notice that in fig (4), we have introduced the fields and , which are Cartesian coordinates in the perturbatively flat plane in field space.
Thus, the approach described in the last subsection does not lead to any actual enhancement of the effective decay constant. But, we can learn the following important lessons from this:
we should only pay attention to the behaviour of the scalar potential in the directions in which the field could actually roll, 2. 2.
we should avoid situations in which there are massless directions in field space, and, 3. 3.
since it is easier to interpret our results if we work with fields which act as Cartesian coordinates in the perturbatively flat plane, we must work with such fields.
IV A more careful attempt to obtain large effective decay constant
In this section, we shall try another, much more careful approach to obtain large effective decay constant which will not suffer from the problems of the approach explained in the last section. As we shall see, even this approach will fail to give a super-Planckian effective decay constant.
IV.0.1 Evolution in field space
We are interested in the behaviour of fields near AdS vacua. For a field theory with negative vacuum energy, the spacetime evolves as a spatially open FRW Universe (which expands and then re-collapses). If space is homogeneously filled with the scalar fields which roll down their potentials (which is negative), the fields and spacetime evolve in accordance with
[TABLE]
where, we have set . In the following, we shall ignore the evolution of spacetime and hence, for us,
[TABLE]
i.e. the acceleration of the field in field space will point in the direction opposite to that of the gradient vector. At a local minimum in field space, the gradient vanishes so the acceleration of the field at a point near the local minimum would be given by
[TABLE]
so that if the direction happens to be an eigenvector of the Hessian matrix of the field, the field accelerates in the same direction as the displacement. Furthermore, if the initial velocity of the field is zero, it evolves in the same direction as acceleration. For potential given by Eq (II.2), near the origin, if one finds the eigenvectors of the Hessian matrix, when becomes small, these eigenvectors coincide with and directions.
Thus, there will now be no freedom to explore the various directions in field space. When all moduli are stabilized, the Hessian matrix will have two positive eigenvalues, we must look at the behaviour of the potential in the direction of the eigenvector corresponding to smaller eigenvalue.
IV.0.2 Realising KNP mechanism
From the Kahler potential, Eq (3), with given by Eq (4), one can easily find the Kahler metric, which in this case will be given by
[TABLE]
where . Notice that the metric still depends on the vev of the moduli , and . To aid the discussion, let us denote the set of fields by . Then, we could assume that appropriate D-brane instantons have been chosen such that the Lagrangian of the low energy effective theory is of the following form
[TABLE]
where the Kahler metric is not diagonal. The above choice happens to be such that there are no massless directions in field space anymore. The factor of half in front of the kinetic term arises from the fact that the complex scalars in Eq (1) are decomposed as etc.
Since the metric would still be a real, symmetric matrix, it can be diagonalised by an orthogonal transformation in the field space. In this choice of basis fields, the Kahler metric (which in this case would be a real and symmetric matrix) is not diagonal, it then makes sense to make an orthogonal transformation in field space to diagonalise it. Let the new basis fields be , then,
[TABLE]
and hence the Lagrangian in Eq (79) becomes
[TABLE]
It is well known that if the matrix is chosen such that its column is eigenvector of , then, will be a diagonal matrix . Let
[TABLE]
then, Eq (81) takes the form
[TABLE]
Next one could redefine the fields such that the Kahler metric becomes an identity matrix: this requires a scaling transformation in the field space. We could now make a scaling transformation of the form
[TABLE]
and hence Eq (81) becomes
[TABLE]
where, the sum over is implied. Note that the field space metric for fields is a Kronecker delta.
One could find the equation of the perturbatively flat plane in terms of the new fields. The constraint of always being in this plane can be used to eliminate one of the field using the equation of the plane. One can then introduce a “Cartesian-coordinate” system in this plane and write the Lagrangian in terms of the the fields. This Lagrangian depends on the three fields , but, since there is a constraint of being stuck in the plane described by Eq (8), there are really only two independent fields. Since we are stuck in the plane Eq (8), it is sensible to introduce a Cartesian coordinate system in this plane and express the above Lagrangian, Eq (85) in terms of these fields. To this end, we note that Eq (8) can be written in terms of as
[TABLE]
with
[TABLE]
This is the equation of the plane described by Eq (8), which we shall denote as , in terms of the new fields. Note that the Eq (86) implies that the vector
[TABLE]
must be normal to the plane . Now consider the intersection of the plane with the plane , this gives a line, a vector along this line would be such that , so, it is of the form
[TABLE]
unlike the vector , the vector lies in the plane . In order to have an orthonormal basis, we need another vector which lies in the plane which is perpendicular to . We can consider
[TABLE]
then,
[TABLE]
it is easy to verify that the components of this vector satisfy Eq (86). From the expressions for , and , one can find the unit vectors , and . Let us write , and , notice that .
Now consider an arbitrary vector in the field space,
[TABLE]
if we use as the new basis, then, the same vector could be written as
[TABLE]
One can use the above two expressions for to etc.If the vector lies in the plane , then, and we get
[TABLE]
We could now use these expressions in Eq (85) and get the following
[TABLE]
where,
[TABLE]
From Eq (99), it is clear that this is a successful realisation of KNP mechanism.
IV.0.3 Numerical analysis
From Eq (4), it is clear that must be greater than , Eq (6) and Eq (7) then suggest that this can be ensured if . We fix the fluxes and and fix the volume . This fixes the value of the modulus . We now wish to change the fluxes and (with ) and for each choice of these fluxes, we can find the values of the fields and .
If we now work in the regime in which so that the first cosine in Eq (99) gives negligible contribution and can be completely ignore. In this regime, Eq (99) becomes identical to Eq (II.2) where the decay constants are completely determined by fluxes. To achieve this, we only admit those choices of fluxes which ensure that . Furthermore, we would enforce the constraint that the ratio lies between 1 and 10. Otherwise, the amplitudes of the two cosines in Eq (II.2) would be too different even without the alignment mechanism.
One can now determine the Kahler metric Eq (78) and hence its eigenvalues, eigenvectors and the elements of the matrix in Eq (80). In trustworthy regimes, we expect the eigenvalues of the Kahler metric to be less than one in Planck units, because they are closely related to Volumes of cycles which better be large in string units. Thus, we also enforce the requirement that the quantities , and are all smaller than 1.
Keeping all of these observations in mind, it appears that, since we have so many fluxes available to adjust, we should be able to get to be as large as desired. Thus, we fix the fluxes , and the volume , this then fixes the modulus . We then vary from 26 to 225 while, for each of these values of , we let vary from 25 to (to ensure that ). This leads to 20100 combinations of values of fluxes, of course, not all these combinations would lead to the satisfaction of the constraints outlined above. Among these, there are only 3100 flux combinations which lead to the satisfaction of all the constraints mentioned.
Next, for each of these choices of the fluxes and (with ), one could use the formalism presented in the last subsection to determine the “decay constants” as well as . Since these “decay constants”are derived quantities with no geometrical interpretation, we do not enforce the constraints that they be small or large. We shall find that these derived “decay constants” could also sometimes be negative. Note that, if and both happen to be zero, then would be infinity. Thus, since the matrix elements of the matrix turn up in the expressions of the decay constants, the values of these derived decay constants may appear to become unreasonably large for some circumstances. Having determined these derived decay constants, we could find whether is sufficiently small. If that happens to be the case, the value of can be found from Eq [24]. On the other hand, when is not too small, there is no hierarchy between the masses of and and hence no sufficiently flat direction in the field space.
We could now impose the requirement that we’d only consider those values of fluxes which lead to the value of to be less than, 0.25 (to ensure sufficient alignment), then, we find that there are only 68 combinations of fluxes which satisfy this constraint. Among these, the largest value of effective decay constant (i.e. Eq (24)) obtained is , corresponding to and and the corresponding alignment angle in Eq (19) is degrees (i.e. roughly 0.13 radian). The behaviour of the scalar potential near the origin in space for this choice of fluxes can be seen in fig (5).
Thus, provided we satisfy all the constraints mentioned above, even though we have the freedom to adjust fluxes to change the values of the individual decay constants which turn up in Eq (99), we can not make the effective decay constant to be super-Planckian and we can not decrease the alignment angle to an arbitrarily small value.
V Discussion
It has been observed for a very long time Banks:2003sx that obtaining a large axionic decay constant i.e. for string theory axions always takes us out of the trustworthy regime of theoretical control. All of this is closely related to a number of issues studied in the recent literature e.g. the various versions of swampland distance conjecture and axionic weak gravity conjecture Klaewer:2016kiy ; Font:2019cxq ; Grimm:2019wtx as well as issues of the trajectory being followed in field space Landete:2018kqf . Keeping this literature in mind, in the present work, we revisited the question of possible enhancement of the effective decay constant of axions. The best interpretation of the results we have obtained is that we have found more evidence that in trustworthy regimes of string theory, the effective decay constant can not be super-Planckian.
One must note the various caveats associated with the formalism used in this work: firstly, in these constructions, the axionic field space is not compact e.g. in two-axion case, the the potential in direction orthogonal to the perturbatively flat direction is not periodic. Secondly, it seems that the conclusions are too much dependent on the choice of the specific Euclidean D-brane instantons. Also, note that the type IIA flux vacua we have been dealing with are AdS. Thus, if one intends to make any statement about cosmic inflation, one must find what uplifting will do to the various effects described here. Thus, one needs to understand uplifting mechanisms better before one can make any statements about large field inflation based on the work of this paper. Finally,, one must mention some of the concerns expressed in the literature about the validity of the solutions used in this paper Acharya:2006ne ; Banks:2006hg ; Saracco:2012wc ; McOrist:2012yc ; Blaback:2010sj ; Gautason:2015tig ; Andriot:2018ept ; Junghans:2018gdb ; Banlaki:2018ayh ; Cordova:2018dbb ; Escobar:2018rna . Since in such constructions one works with massive type IIA supergravity (supplemented with orientifold 6-plane sources), the corresponding Romans mass parameter does not dilute in the large volume limit, this has inspired doubts about the validity of these constructions. In addition to the problems with expanding around a non-solution like a Calabi-Yau metric or the concerns about defining O6-planes Acharya:2006ne , one also finds that the solutions of DeWolfe:2005uu do not solve the massive IIA supergravity equations of motion even approximately McOrist:2012yc (as required for large volume, weakly-coupled backgrounds). We must also mention that we have ignored the open string moduli arising from e.g. brane positions as well as blow-up moduli or twisted moduli, we have not attempted to enforce tadpole cancellation conditions and we have not taken into account the backreaction caused by Kahler moduli as pointed out in Hebecker:2018fln very recently.
With all of these issues in mind, §III.2, we tried to explore the possibility of enhancement of axion decay constant from the point of view of the statement of axionic Weak Gravity Conjecture. Thus, we tried to find directions in axion field space in which the scalar potential is a cosine with a large period. This requires two things to happen (a) the potential along the direction of interest must be a cosine with effective decay constant being due to only one of the axions and being large, and, (b) the vev of the saxion corresponding to the rest of axions must be so large that their contributions to the scalar potential must be negligible. We found that just these requirements can always be fulfilled for any fixed choice of fluxes. But, as we explained in §III.3, we realised that we should be careful to ensure that the field should actually roll along the direction so found and to ensure that no moduli are left unstabilised.
In order to cure these problems, we came up with a completely explicit realisation of the KNP alignment mechanism in the context of type IIA string theory (see Eq (99)) which proves to be a very convenient way of obtaining an effective axion decay constant in a well-controlled regimes of string theory. The decay constants in this case can be explicitly found for any choice of fluxes (see e.g. the discussion around Eq (99)). We have thus presented a formalism which can be used to find the alignment angle (defined by Eq (19)) as well as the effective decay constant (defined by (i.e. Eq (24)) from the fluxes. It might appear that, since we have so many fluxes available to adjust, we should be able to get to be as large as desired. But when we numerically evaluated quantities for a very large combination of fluxes and imposed some sensible conditions on the valid answers, and , it was found (as is suggested by a lot of previous literature) that unlike any randomly chosen field theory, in a low energy effective field theory arising from string theory, somehow, there seems to be a sub-Planckian upper limit on the effective decay constant and a lower limit on the alignment angle. This provides yet another example of the fact that the famous KNP alignment mechanism does not quite work in explicit examples in string theory.
Acknowledgements The author would like to thank Mansi Dhuria (IITRAM, India), Ashwin Pande (Ahmedabad University, India), Anshuman Maharana (HRI, India), Ashoke Sen (HRI, India), Timm Wrase (TU Wien, Austria), Fernando Quevedo (ICTP, Italy), Eran Palti (MPP, Germany) and Chethan Krishnan (IISc, India) for clarifying a number of issues related to string compactifications. This research was supported in part by the International Centre for Theoretical Sciences (ICTS) during a visit for participating in the program - Kavli Asian Winter School (KAWS) on Strings, Particles and Cosmology 2018 (Code: ICTS/Prog-KAWS2018/01). The author would also thank the organisers of School and Workshop on “String Field Theory and String Phenomenology” held at Harish Chandra Research Institute (HRI), Allahabad, India.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) N. Arkani-Hamed, L. Motl, A. Nicolis and C. Vafa, “The String landscape, black holes and gravity as the weakest force,” JHEP 0706 , 060 (2007) doi:10.1088/1126-6708/2007/06/060 [hep-th/0601001].
- 2(2) C. Vafa, “The String landscape and the swampland,” hep-th/0509212.
- 3(3) H. Ooguri and C. Vafa, “On the Geometry of the String Landscape and the Swampland,” Nucl. Phys. B 766 , 21 (2007) doi:10.1016/j.nuclphysb.2006.10.033 [hep-th/0605264].
- 4(4) D. Klaewer and E. Palti, “Super-Planckian Spatial Field Variations and Quantum Gravity,” JHEP 1701 , 088 (2017) doi:10.1007/JHEP 01(2017)088 [ar Xiv:1610.00010 [hep-th]].
- 5(5) G. Obied, H. Ooguri, L. Spodyneiko and C. Vafa, “De Sitter Space and the Swampland,” ar Xiv:1806.08362 [hep-th].
- 6(6) S. K. Garg and C. Krishnan, “Bounds on Slow Roll and the de Sitter Swampland,” ar Xiv:1807.05193 [hep-th].
- 7(7) H. Ooguri, E. Palti, G. Shiu and C. Vafa, “Distance and de Sitter Conjectures on the Swampland,” Phys. Lett. B 788 , 180 (2019) doi:10.1016/j.physletb.2018.11.018 [ar Xiv:1810.05506 [hep-th]].
- 8(8) E. Palti, “The Swampland: Introduction and Review,” ar Xiv:1903.06239 [hep-th].
