A Detailed Comparison of Multi-Dimensional Boltzmann Neutrino Transport Methods in Core-Collapse Supernovae
Sherwood Richers, Hiroki Nagakura, Christian D. Ott, Joshua Dolence,, Kohsuke Sumiyoshi, Shoichi Yamada

TL;DR
This paper compares two advanced neutrino transport methods, discrete ordinates and Monte Carlo, in simulating core-collapse supernovae, highlighting their agreement, differences, and the impact of various approximations.
Contribution
First detailed multi-dimensional comparison of DO and MC neutrino transport methods in supernovae, including an improved Sedonu code with relativistic capabilities.
Findings
Good spectral and angular agreement between methods
DO excels in optically thick regions for heating/cooling rates
MC provides sharper angular features but has higher noise
Abstract
The mechanism driving core-collapse supernovae is sensitive to the interplay between matter and neutrino radiation. However, neutrino radiation transport is very difficult to simulate, and several radiation transport methods of varying levels of approximation are available. We carefully compare for the first time in multiple spatial dimensions the discrete ordinates (DO) code of Nagakura, Yamada, and Sumiyoshi and the Monte Carlo (MC) code Sedonu, under the assumptions of a static fluid background, flat spacetime, elastic scattering, and full special relativity. We find remarkably good agreement in all spectral, angular, and fluid interaction quantities, lending confidence to both methods. The DO method excels in determining the heating and cooling rates in the optically thick region. The MC method predicts sharper angular features due to the effectively infinite angular resolution, but…
| Problem Name | Special Relativity | Spatial Resolution | Angular Resolution | MC Particles | DO |
|---|---|---|---|---|---|
| () | |||||
| 1D Calculations | |||||
| 1D_1x | yes | 1.82 | |||
| 1D_2x | yes | 2.33 | |||
| 1D_4x | yes | 2.67 | |||
| 1D_4x_nonrel | no | – | |||
| 1D_4x_native | yes | – | 2.96 | – | |
| 1D_4x_native_nonrel | no | – | 3.25 | – | |
| 2D Calculations | |||||
| 2D_LR | yes | – | |||
| 2D_LR_nonrel | no | – | |||
| 2D_HR | yes | – | |||
| 2D_HR_native | yes | – | 63.4 | – | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A detailed comparison of multi-dimensional Boltzmann neutrino transport methods in core-collapse supernovae
Sherwood Richers1,∗, Hiroki Nagakura1, Christian D. Ott2,1, Joshua Dolence3, Kohsuke Sumiyoshi4,
and Shoichi Yamada5,6
1TAPIR, Walter Burke Institute for Theoretical Physics, Mailcode 350-17, California Institute of Technology, Pasadena, CA 91125, USA
2 Center for Gravitational Physics and International Research Unit of Advanced Future Studies, Yukawa Institute for Theoretical Physics, Kyoto University, Kitashirakawa Oiwakecho, Sakyo-ku, Kyoto 606-8502, Japan
3CCS-2, Los Alamos National Laboratory, P.O. Box 1663 Los Alamos, NM 87545
4Numazu College of Technology, Ooka 3600, Numazu, Shizuoka 410-8501, Japan
5Advanced Research Institute for Science & Engineering, Waseda University, 3-4-1 Okubo, Shinjuku, Tokyo 169-8555, Japan
6Department of Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku, Tokyo 169-8555, Japan
Abstract
The mechanism driving core-collapse supernovae is sensitive to the interplay between matter and neutrino radiation. However, neutrino radiation transport is very difficult to simulate, and several radiation transport methods of varying levels of approximation are available. We carefully compare for the first time in multiple spatial dimensions the discrete ordinates (DO) code of Nagakura, Yamada, and Sumiyoshi and the Monte Carlo (MC) code Sedonu, under the assumptions of a static fluid background, flat spacetime, elastic scattering, and full special relativity. We find remarkably good agreement in all spectral, angular, and fluid interaction quantities, lending confidence to both methods. The DO method excels in determining the heating and cooling rates in the optically thick region. The MC method predicts sharper angular features due to the effectively infinite angular resolution, but struggles to drive down noise in quantities where subtractive cancellation is prevalent, such as the net gain in the protoneutron star and off-diagonal components of the Eddington tensor. We also find that errors in the angular moments of the distribution functions induced by neglecting velocity dependence are sub-dominant to those from limited momentum-space resolution. We briefly compare directly computed second angular moments to those predicted by popular algebraic two-moment closures, and find that the errors from the approximate closures are comparable to the difference between the DO and MC methods. Included in this work is an improved Sedonu code, which now implements a fully special relativistic, time-independent version of the grid-agnostic Monte Carlo random walk approximation.
Subject headings:
supernovae: general—neutrinos—radiative transfer
1. introduction
Most massive stars () end their lives in a cataclysmic core-collapse supernova (CCSN) explosion that releases around of kinetic energy and around of neutrino energy. The iron core begins to collapse when it exceeds its effective Chandrasekhar mass as degenerate electrons capture onto nuclei and photodissociation breaks apart nuclei (e.g., Bethe 1990). Within a few tenths of a second after the onset of collapse, the inner core becomes very neutron-rich () and exceeds nuclear densities (). At this point, the strong nuclear force kicks in, dramatically stiffening the equation of state (EOS) and abruptly stopping the collapse of the inner core within a few milliseconds. The inner core then rebounds, sending a shock wave through the supersonically infalling outer core. Neutrino cooling removes energy from the matter under the shock and photodissociation of heavy nuclei weakens the shock. The shock subsequently stalls at around as it is lacks pressure support from below to overcome the ram pressure of the accreting outer stellar core.
Understanding the mechanism that revives the shock’s outward progress and results in a CCSN is presently the main target of CCSN theory. The canonical theory is the neutrino mechanism (Bethe & Wilson, 1985), whereby neutrinos emitted from the dense inner core pass through the matter below the shock, depositing enough thermal energy to revive the shock via thermal support and by driving turbulence (e.g., Janka 2001; Burrows 2013; Müller 2016). However, the strongly nonlinear dynamics in this stage is inexorably coupled to a variety of microphysical processes. In particular, it is very sensitive to properties of the neutrino field passing through the matter. During the stalled shock phase, the star delicately straddles the line between explosion and total collapse, so small differences in how the neutrinos interact with the matter can be the difference between an explosion and a dud (e.g., Janka 2001; Murphy & Burrows 2008; O’Connor & Ott 2011; Melson et al. 2015a; Couch & Ott 2015; Burrows et al. 2016).
Computation has become the primary tool for studying these nonlinear processes, as it allows us to see detailed dynamics and make observable predictions (electromagnetic radiation, neutrinos, gravitational waves) under the assumptions imposed by the model. However, computational techniques and resources are still too primitive to allow for simulations complete with all of the required fidelity and involved physics. In general, simulations of CCSNe require a three-dimensional general-relativistic (GR) treatment of magnetohydrodynamics, neutrino radiation transport, and a microphysical equation of state (EOS) (e.g., Kotake et al. 2012; Janka 2012; Ott 2016). The simulations also require sufficient resolution or sub-grid modeling to capture everything from global dynamics to 100 meter-scale or smaller turbulence (Radice et al., 2016; Ott, 2016).
Deep in the inner core, neutrinos are trapped and form an isotropic thermal distribution that slowly diffuses out. Outside the shock, the neutrinos are free-streaming and move only radially outward. Though radiation transport methods are constructed to simulate these limits well, the intermediate semi-transparent region is challenging to accurately simulate. This region is responsible for most of the dynamics that support the shock’s progress due to neutrino heating. In addition, the neutrino opacity scales approximately as the square of the neutrino energy, causing the energy deposition rate and the location of the transition from trapped to free-streaming to depend sensitively on neutrino energy. Hence, we require a means of simulating neutrinos of many energies in all regions of a CCSN.
A full treatment of classical neutrino radiation requires evolving the neutrino distribution function of each neutrino species according to the seven-dimensional Boltzmann equation (e.g., Lindquist 1966; Ehlers 1993; Mihalas & Weibel-Mihalas 1999) (three spatial dimensions, three momentum dimensions, time), which presents a significant computational challenge. A wide variety of methods have been used to capture the most important aspects of neutrino transport through the supernova that can be broadly categorized as either phenomenological, deterministic, or probabilistic methods. Though some methods are definitively more accurate than others, there is always a trade-off between efficiency and accuracy.
Phenomenological approaches include the light bulb scheme and neutrino leakage and only very approximately account for neutrino effects. These schemes are very efficient, making them very conducive to parameter studies. In the light bulb scheme (e.g., Bethe & Wilson 1985; Janka & Müller 1996; Ohnishi et al. 2006; Murphy & Burrows 2008) the luminosity and temperature of each neutrino species are simply input parameters. All heating rates are based on this parameter and cooling rates are estimated based on an approximate optical depth. The inner light bulb boundaries have also been combined with gray transport schemes in the semi-transparent and transparent regimes (Scheck et al., 2006). In the leakage scheme, an approximate neutrino optical depth at each point is calculated, and this is used to set the cooling rate at each point (Ruffert et al., 1996; Rosswog & Liebendörfer, 2003). Neutrino heating can be included approximately by assuming that the neutrino luminosity through a given point is determined by the energy leaking radially outward from below (O’Connor & Ott, 2010; Fernández & Metzger, 2013; Perego et al., 2016).
Approximate deterministic methods solve a simplified version of the Boltzmann equation in order to make a more tractable problem. The isotropic diffusion source approximation (IDSA) method evolves an isotropic trapped component and a free-streaming component of the distribution function (Liebendörfer et al., 2009). In truncated moment methods, the distribution function is discretized into an infinite list of angular moments, only the first few of which are directly evolved. Flux-limited diffusion (FLD, Levermore & Pomraning 1981; Mihalas & Klein 1982; Pomraning 1983; Castor 2004; Krumholz et al. 2007) is a one moment method that evolves only the zeroth moment of the distribution function (energy density) and requires a closure relation to estimate the first moment (flux). However, FLD fails to capture much of the angular information about the distribution function and tends to smooth out angular variations (e.g., Janka 1992; Burrows et al. 2000; Liebendörfer et al. 2004; Ott et al. 2008; Zhang et al. 2013). In the two-moment method (Pomraning, 1969; Anderson & Spiegel, 1972; Thorne, 1981; Dubroca & Feugeas, 1999; Audit et al., 2002; Shibata et al., 2011; Vaytet et al., 2011; Cardall et al., 2013), the zeroth and first moments are evolved, and a closure relation is required to estimate the second moment (pressure tensor) and complete the system of equations. The closure can be provided by some ad-hoc analytical function (e.g., Smit et al. 2000; Murchikova et al. 2017 and references therein). This is also known as the M1 method, though M1 confusingly refers to a specific closure as well (Levermore, 1984; Dubroca & Feugeas, 1999). The closure can also be more accurately determined using a direct solution of the Boltzmann equation, referred to as a variable Eddington tensor (VET) method (e.g., Stone et al. 1992; Hayes & Norman 2003).
In discrete ordinates (DO or ) methods, the distribution function is discretized into angular bins, each of which is directly evolved (e.g., Pomraning 1969; Mihalas & Weibel-Mihalas 1999 and references therein). In spherical harmonic () methods, the distribution function is decomposed in terms of spherical harmonics, and a small number of these are evolved (e.g., Pomraning 1973; Radice et al. 2013). Finally, fully spectral methods in all six space-momentum dimensions have been applied to stationary neutrino transport calculations (Peres et al., 2014).
Monte Carlo (MC) radiation transport (Fleck et al., 1971; Fleck & Canfield, 1984; Densmore et al., 2007; Abdikamalov et al., 2012) is a probabilistic method that samples the trajectories of a finite number of individual neutrinos and assumes their behavior is representative of the rest of the bulk neutrino behavior. Tubbs (1978) applied MC methods to neutrino transport for the first time to study neutrino-matter equilibration in an infinite uniform medium. MC transport has been long used in 1D steady-state transport calculations (Janka & Hillebrandt, 1989; Janka, 1991, 1992; Yamada et al., 1999; Keil et al., 2003; Abdikamalov et al., 2012), though the code of Abdikamalov et al. (2012) was also designed to perform time-dependent calculations. Richers et al. (2015) performed steady-state MC transport calculations on 2D snapshots of accretion disks from neutron star mergers, though the optical depths were much lower than in the CCSN context.
There is much more to a radiation transport code than the broad classes of methods mentioned above. Detailed Boltzmann transport using either DO or VET methods in dynamical simulations has been achieved in one (Mezzacappa & Bruenn, 1993; Yamada et al., 1999; Burrows et al., 2000; Rampp & Janka, 2002; Roberts, 2012) and two (Livne et al., 2004; Ott et al., 2008; Nagakura et al., 2017a) spatial dimensions, but three dimensional calculations are presently only possible when the fluid is assumed to be stationary (Sumiyoshi & Yamada, 2012; Sumiyoshi et al., 2015). The local two-moment method is used in 1D (O’Connor, 2015), 2D (O’Connor & Couch, 2015; Just et al., 2015a), and 3D (Sekiguchi et al. 2015; Roberts et al. 2016; Foucart et al. 2016b; Kuroda et al. 2016, see also Müller & Janka 2015) core collapse and neutron star merger simulations. FLD is also popular in 2D simulations (Dessart et al., 2006; Swesty & Myra, 2009; Zhang et al., 2013). Various versions of the ray-by-ray (RbR) approximation can also be used to extend a 1D transport method to two (Burrows et al., 1995; Buras et al., 2006; Müller et al., 2010; Suwa et al., 2010; Pan et al., 2016) or three (Hanke et al., 2013; Takiwaki et al., 2012; Lentz et al., 2015; Melson et al., 2015b) spatial dimensions in an efficient manner by making transport along individual radial rays nearly independent of other rays and/or by solving a single spherically averaged 1D transport problem. Fully general relativistic neutrino radiation hydrodynamics simulations are now possible (Janka, 1991; Yamada et al., 1999; Liebendörfer et al., 2001; Sumiyoshi et al., 2005; Müller et al., 2010; Shibata & Sekiguchi, 2012; Kuroda et al., 2012; O’Connor, 2015; Foucart et al., 2016b; Roberts et al., 2016; Kuroda et al., 2016) and many codes incorporate general relativistic effects with various levels of approximation (e.g., Müller et al. 2010; O’Connor & Couch 2015; Skinner et al. 2016). Special-relativistic effects can be accounted for in full generality (Müller et al., 2010; Shibata & Sekiguchi, 2012; Richers et al., 2015; Foucart et al., 2016b; Nagakura et al., 2017a) or by using only up to terms (e.g., Rampp & Janka 2002; Lentz et al. 2015; Just et al. 2015a; Dolence et al. 2015; Skinner et al. 2016). Even if the transport method is equivalent, simulations differ in how neutrino-matter and neutrino-neutrino interactions are treated (e.g., Lentz et al. 2012a). In short, each piece of relevant physics can be simulated accurately, but 3D simulations containing all pieces remain a goal that has not yet been achieved.
Any computational method is an approximation of reality, and every method has strengths and weaknesses. It is therefore expected that computations performed by different codes should arrive at different solutions, though they should converge to the physical answer with increasing simulation fidelity. Understanding the weaknesses of a given method is a prerequisite to interpreting the physical meaning of simulation results. It is standard practice to test that codes produce known solutions to simple problems and to perform self-convergence tests to ensure that results are not mistakes or numerical artifacts. However, even with these practices in place, different codes produce different results and independent verification is required to help determine which features of each are realistic (Calder et al., 2002).
Several works in the past have evaluated the accuracy of a low-order method like FLD or two moment transport by comparing with a high-order DO, MC, or VET method (e.g., Janka 1992; Mezzacappa & Bruenn 1993; Messer et al. 1998; Burrows et al. 2000; Liebendörfer et al. 2004; Ott et al. 2008). However, in these comparisons, the low-order method is not expected to converge to the same result as the high-order method, which does not help to verify the high-order method. There have been few detailed comparisons between high-order methods that solve the same full Boltzmann equation (i.e., VET, DO, and MC methods), and none in more than one spatial dimension. Yamada et al. (1999) compared the results of a new DO implementation to the MC code of Janka (1991) in 1D GR snapshots of CCSN simulations, but ignored fluid motion. Liebendörfer et al. (2005) performed a comparison of dynamical 1D CCSN simulations using the GR DO code Agile-BOLTZTRAN (Liebendörfer et al., 2004) with the Newtonian VET code VERTEX-PROMETHEUS (Rampp & Janka, 2002). They found very good agreement once an effective potential was introduced to VERTEX-PROMETHEUS to account for GR effects (Marek et al., 2006). Several groups have since used the results of Liebendörfer et al. (2005) as a standard for comparison (e.g., Sumiyoshi et al. 2005; Müller et al. 2010; Suwa et al. 2011; Lentz et al. 2012b, a; O’Connor & Ott 2013; O’Connor 2015; Just et al. 2015b; Suwa et al. 2016). With the recent arrival of the many advanced multi-dimensional neutrino radiation hydrodynamics codes mentioned previously, continued independent verification is essential to interpreting simulation results.
In this paper, we perform the first detailed multi-dimensional comparison between fully special relativistic Boltzmann neutrino transport codes using a DO neutrino radiation hydrodynamics code (Nagakura et al., 2017a) (hereafter NSY) and the MC radiation transport code Sedonu (Richers et al., 2015). We make the time-independent comparisons on spherically symmetric (1D) and in axisymmetric (2D) snapshots from CCSN simulations at around after core bounce. Both codes are carefully configured to calculate the full steady-state neutrino distribution function from first principles in as similar a manner as possible. We find remarkably good agreement in all spectral, angular, and fluid interaction quantities, lending confidence to both methods. The MC method predicts sharper angular features due to the effectively infinite angular resolution, but struggles to drive down noise in quantities where subtractive cancellation is prevalent (e.g., net gain within the protoneutron star and off-diagonal components of the Eddington tensor). We test the importance of accounting for fluid velocities by setting all velocities to zero and find that the differences induced are much smaller than the errors due to finite momentum-space resolution. We compare directly computed second angular moments to those predicted by popular two-moment closures, and find that the error from the approximate closure is comparable to the difference between the DO and MC methods.
This paper is organized as follows. In Section 2, we review the discrete ordinates, Monte Carlo, and two moment methods. We present the results of the transport method comparisons in spherical symmetry in Section 4 and in axial symmetry in Section 5. We summarize our conclusions in Section 6. The MC transport code Sedonu is publicly available at https://bitbucket.org/srichers/sedonu and the results obtained in this study from both transport codes are available at https://stellarcollapse.org/MCvsDO.
2. Numerical Methods of Neutrino Transport
The transport of classical neutrinos is described in general by the Boltzmann equation (e.g., Lindquist 1966; Ehlers 1993; Mihalas & Weibel-Mihalas 1999):
[TABLE]
where is an affine parameter. In a coordinate basis, the geodesic equation gives
[TABLE]
where are the neutrino four-momenta, are the four spacetime coordinates, and are Christoffel symbols of the spacetime metric. In an orthogonal coordinate system in flat spacetime, the distribution function of each neutrino species is defined as
[TABLE]
where is the number of neutrinos, is the volume element at position , is the time, is the neutrino direction, and is the neutrino energy. The three source terms on the right hand side of Equation 1 are interaction terms from emission and absorption, scattering, and pair processes, respectively. The neutrino propagation is generally calculated in reference to coordinates defined in the lab frame, but interactions between matter and neutrinos are formulated in the fluid rest frame (a.k.a. the comoving frame). It is thus important to very carefully keep track of the frame in which various quantities are defined. This is consistent with widely used conventions in the relativistic neutrino transport community (e.g., Shibata et al. 2011; Cardall et al. 2013).
The interaction terms are all local and formulated in a frame comoving with the underlying fluid. The emission and absorption terms are the simplest, as they depend linearly on the distribution function of a given neutrino species. The scattering term in general depends quadratically on the distribution function of a given species, since neutrinos are fermions and the reaction is inhibited by final-state neutrino blocking. However, under the assumption of isoenergetic scattering, it reduces to a linear dependence (see Appendix D). The pair term, which includes neutrino pair annihilation and creation and neutrino bremsstrahlung, depends on the product of the distribution functions of the species and anti-species. Our static transport calculations solve for the that satisfies .
There are six species of neutrinos in the standard model corresponding to the six leptonic species (,, and their anti-particles). Electron neutrinos and anti-neutrinos interact with nucleons via both charged current and neutral current processes, while heavy lepton neutrinos interact only via neutral current processes. This makes each heavy lepton neutrino species individually less impactful than electron anti/neutrinos and making all four species behave very similarly. In light of this, we simulate and individually, but group all of the heavy-lepton neutrinos into a single simulated species for computational efficiency.
Neutrino interaction rates depend on the properties of the fluid through which they traverse. In this study, we use the non-hyperonic equation of state (EOS) of Shen et al. (2011) to determine the abundances and chemical potentials of each constituent (i.e., leptons, nucleons, and nuclei) given the fluid density, temperature, and electron fraction. We consider the following minimum but essential sets of neutrino-matter interactions in the postbounce phase of CCSNe:
[TABLE]
where . From top to bottom, these processes are electron capture by free protons, positron capture by free neutrons, isoenergetic scattering with nucleons, electron-positron pair annihilation, and nucleon-nucleon bremsstrahlung, along with each of their inverse reactions.
Though a multitude of phenomenological, approximate, and exact transport methods exist in the literature, we will focus on three of them. The discrete ordinates method (DO, Section 2.1) and the Monte Carlo (MC, Section 2.2) methods described here both solve the Boltzmann equation directly in all three momentum dimensions and multiple spatial dimensions, and so should converge to the same physical result. We also investigate how well approximate closure relations in the two-moment method (Section 3) compare to the solutions computed by DO and MC calculations.
2.1. Discrete Ordinates
The DO Boltzmann code of Nagakura, Sumiyoshi, and Yamada (hereafter NSY) is a grid-based multi-dimensional neutrino radiation hydrodynamics code that solves the conservative form of Equation 1 in the language of the 3+1 formulation of general relativity (GR). The numerical method is essentially the same as described by Sumiyoshi & Yamada (2012), though it has since been extended to account for special relativistic effects as has been coupled with Newtonian hydrodynamics (Nagakura et al., 2014, 2017a). The newest version of this code was recently applied to axisymmetric CCSN simulations in Nagakura et al. (2017b).
The neutrino distribution function is discretized onto a spherical-polar spatial grid described by radius , polar angle , and azimuthal angle . The radial grid is constructed so as to provide good resolution where the density gradient is large. The radial mesh spacing is set to at the center and decreases with increasing radius up to the location of the steepest density gradient at , where . For , the spacing increases by per zone up to . For , the spacing increases by per zone up to the outer boundary of . This results in 384 radial grid zones over the entire domain. The spatial angular grid is set to 128 Gaussian quadrature points from . At each spatial location, is discretized onto a spherical-polar momentum space grid described by neutrino energy , neutrino polar angle (where is in the radial direction), and neutrino azimuthal angle . The first bin of the neutrino energy grid extends over in the 1D_1x and 2D calculations, in the 1D_2x calculations, and in the 1D_4x calculations. The rest of the energy bins are logarithmically spaced from to . The number of energy and direction bins used in each simulation is listed in Table 1.
The NSY code treats the advection terms in the GR Boltzmann equation semi-implicitly. Both advection and collision terms are implemented self-consistently by using a mixed-frame approach with separate momentum-space grids in the lab and comoving frames. See Appendix A and Nagakura et al. (2017a) for implementation details.
Though the NSY code is capable of evolving coupled neutrino radiation hydrodynamics, we restrict the capability of the code in this study to evolve only the radiation field on top of a fixed fluid background with a flat spacetime metric until a nearly steady-state solution is reached. As we list in Table 1, the maximum time variability in the energy density at any spatial location relative to the value averaged over is less than , which is significantly smaller than difference between DO and MC results.
2.2. Monte Carlo
The MC method for radiation transport is a probabilistic implementation of a reformulated Equation 1, making it fundamentally different from the DO method. We employ the Sedonu MC neutrino transport code (Richers et al., 2015) to solve for equilibrium neutrino radiation fields and neutrino-matter interaction rates. The neutrino radiation field of each neutrino species is discretized into neutrino packets, each with some total packet energy representing a large number of neutrinos at the same location with the same individual energy and the same direction . The neutrino motion itself is always computed in the lab frame, and special relativistic effects are accounted for by explicitly Lorentz transforming into and out of the comoving frame for interactions with the background fluid.
Neutrinos are emitted from the fluid within each grid cell at a random location, with an isotropically random direction in the comoving frame, and with a random comoving energy according to the energy-dependent emissivity. The comoving packet energies are set such that the grid cell emits with a luminosity determined by the energy-integrated emissivity.
Each particle moves linearly in a series of discrete steps in the lab frame. The size of each step is the minimum of a random distance determined by the scattering opacity and the grid cell distance . is set to the maximum of the distance to the grid cell boundary and 1% of the grid cell’s smallest dimension to prevent neutrinos from getting stuck at cell boundaries. After each step, the packet energy is diminished by a factor of , where is the absorption opacity and is the length of the step. When the packet undergoes an elastic scatter, a new direction is chosen isotropically randomly in the comoving frame. The absorption and scattering opacities are re-evaluated when the neutrino enters a new grid cell. If the cell is optically thick to neutrinos, we use a time-independent relativistic version of the Monte Carlo random walk approximation (Fleck & Canfield, 1984) to allow the neutrino to make a large step through many effective isotropic elastic scatters (see Appendix B).
The neutrino heating rates and radiation field quantities output by Sedonu are the steady-state quantities by construction, and require no concept of relaxation time. Upon emission, each neutrino packet energy is set assuming an arbitrary emission time of and is allowed to propagate until it leaves the simulation domain. After each step, Sedonu accumulates in the radiation field energy-direction bins, where is again the distance the packet moves and is the packet energy averaged over the step. Sedonu additionally accumulates the amount of the packet energy that is absorbed into the fluid. The steady-state radiation energy density is obtained by normalizing the accumulated radiation field by , where is the cell volume. The steady-state comoving frame neutrino specific heating rate is obtained by normalizing the deposited energy by , where is the fluid rest density. This method also provides a large speedup over time-dependent Monte Carlo radiation transport for time-independent problems.
In this study, we perform the transport on 1D and 2D fluid grids in spherical-polar coordinates that are identical to those employed by the NSY code. We tally the radiation field in two different ways. In the first, we use energy and direction bins identical to those used by the NSY code. The data output is thus discretized, even though the neutrinos themselves are always transported through continuous space and are not influenced by any grid structure except in evaluating the opacities. In the second way (“native”), we accumulate neutrino energy directly into angular moments without any reference to a discrete direction grid, though we still use the same energy bins. The version of Sedonu used in this paper is fundamentally the same as that used in Richers et al. (2015), except that it includes the Monte Carlo random walk approximation for regions of large optical depth, the native moment prescription, and various performance and usability upgrades. Sedonu is open source and available at https://bitbucket.org/srichers/sedonu.
3. Eddington Tensor Analysis
The so-called local two-moment transport method (Pomraning, 1969; Anderson & Spiegel, 1972; Thorne, 1981; Shibata et al., 2011; Cardall et al., 2013) is the current state of the art method for time-dependent multi-dimensional simulations of neutrino radiation hydrodynamics (e.g., O’Connor & Couch 2015; Just et al. 2015b; Foucart et al. 2016a; Roberts et al. 2016; Kuroda et al. 2016; Radice et al. 2017)111Higher-order transport calculations (e.g., Müller et al. 2010) are used in multiple dimensions via the ray-by-ray approximation (e.g., Burrows et al. 1995; Buras et al. 2006), but we do not address these in this paper.. In the two-moment method, Equation 1 is again reformulated, this time in terms of specific moments of the distribution function. In an orthonormal coordinate system, these moments are defined by
[TABLE]
where is the neutrino energy and are the basis vectors. The ellipsis denotes that there is an infinite list of moments that can be used to reconstruct the two-dimensional angular dependence of the distribution function, much like an infinite list of terms in a Taylor series can be used to reconstruct a one-dimensional function. Each of these specific moments () can be integrated in energy according to
[TABLE]
to get the total neutrino energy density, energy flux, etc..
In the two-moment method, only the first two moments are evolved and are assumed to provide a good enough representation of the full distribution function. The evolution equations for each moment depend on higher-order moments. In a local two-moment scheme, the pressure tensor and higher-order moments are estimated based on the energy density and flux at the same spatial location. This estimate is referred to as a closure relation, many of which have been proposed based on various motivations (e.g., Smit et al. 2000; Murchikova et al. 2017 and references therein).
In this study, we do not perform two-moment radiation transport, but rather compare the radiation pressure tensor predicted by approximate closures to the actual radiation pressure tensor output from the MC and DO calculations. We consider three popular approximate closure relations based on different physical motivations: the maximum entropy closure of Minerbo (1978) in the classical limit (Minerbo), the isotropic rest-frame closure of Levermore (1984) (Levermore), and the closure of Janka (1991) in the form presented in Just et al. (2015b) that is empirically based on MC calculations of neutrino transport in protoneutron stars. In all three cases, the pressure tensor is expressed as an interpolation between optically thick and thin limits where the solution is known analytically:
[TABLE]
where
[TABLE]
and is the identity matrix. In our analysis, we ignore the term for simplicity. The different closure relations are defined by
[TABLE]
is referred to as the the flux factor. is referred to as the Eddington tensor, and in the context of spherically-symmetric radiation transport, is referred to as the Eddington factor.
4. Transport Comparison in Spherical Symmetry
We perform several spherically symmetric (1D) steady-state neutrino transport calculations using different momentum-space treatments listed in Table 1. In the 1D_1x, 1D_2x, and 1D_4x simulations, the neutrino radiation field is discretized into the number of angular and energy bins described in the “Spatial Resolution” and “Angular Resolution” columns. They differ only by the number of spatial and energy bins, and are all run in each the NSY code and Sedonu. The 1D_4x_nonrel calculation (performed only by the NSY code) is identical to the 1D_4x calculation, except with all velocities set to zero. In the 1D_4x_native calculation performed only by Sedonu, MC particles are accumulated directly into angular moments rather into angular bins. The 1D_4x_native_nonrel calculation is identical to the 1D_native calculation, but with all velocities set to zero. Note that length contraction causes the simulations that include relativistic effects to have a slightly larger total rest mass, but only by . Throughout this section, we primarily compare the highest-resolution DO calculation (1D_4x) to the highest-resolution native-moment MC calculation (1D_4x_native). We use the lower resolution calculations (1D_1x and 1D_2x) in resolution comparisons.
In Figure 1, we show the static fluid background that comes from a spherically symmetric neutrino radiation hydrodynamics simulation of the collapse of a star (Woosley et al., 2002) at after core bounce using the NSY code (Nagakura et al., 2017a). For the calculations in this paper, the NSY code is again used to calculate a steady-state solution of the neutrino radiation field on this background using the DO method. The opacities and emissivities are then exported to Sedonu, which computes a steady-state radiation field on the same background.
In Figure 2 we show radial profiles of the total energy density of each neutrino species using both the 1D_4x DO calculation and the 1D_4x_native MC calculation. For each species, the energy density is approximately constant in the inner core () as neutrinos are trapped and in equilibrium with the fluid. In this region, the neutrinos have a Fermi-Dirac distribution function, and so the energy density is determined entirely by the fluid temperature and the electron and nucleon chemical potentials. In the outer (i.e., shock-processed) core at , the temperature is very high (), and many more electron anti-neutrinos and heavy lepton neutrinos are emitted than in the cooler inner core. Beyond the energy-averaged neutrinospheres (, depending on the neutrino species), the neutrinos are only weakly coupled to the fluid and the energy density decreases as .
Both codes produce remarkably similar results, with differences in the energy density (Figure 2) smaller than 2% everywhere within the shock. The remaining differences near the energy-averaged neutrinospheres (, depending on species) are due to the MC random walk approximation, and decrease when the critical optical depth is increased (see Appendix B). The differences outside of the decoupling region (, depending on species) decreases with increasing DO directional angular resolution. Outside of the shock, the difference in the energy density grows as the NSY code experiences a small departure from scaling of the energy density. This is an artifact of the finite spatial resolution. The size of the error visibly increases at , where the radial resolution coarsens.
Figure 3 shows a quantity akin to the comoving-frame average neutrino energy222Recall that we used mixed frame quantities, since many GR transport schemes are formulated in terms of lab-frame energy density and comoving-frame neutrino energy (e.g., Shibata et al. 2011)., defined as
[TABLE]
where is the lab-frame energy density in bin and is the comoving-frame bin central energy. Because neutrinos are in equilibrium with the fluid below the decoupling region (, depending on species), they have a Fermi-Dirac distribution function that depends only on the fluid temperature and electron and nucleon chemical potentials. The neutrino absorption opacity scales approximately as , so high-energy neutrinos are preferentially absorbed, causing the average neutrino energy to continuously decrease with radius. The average energy jumps at the shock front, since after passing the shock front, neutrinos are moving through matter falling with speeds of . The comoving-frame neutrino energy is thus Doppler boosted even though the lab-frame energy density is constant across the shock.
The differences in the average neutrino energy between the 1D_4x DO calculation and the 1D_4x_native MC calculation are smaller than about within the shock. Analyzing the various potential sources of errors, the differences at are simply statistical noise that decreases with the square root of the number of MC neutrino packets simulated. The differences below the shock are primarily due to the MC random walk approximation error, and decreases with an increased critical optical depth (see Appendix B) independent of the momentum-space resolution. The differences near and outside the shock are a result of finite energy resolution, which results in interpolation error when the NSY code transforms energy and direction bins between the comoving and lab frames.
We plot the direction-integrated neutrino energy density spectra at for each neutrino species in Figure 4. This point is below the shock in the semi-transparent region. The results of the 1D_4x DO calculation and the 1D_4x_native MC calculations agree in every bin with at most of the peak value. In energy bins with little energy density, the relative errors become quite large, but bins with such small energy density have much less dynamical effect in CCSN simulations. Some statistical noise from the MC calculation can be seen in the range of , but the small overall offsets are due to the finite neutrino energy and angular resolution.
Figure 5 shows the lab-frame energy-integrated heavy lepton neutrino distribution function at three separate radii. The red curves are from the radius where and show that the neutrinos are nearly isotropic. The blue curves are from near the shock front and are nearly free-streaming and very forward-peaked, as almost no neutrinos are moving inward. The green lines show the distribution function at an intermediate location of () between the trapped and free-streaming limits. In the plot, the distribution function is normalized by the largest value so the shapes can be easily compared. We assume a constant value for the distribution within the directional bin in the forward direction and linearly interpolate the distribution function for all other directions. This is done to ensure that in post-processing the value of the distribution function never exceeds one. However, this gives rise to the artificially flattened nose of the distribution functions most apparent in the blue curves.
The thickest lines in Figure 5 are from high-resolution 1D_4x MC and DO simulations, while thinner lines indicate lower resolution. The 1D_4x_native MC simulation does not collect data on a grid of discrete angular bins, so its results cannot be used to make such a plot. The importance of the angular resolution is very apparent for the blue curves at the shock front, since most of the neutrino energy is in a single angular bin. The MC results look remarkably similar to the DO results, though a lack of numerical diffusion in the MC calculations allows for slightly more sharply forward-peaked distribution functions for a given angular resolution. This angular dependence is reflected in all angular moments of the distribution function.
In Figure 6, we show the energy-integrated lab-frame radial flux factor (, see Equation 4) of the distribution function of all three neutrino species using the 1D_4x DO calculation and the 1D_4x_native MC calculation. Below around the neutrinos are trapped and the distribution function is nearly isotropic, resulting in a minuscule flux relative to the energy density (corresponding to the red curves in Figure 5). In the transition region (corresponding to the green curves in Figure 5), an increasing fraction of the neutrino radiation energy is moving radially outward, causing the flux factor to approach 1 at large radii (corresponding to the blue curves in Figure 5). and are identically zero due to spherical symmetry.
The angular moments of the radiation field are naturally sensitive to the angular grid resolution. We see small differences of at most around 0.02 in the flux factor, but the size of this difference scales approximately linearly with the angular grid size for calculations with a coarser angular grid (not plotted). Sedonu consistently predicts a more rapid transition to free streaming than does the NSY code. Here the MC method shows a significant advantage in that by computing moments directly rather than post-processing from an angular grid, we get angular moments with effectively infinite angular resolution. The NSY code comes very close to this solution, but suffers from some angular diffusion. This causes the NSY code to predict distribution functions that are slightly, though artificially, more isotropic. The difference approaches a small but constant value at large radii, where almost all of the energy in the DO calculations is contained in the single outward-pointing angular bin. The Sedonu results are, however, visibly noisy in the difference plot, since subtractive cancellation tends to amplify statistical noise. There is a small hump visible in the heavy lepton neutrino difference plot just below that is a result of the MC random walk approximation. The size of this hump decreases when the critical optical depth is increased (see Appendix B), bringing it closer to the electron neutrino and electron anti-neutrino difference curves.
In Figure 7, we show the component of the energy-integrated lab-frame Eddington tensor (, see Equation 4) of the distribution function of all three neutrino species. Only the diagonal components of the Eddington tensor (,, and ) are nonzero in spherical symmetry. At , all diagonal components of the Eddington tensor are because the radiation is nearly isotropic. After decoupling, the component approaches unity as all radiation is moving radially outward, while the and components (not plotted) approach zero.
Once again, the differences between Sedonu and the NSY code scale approximately linearly with the neutrino direction angular zone sizes. However, the maximum difference of 0.03 is larger than the maximum flux factor difference of 0.02. Unlike with previously discussed radiation quantities, the random walk approximation does not add significant error to . Though we do not plot or , the differences between MC and DO results are similar to those of . Since the integral in Equation 4 contains a factor of , the results do not suffer from subtractive cancellation and the amount of statistical noise is significantly lower than that of the flux factor.
The dependence of the angular moments on angular resolution can be clearly seen in Figure 8, where we plot the component of the Eddington tensor for four MC calculations (solid lines) and three DO calculations (dashed lines). We first direct our attention to the blue lines, corresponding to the low resolution 1D_1x DO and MC calculations. Even though both are post-processed in the same way from the same angular grid, the MC results transition to large values of faster than do the DO results. At , saturates at the maximum value possible given the angular resolution, which the DO results approach at large radii. The same is true for the higher resolution green and red curves, but the saturation occurs at a larger radius and is not so visibly obvious.
Due to the effectively infinite angular resolution of the 1D_4x_native MC calculation, the corresponding black line in Figure 8 can be thought of as exact, modulo a small amount of MC noise. Going from coarsest to finest resolution, the maximum difference between the DO results and the black curve are 0.125, 0.057, and 0.028. This corresponds to a factor of 2.2 improvement when going from 1x to 2x resolution, and a factor of 2.0 when going from 2x to 4x resolution. Similarly, the maximum difference between the gridded MC and the native MC results are, in order of increasing resolution, 0.0896, 0.0243, and 0.0062. The accuracy improves by a factor of 3.7 when going from 1x to 2x resolution, and by a factor of 3.9 when going from 2x to 4x resolution. This trend, where DO results are near first order convergence and gridded MC results are near second order convergence, is apparent in the flux factor results as well. This is because the post-processing angular integration scheme is second order (except in the forward-most bin, where it is first order), but the evolution scheme in the NSY code is only first order.
The use of an approximate analytic closure in two-moment radiation transport schemes is significantly faster than either the DO or MC methods. However, since there are many reasonable closures available, it is of great interest to evaluate how well these closures perform against our full Boltzmann results. We re-plot the electron neutrino curve (black line in Figure 8) from the 1D_4x_native MC calculation as a solid black line in Figure 9. We then use and from the same MC calculation to estimate using the three analytic closures given in Equation 8. The Janka and Minerbo closures perform similarly and have a maximum difference with MC of , which is comparable to the differences between the 1D_4x DO calculation and the 1D_4x_native MC calculation. The Levermore closure, however, performs better, with a maximum difference of . This is significantly better than the accuracy of any DO result and is comparable to the accuracy of the 1D_4x MC calculation. These results are also replicated in a similar analysis of and (not plotted). In short, analytic closures perform remarkably well in this particular steady-state spherically-symmetric transport calculation.
The primary role of neutrinos in the explosion mechanism of CCSNe is redistributing thermal energy from the protoneutron star region to the gain region that drives turbulence and pushes the shock outward. The relevance of these detailed transport calculations comes down to how the differences between the methods affect the heating and cooling of matter in the supernova. In Figure 10, we show the comoving-frame net gain, i.e., the heating rate less the cooling rate due to neutrinos. We show results from the 1D_4x MC and DO calculations (red line), the 1D_4x_nonrel MC and DO calculations (green line), and the 1D_1x MC and DO calculations (blue line). Below about the fluid is overall cooling, and the emitted neutrinos pass through the gain region from to and deposit energy. Below the shock, the net gain from the 1D_4x MC and DO calculations are very similar, with differences of of the peaks in the gain curve.
Just outside the shock, the fluid densities are low and most nucleons are bound in nuclei. Because of this, the heating is due primarily to neutrino pair annihilation. The pair annihilation rates are under-resolved in neutrino energy space even with 80 energy bins, resulting in significant differences between heating rates of different resolutions. However, test results show only a difference between the 1D_4x results and a test with an energy-space resolution of 160 bins. We can use the radial profiles of heating rate, density, and velocity to estimate the amount of energy per nucleon the fluid is heated before passing through the shock as
[TABLE]
where is the heating rate. Using the heating rate from the highest-resolution simulations, this predicts a total heating of . Compared to the post-shock temperatures of and the iron-56 binding energy of , this pre-shock heating is insignificant.
The differences between MC and DO are amplified outside the shock, where we must divide a volumetric heating rate (in erg cm*-3* s*-1*) by the low density to get a specific heating rate. Also, recall that in our calculation of pair annihilation rates, we assume that the neutrino distribution functions are isotropic (see Appendix D for details). At large radii relative to the neutrinospheres and to leading order, however, the angular dependence actual reaction rates is (e.g., Bruenn 1985)
[TABLE]
where is the neutrinosphere radius. The location of the neutrinosphere depends on the neutrino species and energy, but for a typical radius of this angular term scales the reaction rate by a factor of at the shock. Thus, we expect the heating rates (and hence the heating rate differences) to be over-estimated by a factor of at the shock.
Including velocity-dependence in neutrino transport algorithms is a complication that can significantly increase the complexity and cost of the transport calculation. It is natural to attempt to quantify the size of the error made in codes that neglect velocity dependence. We repeat the high-resolution calculations with the same rest-frame fluid profile shown in Figure 1, but set all velocities to zero. Velocity dependence changes the comoving frame neutrino energies and directions, modifying the rates at which neutrinos interact with the fluid. This has a very minor effect below the shock, but significantly changes the heating rate outside the shock where velocities are . However, the density drops by a factor of 10 across the shock and the pre-shock fluid moves so quickly that the overall heating is negligible. These small errors outside the shock are unlikely to have a significant impact on simulation results. The volume-integrated net gain in the gain region (where there is net heating under the shock) is in the 1D_4x DO calculation and in the 1D_4x_native MC calculation, a difference of only . Compare this to the difference of the same quantity between the 1D_4x and 1D_1x DO calculations of and between the 1D_4x and 1D_4x_nonrel DO calculations of . Though including velocity dependence impacts the heating rates more than the differences between the codes in the highest resolution case, low resolution can cause significantly larger inaccuracies.
5. Transport Comparison in Axisymmetry
In this section, we describe results from the first multi-dimensional comparison of Boltzmann-level neutrino transport codes. We present a set of four axisymmetric time-independent neutrino transport calculations as listed in Table 1. Once again, the NSY code is used to calculate an approximately steady-state solution and the opacities and emissivities are exported from the completed NSY calculations to Sedonu for the MC calculation. Due to computational cost, we only consider two resolutions in the DO code. The low-resolution (2D_LR) calculations have momentum-space resolution equivalent to the 1D_1x calculations, and the high-resolution (2D_HR) calculations have momentum-space resolution between that of the 1D_1x and 1D_2x calculations.
The rest-frame fluid profile used in all simulations shown in Figure 11 comes from a 2D simulation of the collapse of the same star (Woosley et al., 2002) used in Section 4 (Nagakura et al., 2017a). In Figure 11 and in all other colormap plots in this section, data separated into quadrants shows data from the northern hemisphere of the calculation only to ease visual comparison of datasets. Data on half-circles show the full simulation domain out to . In Figure 11, multi-dimensional structure in all fluid quantities is apparent and is due to neutrino-driven turbulent convection. The postshock velocity field in particular shows fluid speeds up to , compared to the maximum radial velocity of in the 1D calculations. This multi-dimensional structure provides a challenge for any radiation transport method.
We begin with a comparison of the spectral properties of the results from Sedonu and the NSY code. Figure 12 shows the comoving-frame average energy of each of the three simulated neutrino species for the 2D_HR DO calculation and the 2D_HR_native MC calculation. Just as in the 1D calculations, the electron neutrinos have the highest energy in the inner core due to the high electron chemical potential, and the lowest energy at large radii since they decouple at the largest radius and the lowest matter temperature. The DO and MC results are nearly identical, so differences can only be seen in the right half of each plot, where we subtract the DO results from the MC results. The average energies differ between the MC and DO results by at most 0.5 MeV, which is larger than in the 1D results in Figure 3 due to the lower energy resolution. When electron neutrinos and anti-neutrinos decouple from matter, the opacity is dominated by absorption. Because of this, MC packets that use the random walk approximation quickly lose energy, preventing errors from the random walk approximation from propagating outward through the rest of the domain. However, the heavy lepton neutrino opacity is dominated by scattering, so MC packets carrying errors from the random walk approximation retain their energy when traveling through the rest of the domain, causing errors to be slightly higher. Just as in the 1D results, the differences between the MC and DO average energies error jump across the shock, since the NSY code suffers from numerical diffusion when transforming between grids in the two grid approach (Appendix A). There are also a number of hot spots in the average energy differences within the shock, which correspond to regions of high velocity in Figure 11. These differences are also because of some numerical diffusion in the two-grid approach. Heavy lepton neutrinos are more strongly impacted by the protoneutron star convection, since they decouple at a smaller radius. The features visible in the heavy lepton neutrino average energy difference plot (rightmost panel of Figure 12) at small radii are diminished by reducing the MC random walk critical optical depth, independent of momentum space resolution.
Though the energy resolution is coarser than in the 1D calculations, we are able to compare the full spectra at a given location. For Figure 13, we choose the same radius of used for Figure 4 and an angle of from the north pole. We plot the direction-integrated spectra of all three neutrino species using the 2D_HR DO calculation and the 2D_HR_native MC calculation. The neutrino energy density within each comoving frame energy bin is measured in the lab frame and the individual neutrino energy in the comoving frame, resulting in a mixed-frame quantity. The results are remarkably similar, and effectively reproduce the 1D results. The heights of the peaks differ by , which is comparable to the differences between MC and DO results in the energy density in the lower-resolution 1D results.
In Figure 14, we plot the energy-integrated lab-frame electron neutrino flux factor and Eddington tensor using the 2D_HR DO calculation and the 2D_HR_native MC calculation. These plots effectively reproduce the 1D angular moment results in Figures 6 and 7. We again see that MC results exhibit a quicker transition to a forward-peaked distribution, that the errors in the second moment () are larger than in the first (), and that the MC noise in the second moment is smaller than in the first. The magnitude of the differences in of at the shock can be compared to the 1D results in Figure 8. The 2D_HR differences are between the 1D_1x and 1D_2x differences, reflecting the fact that the 2D_HR angular resolution is between that of the 1D_1x and 1D_2x calculations. We also demonstrate this resolution dependence in the leftmost (for ) and center (for ) plots of Figure 15. The top left quadrant of each shows the difference between the moments calculated using the 2D_HR DO calculation and the 2D_HR_native MC calculation. The bottom left quadrant shows the difference between the 2D_LR DO calculation and the same MC calculation. The differences are significantly smaller for the higher resolution DO calculation, indicating that the DO results are converging to the MC results with increasing resolution.
Unlike in spherical symmetry, is not identically zero in axisymmetry and is thus a sensitive probe of multidimensional effects on the radiation field. and are still identically zero, since we do not consider azimuthal fluid velocities. In Figure 16, we plot the energy-integrated lab-frame for all three neutrino species using the 2D_HR DO calculation and the 2D_HR_native MC calculation. Since the dominant neutrino propagation direction is radial, the off-diagonal components of the pressure tensor are strongly correlated with the corresponding flux. In this particular snapshot, happens to be overall mostly positive, and we find the morphology to be indeed very similar to (not shown). Generally, both positive and negative values are to be expected (see Nagakura et al. 2017a). It is interesting to note that the electron neutrino and electron anti-neutrino plots have complementary hot spots. Within the protoneutron star, non-radial neutrino fluxes are present due to turbulent fluid carrying trapped neutrinos. Outside of the convective zone of the PNS but still within the neutrinospheres, electron neutrinos tend to diffuse away from regions of high electron chemical potential while electron anti-neutrinos diffuse toward them. In tests where the inner is excluded from the calculation, the distribution is much more uniform, suggesting that the hot spots are due to a combination of anisotropic neutrinos from the neutrinosphere interacting with multi-dimensional features in the fluid background.
Once again, the MC and DO results for look remarkably similar. Unlike for the diagonal moments, much subtractive cancellation occurs when computing , which in turn requires a large number of MC particles to drive down the noise. Similar to the other moments, the MC calculation tends to show larger values of , since its effectively infinite angular resolution is able to resolve finer angular structures. We demonstrate this resolution dependence in the rightmost plot of Figure 15. The top left quadrant shows the difference between the electron neutrino from the 2D_HR DO and 2D_HR_native MC calculations, while the bottom left quadrant compares the 2D_LR DO calculation to the same MC calculation. The differences are significantly larger for the lower-resolution calculation, indicating that the DO calculation is converging to the MC result with increasing angular resolution.
Figure 17 compares components of the electron neutrino Eddington tensor computed by the 2D_HR DO calculation to those predicted by the Levermore closure using and from the same DO calculation. We demonstrated in Section 4 that in our spherically symmetric snapshot, the Levermore closure predicts and from only the flux factor with an accuracy within 0.01 of the actual Eddington tensor value. This result is reproduced in two dimensions for electron neutrinos in Figure 17. The leftmost and center plots show and , respectively. The top left quadrant of each shows the moment computed directly from the 2D_HR DO calculation (same data as depicted in Figure 14), and the bottom left quadrant shows the same moment predicted by the Levermore closure. They are visually identical, and the error plotted on the right side of each plot shows a maximum error of in and a maximum error of 0.0089 in . Though there is some multi-dimensional structure in how accurately the Levermore closure predicts the diagonal components, this effectively mirrors the results of the 1D calculations. The rightmost plot shows (same data as in Figure 16), multiplied by 10 for visibility on this color scale. The Levermore closure predicts this component of the moment within 0.0077. This is large compared to this component’s maximum value of 0.012 and compared to a difference of between DO and MC results. Thus, though this analytic closure has small relative errors for the diagonal components of the Eddington tensor, it has difficulty accurately predicting the small off-diagonal components in this CCSN snapshot. The Minerbo and Janka closures show errors at smaller radii, but the extrema of these errors are only slightly larger than those of the Levermore closure. Other neutrino species behave very similarly, except that the heavy lepton neutrino values for (and hence errors) are significantly smaller.
Ignoring special relativistic effects in radiation transport calculations greatly simplifies the problem. Fluid velocities are also generally only a few percent of the speed of light below the shock, but are larger in 2D () than in 1D (). We test the effects of ignoring fluid velocities in Figure 15, where we plot the difference between the 2D_LR_nonrel and 2D_LR DO calculations. The error in and from ignoring velocities is much smaller than the difference between MC and DO calculations or the difference between resolutions. The only exception is in the convective region of the protoneutron star, where the flux is determined entirely by the fluid velocity because the neutrinos are trapped. The magnitude of this error is at most comparable to the error induced by the coarse resolution, and is significantly smaller than the error in all components of the second moment predicted by the Levermore closure (Figure 17).
Finally, in Figure 18, we investigate how these different transport schemes affect the actual heating and cooling rates of the fluid. Once again, we show the results from the 2D_HR DO calculation in the top left quadrant, which outside the core appears visually identical to the 2D_HR_native MC calculation results in the bottom left quadrant. Just as in Figure 10, there is significant statistical noise within the core, where neutrinos are largely in equilibrium with the matter. We depict the difference between these results on the right half of the plot. The MC results show more rapid cooling in the cooling region and more rapid heating in the outer regions of the gain layer, but only by a few percent of the maximum net gain outside of the protoneutron star. This is similar to the behavior of the lower-resolution 1D results in Figure 10, where MC calculations predict a smaller gain than do the DO calculations at for the 1D_2x calculation and at for the 1D_1x calculations. The 2D MC calculation also predicts larger heating than does the 2D_HR DO calculation in regions of high inward velocity. This is again an effect of the limited momentum-space resolution in the DO calculations that make the two-grid approach somewhat diffusive in angle and energy. This is to be expected given that the average neutrino energies in these regions (Figure 12) are higher in the MC calculations. Overall, excluding the noisy region in the core, these errors are at most of the amplitude of the net gain curve in Figure 10.
The volume-integrated gain of the gain region (where there is net heating under the shock) is in the 2D_HR DO calculation and in the 2D_HR MC calculation, which is only a difference. Compare to this the relative error of the same quantity between the 2D_LR and 2D_HR DO calculations of , which is smaller than in the 1D resolution comparison because our 2D resolutions are much more similar. Even so, the errors from low resolution are significantly more significant than the differences between the methods. The difference in integrated heating rate between the 2D_LR and 2D_LR_nonrel DO calculations is . This is larger than in the 1D relativity comparison because fluid velocities under the shock are larger in the 2D calculations than in the 1D calculations due to convective motion. Note that the integrated heating rate should not be compared with the 1D results because the fluid profiles are significantly different.
6. Conclusions
Neutrinos dominate energy transport in CCSNe and are a vital component of the CCSN explosion mechanism. It is therefore imperative to ensure neutrinos are simulated accurately in CCSN models. One means of checking the accuracy of a computational method is comparing against another accurate method. The grid-based discrete ordinates (DO) method and particle-based Monte Carlo (MC) method both solve the full transport problem in very different ways. We perform the first detailed multi-dimensional comparison of special relativistic Boltzmann-level neutrino transport codes in the context of core-collapse supernovae using the grid-based discrete ordinates (DO) code of Nagakura et al. (2017a) (NSY) and the particle-based Monte Carlo (MC) Sedonu code. We verify that both methods converge to the same result in the limit of large MC particle count and fine DO momentum-space resolution under the assumption of a static fluid background in spherical symmetry and in axisymmetry. This provides confidence in the accuracy of the results from these two completely different approaches.
We demonstrate an agreement of the average neutrino energy to within MeV for 1D calculations and MeV for coarser 2D calculations everywhere in the simulation domain for all three simulated neutrino species. We also demonstrate exquisite agreement in the local spectra of all three species. We find that numerical diffusion from a coarse momentum-space resolution dominates these small errors, though smaller errors result from finite spatial resolution and from the Monte Carlo random walk approximation.
MC transport computes angular moments of the distribution function with great accuracy when the moments are computed natively during the calculation as opposed to post-processed from an angular grid. The DO method requires a very high angular resolution of about 40 points in the polar direction to compute these moments with similar accuracy in 1D calculations, which is currently not possible in 2D calculations and certainly not possible in 2D time-dependent simulations. The MC method, however, requires a large number of particles to be simulated to reach low noise levels in moments that exhibit subtractive cancellation (i.e., in optically-deep regions, in 2D calculations). The present 2D calculation simulated 63 billion particles and still show some noise in these quantities.
The approximate two moment radiation transport scheme is significantly more efficient than either DO or MC transport by evolving on the the energy density and flux. However, this method requires an ad-hoc closure relation to complete the system of equations by making estimates of higher-order moments. We evaluate the performance of the Levermore, Janka, and Minerbo closures in predicting the second angular moments from the first. In the 1D calculations, the Levermore closure performs best, with an error comparable to the differences between the highest-resolution DO results and the MC results. In 2D calculations, this closure performs comparably well when predicting diagonal components of the second moment, but this accuracy is not sufficient for determining the very small off-diagonal components. Though careful tests would be required to assess the importance of these small off-diagonal components, these components reflect the multi-dimensional nature of CCSN dynamics.
Finally, we find that the difference in local heating and cooling rates between the DO and MC methods is at most of the amplitude of the net gain curve in the cooling region of the CCSN in both 1D and 2D calculations. The volume-integrated gain in the gain region (where there is net heating under the shock), however, differs by only in the highest resolution 1D calculations and by in the highest resolution 2D calculations. In these cases, the DO and MC schemes share the same energy resolution, but the MC scheme has effectively infinite angular resolution. The differences in the same quantities due to changing the DO energy (and angular) resolution are larger than in both 1D and 2D calculations, indicating that neutrino energy resolution is the dominant source of real error. Since both the MC and DO methods rely on opacities and emissivities discretized into energy bins, both suffer from this error. The errors in radiation quantities (energy density, angular moments, and average neutrino energies) below the shock is dominated by the finite momentum-space resolution of the DO calculations and statistical noise and limited energy resolution of the MC calculations.
Though it is important to simulate all physics relevant to the CCSN mechanism, the numerical resolution can pose a significantly larger threat to simulation accuracy than a lack of physical elements. We test the effects of ignoring special relativistic Lorentz transformation of neutrinos and find it to be severely sub-dominant to errors induced by low momentum-space resolution at the resolutions we use. The diagonal components of the Eddington tensor in the low-resolution DO calculations show resolution-induced errors of , and even the highest-resolution 1D calculations (which would be impossible in 2D) show errors of . This underscores the need for resolution tests in interpreting results of simulation results.
Though this study inspires much more confidence in both methods, we must mention several caveats. The largest is that opacities and emissivities are an extremely important component of radiation transport. In order to facilitate a detailed comparison of the methods themselves, we carefully configure both codes to use identical opacities at each spatial location and neutrino energy bin. However, we do not compare the effects of different approximations and physical processes present in the opacities that may overwhelm the small differences in the results between these codes.
Second, we must emphasize that our calculations are performed under the assumption of an unchanging fluid background and flat metric at one particular stage in the CCSN evolution. At different stages, especially during early postbounce prompt convection and the shock revival phase, the matter distribution and hence neutrino radiation fields are significantly different and would benefit from a similar analysis. We also use an approximate treatment of pair processes and neglect electron scattering. These simplifications are made to bring both codes to the same level, where we could be sure that they are simulating the same physics with the same level of approximation. This allows an isolated evaluation of the relative performance of both transport methods, but neglects many components of physics that should be included in realistic dynamical CCSN simulations.
The impact of the time-dependent features of the radiation field on the fluid dynamics will be the next necessary step in verifying neutrino radiation hydrodynamics codes. A similar careful verification of the choice and implementation of different neutrino interactions, the resolution and discretization scheme (including mesh geometry and refinement), the treatment of gravity, and the numerical hydrodynamics scheme would also greatly benefit the interpretation of simulation results. We leave this broader task of evaluating multi-dimensional time-dependent radiation hydrodynamics simulations of CCSNe to future work.
We release the data input to and output by both codes at http://www.stellarcollapse.org/MCvsDO. The Sedonu code is also open source and available at https://bitbucket.org/srichers/sedonu.git, along with a set of ready-to-run input data and parameter files to run the calculations in this paper. This Sedonu release contains many performance, usability, and flexibility changes implemented since previous releases. In addition, we incorporate a special relativistic, time-independent version of the MC random walk approximation that enables Sedonu to efficiently calculate neutrino transport through regions of arbitrarily large optical depth.
We would like to acknowledge Ryan Wollaeger, Kendra Keady, Adam Burrows, David Radice, Luke Roberts, and Yuki Nishino for many insightful discussions of radiation transport methods. S. R. was supported by the National Science Foundation (NSF) Blue Waters Graduate Fellowship. This research is part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (Grants No. OCI-0725070 and No. ACI-1238993) and the State of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. This work benefited from access to the NSF XSEDE network under allocation TG-PHY100033 and to Blue Waters under PRAC award no. ACI-1440083. H. N. is supported in part by JSPS Postdoctoral Fellowships for Research Abroad No. 27-348. C. D. O. is supported in part by the International Research Unit of Advanced Future Studies at the Yukawa Institute for Theoretical Physics. This work is furthermore partially supported by the Sherman Fairchild Foundation and NSF under award nos. TCAN AST-1333520, CAREER PHY-1151197, and PHY-1404569. This work is supported by a Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science, and Technology (MEXT) of Japan (15K05093, 16H03986, 26104006) and the HPCI Strategic Program of Japanese MEXT and K computer at the RIKEN and Post-K project (Project ID: hpci 170304, 170230, 170031). J. D. acknowledges support from the Laboratory Directed Research and Development program at Los Alamos National Laboratory (LANL). Work at LANL by S. R. and J. D. was done under the auspices of the National Nuclear Security Administration of the US Department of Energy. This paper has been assigned Yukawa Institute for Theoretical Physics report number YITP-17-61.
Appendix A General Relativistic Boltzmann Equation
The NSY code solves the conservative form of the general relativistic Boltzmann equation, which can be written as (Nagakura et al., 2017a)
[TABLE]
where describes the collision term for neutrino-matter interactions, is the determinant of the 4-dimensional metric, and are the spacetime coordinates. () denote a set of tetrad bases for a local orthonormal frame on 4-dimensional manifolds. In the present study, we assume that the spacetime is flat, so we simply use spherical-polar coordinates, i.e., , , , , , , , , and . ,, and contain the same information as in Section 2. The neutrino momentum space is also written in spherical-polar coordinates (). is the energy of a neutrino with four-momentum , is the polar direction angle with respect to , and is the azimuthal angle with respect to . and contain the same information as in Section 2. The derivative in the first term of Equation A1 is evaluated while holding the momentum coordinates constant. The direction cosines are equivalent to in Section 2 evaluated in the lab frame and can be written as
[TABLE]
The geometric coefficients , , given in Nagakura et al. (2017a) reduce in flat spacetime to
[TABLE]
The NSY code uses Lagrangian remapping grids (LRG) and laboratory fixed grids (LFG) in the fluid rest frame and the lab frame, respectively, to discretize the neutrino momentum space (Nagakura et al., 2014). The LFG are constructed so as to have an isotropic energy grid in the lab frame, while the LRG are constructed so as to have an isotropic energy grid in the fluid rest frame and a propagation angle-dependent energy grid in the lab frame. Here, isotropic means that each angular bin sees the same energy grid. This two-grid technique is essential to treating special-relativistic effects in full generality in the DO method.
Appendix B Monte Carlo Random Walk Approximation
In regions where the scattering optical depth is large, where is the relevant length scale, direct MC radiation transport becomes very inefficient. The path length between scattering events is very small, so a great deal of computer time is spent performing these scattering events while there is little actual movement of energy and lepton number. In these regions, the neutrino transport is very well approximated as a diffusion process, a fact which we use to accelerate the computation.
In the past, MC neutrino transport schemes have excluded the inner regions of high optical depth in favor of an inner boundary condition (Janka, 1991) or have employed the discrete diffusion MC approximation in these regions (Densmore et al., 2007; Abdikamalov et al., 2012). In order to keep the neutrino motion free of any specific grid geometry and to prevent a hard spatial boundary between two algorithms, we instead choose to implement the MC random walk approximation (Fleck & Canfield, 1984). This treats neutrino motion over a specified distance as a diffusive process, and relies on the assumption of isotropic, elastic scattering. In our implementation, we also assume the fluid is unchanging in space and in time during each diffusion step. Here we modify the method of Fleck & Canfield (1984) to treat static fluid backgrounds with relativistic fluid velocities.
The approximation accelerates MC transport in regions of high scattering optical depth using a solution to the diffusion equation:
[TABLE]
The diffusion constant can be shown to be by comparing the solution to the diffusion equation on an infinite uniform background given initial conditions to the solution of a random walk process starting at with step sizes determined by the scattering opacity (Fleck & Canfield, 1984; Chandrasekhar, 1943). In the context of MC radiation transport, the solution represents the probability density of the neutrino being at location at time .
Using the diffusion equation with this diffusion constant, we now specify a sphere of radius in the comoving frame and derive the probability that a neutrino has escaped from the sphere after a certain time . To do this, we again solve the diffusion equation, but this time with the boundary condition to indicate that we are interested only in the first time a neutrino leaves the sphere and we do not allow neutrinos to leave and then re-enter the sphere. This can be solved via separation of variables and Sturm-Liouville orthogonality conditions to arrive at
[TABLE]
The probability that a neutrino has escaped the sphere after time t is represented by the volume integral of the diffusion solution (Figure 19):
[TABLE]
This solution is plotted in Figure 19.
The diffusion equation is acausal in that there is a finite probability of a neutrino escaping at times less than the light travel time to the edge of the sphere. Because of this, we set . We can also use the escape probability at as an estimate of the accuracy of the approximation. We only use the random walk approximation when
[TABLE]
In this study, we use , which corresponds to only using the random walk approximation when the scattering optical depth of the sphere is .
We tabulate , which can then be inverted via inverse transform sampling (e.g., Haghighat 2015) to randomly sample the escape time . The table extends over the range of using 100 evenly spaced points in where and, in our calculations, (corresponding to ). We evaluate the first 1000 terms in the sum in Equation B3, which is far more than is necessary for a converged solution, but tabulating is a very cheap one-time calculation.
We restrict the lab-frame radius of the sphere to the largest length scale between (a) the distance to the nearest grid cell boundary and (b) 1% of the grid cell’s smallest dimension. However, since the sphere is defined in the fluid rest frame, its size must be further limited when the fluid is moving, since the sphere is effectively advected. The largest restriction occurs in the event that the displacement of the neutrino from its starting position to the surface of the sphere is parallel to the fluid velocity, so we will use this worst-case scenario to set the sphere size limiter. The four-vector connecting the neutrino’s initial and final positions in the lab frame can be Lorentz-transformed to give the displacement vector in the lab frame . The longest diffusion time the numerical scheme will allow is , resulting in a maximum lab-frame displacement of . Inverting this, we set the comoving-frame radius to
[TABLE]
The comoving frame neutrino energy remains the same throughout the process, since the scattering is assumed to be elastic. Absorption happens continuously throughout the diffusion process. The packet energy is decreased according to
[TABLE]
and is added to the fluid energy to account for neutrino absorption. The comoving frame packet energy averaged over the diffusion time is
[TABLE]
If neutrino packets are created assuming the fluid emits for a time of , this means that the neutrino contributes to the fluid cell’s steady-state radiation energy content. Averaged over the diffusion process, most of the neutrino energy is distributed isotropically in direction. However, there is a small asymmetry due to the fact that the neutrino ends up at one point on the edge of the diffusion sphere. Averaged over the duration of the diffusion process, for a neutrino packet with energy , there is a net energy flux of in the direction of the final displacement vector while is distributed isotropically in direction.
With the theoretical groundwork complete, we now describe the random walk algorithm itself. A comoving frame diffusion sphere size is first chosen according to Equation B5. If the scattering optical depth is sufficiently large (Equation B4), the time the neutrino takes to reach the edge of the sphere is sampled from Equation B3. A location at the edge of the comoving-frame sphere is randomly uniformly chosen, the displacement 4-vector is Lorentz transformed into the lab frame, and the neutrino is moved this distance. The new comoving neutrino direction is chosen uniformly in the steradians moving strictly away from the diffusion sphere. The neutrino packet energy is decreased due to absorption according to Equation B6 and the absorbed energy is counted toward fluid heating. Comoving radiation energy in the amount of moving in the direction of the final displacement is Lorentz transformed into the lab frame and accumulated into the distribution function. The remaining of comoving radiation energy is divided evenly into pieces, each is assigned an isotropically uniform random direction in the comoving frame, is Lorentz transformed into the lab frame, and is accumulated into the distribution function. This allows us to self-consistently treat both the isotropic and directional components of the radiation field without making reference to a particular grid structure. In this work, we found that is a reasonable compromise between code performance and noise in the resulting radiation field.
Appendix C Comparison Details: Angular Moment Calculations
The DO and MC methods are very different, so care is required to make meaningful comparisons between the two codes. The NSY code evolves the distribution function , while Sedonu calculates the amount of neutrino energy in each spatial-energy-direction cell in non-native calculations. The Sedonu distribution function value at the bin center is calculated using
[TABLE]
where is the total neutrino energy content (units of ergs) in spatial-direction-energy bin , is the spatial volume of the grid cell in the lab frame, and are the neutrino direction angles in the lab frame, and is the neutrino energy in the comoving frame.
In the 1D simulations where Sedonu collects radiation information on angular bins rather than native moments, we take care to ensure that the post-processing for the two codes are equivalent. For both Sedonu and the NSY code, the distribution function is linearly interpolated to on identical fine angular grids in of zones, respectively. Throughout this section, the subscript () refers to the spatial mesh index, the subscript () refers to the neutrino energy bin index, the subscripts () refer to the direction indices on the coarse direction grid used in the transport calculation, the subscripts () refer to the direction indices on the high-resolution post-processing angular grid, and the superscripts () refer to directions (i.e., , , or ) in the lab frame.
The specific energy density (lab frame energy density per comoving-frame neutrino energy) is computed as a sum over the coarse grid for Sedonu and over the fine grid in the NSY code. This takes advantage of the fact that Sedonu computes energy content directly and does not introduce interpolation error into the Sedonu results:
[TABLE]
For both Sedonu and the NSY code, the higher-order moments are evaluated as
[TABLE]
The energy-integrated moments are computed using
[TABLE]
Finally, the average neutrino energy is computed using
[TABLE]
Appendix D Comparison Details: Neutrino Reaction Rates
The three source terms on the right hand side of Equation 1 each encapsulate multiple processes, and are grouped into the mathematical nature of each term. In both Sedonu and the NSY code, all of these source terms are evaluated in the comoving frame. Details of how the NSY code computes reaction rates are explained by Bruenn (1985) and Sumiyoshi et al. (2005).
The emission and absorption term takes the form of
[TABLE]
where and are the emission and absorption reaction rates, respectively. Sedonu takes advantage of the concept of stimulated absorption to account for final-state neutrino blocking (Burrows et al., 2006), in which the effective absorption reaction rate is . This removes the need to treat final-state blocking explicitly in the neutrino emission process.
The scattering term accounts for neutrinos scattering into and out of a given direction according to
[TABLE]
The primed variables are the neutrino final-state quantities and is the scattering reaction rate. Both Sedonu and the NSY code assume isoenergetic scattering, so the scattering reaction rate becomes . Under this assumption, the scattering source term reduces to
[TABLE]
Sedonu uses , where , as the scattering opacity directly.
Finally, pair annihilation and neutrino bremsstrahlung source terms take the form of
[TABLE]
The barred variables are the neutrino anti-species quantities and is the reaction rate for pair and bremsstrahlung processes. In order to ensure the same assumptions go into both radiation transport schemes, the NSY code calculates these reactions assuming the anti-species is isotropic, i.e.,
[TABLE]
where depends only on energy and not on direction. Under this assumption, the source term can be written as
[TABLE]
Sedonu uses in the same way as .
Since the NSY code evolves , they use the reaction rates (units of cm*-1*) directly, but Sedonu needs to convert the emission reaction rates to physical emissivities. For an emissivity with units of (erg cm*-3* s*-1*),
[TABLE]
Here, and are the momentum-space volume (normalized by ) and center of energy bin , respectively. The absorption and scattering reaction rates with tildes () are already equivalent to absorption and scattering opacities.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abdikamalov et al. (2012) Abdikamalov, E., Burrows, A., Ott, C. D., et al.. 2012, Ap J, 755, 111, ar Xiv:astro-ph.SR/1203.2915
- 2Anderson & Spiegel (1972) Anderson, J. L., & Spiegel, E. A. 1972, Ap J, 171, 127, doi:10.1086/151265 · doi ↗
- 3Audit et al. (2002) Audit, E., Charrier, P., Chièze, J., & Dubroca, B. 2002, ar Xiv:astro-ph/0206281
- 4Bethe (1990) Bethe, H. A. 1990, \rmp , 62, 801
- 5Bethe & Wilson (1985) Bethe, H. A., & Wilson, J. R. 1985, Ap J, 295, 14, doi:10.1086/163343 · doi ↗
- 6Bruenn (1985) Bruenn, S. W. 1985, Ap JS, 58, 771, doi:10.1086/191056 · doi ↗
- 7Buras et al. (2006) Buras, R., Rampp, M., Janka, H.-T., & Kifonidis, K. 2006, A&A, 447, 1049, ar Xiv:astro-ph/0507135
- 8Burrows (2013) Burrows, A. 2013, \rmp , 85, 245, ar Xiv:astro-ph.SR/1210.4921
