The Income Fluctuation Problem and the Evolution of Wealth

Qingyin Ma; John Stachurski; Alexis Akira Toda

arXiv:1905.13045·econ.TH·August 7, 2020·J. Econ. Theory

The Income Fluctuation Problem and the Evolution of Wealth

Qingyin Ma, John Stachurski, Alexis Akira Toda

PDF

Open Access

TL;DR

This paper studies a comprehensive household savings model with state-dependent returns and income, establishing conditions for solution existence, uniqueness, and properties of wealth distribution, including Pareto tails.

Contribution

It extends classic models by allowing multiple state-dependent, correlated processes and derives conditions for wealth distribution characteristics.

Findings

01

Solutions exist, are unique, and globally computable.

02

Wealth dynamics are stationary, ergodic, and geometrically mixing.

03

Wealth distribution exhibits Pareto tails.

Abstract

We analyze the household savings problem in a general setting where returns on assets, non-financial income and impatience are all state dependent and fluctuate over time. All three processes can be serially correlated and mutually dependent. Rewards can be bounded or unbounded and wealth can be arbitrarily large. Extending classic results from an earlier literature, we determine conditions under which (a) solutions exist, are unique and are globally computable, (b) the resulting wealth dynamics are stationary, ergodic and geometrically mixing, and (c) the wealth distribution has a Pareto tail. We show how these results can be used to extend recent studies of the wealth distribution. Our conditions have natural economic interpretations in terms of asymptotic growth rates for discounting and return on savings.

Equations317

G_{β} < 1 where G_{β} := n \to \infty lim (\mathbbm E t = 1 \prod n β_{t})^{1/ n} .

G_{β} < 1 where G_{β} := n \to \infty lim (\mathbbm E t = 1 \prod n β_{t})^{1/ n} .

G_{β R} < 1 where G_{β R} := n \to \infty lim (\mathbbm E t = 1 \prod n β_{t} R_{t})^{1/ n} .

G_{β R} < 1 where G_{β R} := n \to \infty lim (\mathbbm E t = 1 \prod n β_{t} R_{t})^{1/ n} .

G_{R} := n \to \infty lim (\mathbbm E t = 1 \prod n R_{t})^{1/ n} .

G_{R} := n \to \infty lim (\mathbbm E t = 1 \prod n R_{t})^{1/ n} .

max \mathbbm E_{0} {t = 0 \sum \infty (i = 0 \prod t β_{i}) u (c_{t})}

max \mathbbm E_{0} {t = 0 \sum \infty (i = 0 \prod t β_{i}) u (c_{t})}

s.t.

0 \leq c_{t} \leq a_{t}, (a_{0}, Z_{0}) = (a, z) given .

β_{t} = β (Z_{t}, ε_{t}), R_{t} = R (Z_{t}, ζ_{t}), and Y_{t} = Y (Z_{t}, η_{t}),

β_{t} = β (Z_{t}, ε_{t}), R_{t} = R (Z_{t}, ζ_{t}), and Y_{t} = Y (Z_{t}, η_{t}),

\mathbbm{E}_{a,z}:=\mathbbm{E}\left[\,\cdot\,\big{|}\,(a_{0},Z_{0})=(a,z)\right]\quad\text{and}\quad\mathbbm{E}_{z}:=\mathbbm{E}\left[\,\cdot\,\big{|}\,Z_{0}=z\right].

\mathbbm{E}_{a,z}:=\mathbbm{E}\left[\,\cdot\,\big{|}\,(a_{0},Z_{0})=(a,z)\right]\quad\text{and}\quad\mathbbm{E}_{z}:=\mathbbm{E}\left[\,\cdot\,\big{|}\,Z_{0}=z\right].

β \mathbbm E R_{t}^{1 - γ} < 1 and (β \mathbbm E R_{t}^{1 - γ})^{1/ γ} \mathbbm E R_{t} < 1.

β \mathbbm E R_{t}^{1 - γ} < 1 and (β \mathbbm E R_{t}^{1 - γ})^{1/ γ} \mathbbm E R_{t} < 1.

G_{β R} = β \mathbbm E R_{t} = β (\mathbbm E R_{t})^{1 - γ} (\mathbbm E R_{t})^{γ} \leq (β \mathbbm E R_{t}^{1 - γ}) (\mathbbm E R_{t})^{γ} .

G_{β R} = β \mathbbm E R_{t} = β (\mathbbm E R_{t})^{1 - γ} (\mathbbm E R_{t})^{γ} \leq (β \mathbbm E R_{t}^{1 - γ}) (\mathbbm E R_{t})^{γ} .

V_{c} (a, z) = \mathbbm E_{a, z} t = 0 \sum \infty β_{0} \dots β_{t} u [c (a_{t}, Z_{t})],

V_{c} (a, z) = \mathbbm E_{a, z} t = 0 \sum \infty β_{0} \dots β_{t} u [c (a_{t}, Z_{t})],

(u^{'} \circ c) (a, z) \geq \mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} [a - c (a, z)] + \hat{Y}, \hat{Z})

(u^{'} \circ c) (a, z) \geq \mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} [a - c (a, z)] + \hat{Y}, \hat{Z})

(u^{'} \circ c) (a, z) = max {\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} [a - c (a, z)] + \hat{Y}, \hat{Z}), u^{'} (a)}

(u^{'} \circ c) (a, z) = max {\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} [a - c (a, z)] + \hat{Y}, \hat{Z}), u^{'} (a)}

t \to \infty lim \mathbbm E_{a, z} β_{0} \dots β_{t} (u^{'} \circ c) (a_{t}, Z_{t}) a_{t} = 0.

t \to \infty lim \mathbbm E_{a, z} β_{0} \dots β_{t} (u^{'} \circ c) (a_{t}, Z_{t}) a_{t} = 0.

(a, z) \in \SS_{0} sup ∣ (u^{'} \circ c) (a, z) - u^{'} (a) ∣ < \infty.

(a, z) \in \SS_{0} sup ∣ (u^{'} \circ c) (a, z) - u^{'} (a) ∣ < \infty.

ρ (c, d) := ∥ u^{'} \circ c - u^{'} \circ d ∥ := (a, z) \in \SS_{0} sup ∣ (u^{'} \circ c) (a, z) - (u^{'} \circ d) (a, z) ∣,

ρ (c, d) := ∥ u^{'} \circ c - u^{'} \circ d ∥ := (a, z) \in \SS_{0} sup ∣ (u^{'} \circ c) (a, z) - (u^{'} \circ d) (a, z) ∣,

u^{'} (ξ) = ψ_{c} (ξ, a, z),

u^{'} (ξ) = ψ_{c} (ξ, a, z),

G := {(ξ, a, z) \in \mathbbm R_{+} \times (0, \infty) \times Z : 0 < ξ \leq a}

G := {(ξ, a, z) \in \mathbbm R_{+} \times (0, \infty) \times Z : 0 < ξ \leq a}

ψ_{c} (ξ, a, z) := max {\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) [\hat{R} (a - ξ) + \hat{Y}, \hat{Z}], u^{'} (a)} .

ψ_{c} (ξ, a, z) := max {\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) [\hat{R} (a - ξ) + \hat{Y}, \hat{Z}], u^{'} (a)} .

x \mapsto (u^{'})^{- 1} [\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} x + \hat{Y}, \hat{Z})] is concave on \mathbbm R_{+},

x \mapsto (u^{'})^{- 1} [\mathbbm E_{z} \hat{β} \hat{R} (u^{'} \circ c) (\hat{R} x + \hat{Y}, \hat{Z})] is concave on \mathbbm R_{+},

u (c) = \frac{c ^{1 - γ}}{1 - γ} if γ > 0 and u (c) = lo g c if γ = 1,

u (c) = \frac{c ^{1 - γ}}{1 - γ} if γ > 0 and u (c) = lo g c if γ = 1,

\overset{s}{ˉ} < 1 and \mathbbm E_{z} \hat{β} \hat{R} u^{'} (\hat{R} \overset{s}{ˉ} a) \leq u^{'} (a) for all (a, z) \in \SS_{0},

\overset{s}{ˉ} < 1 and \mathbbm E_{z} \hat{β} \hat{R} u^{'} (\hat{R} \overset{s}{ˉ} a) \leq u^{'} (a) for all (a, z) \in \SS_{0},

\overset{s}{ˉ} := (z \in Z max \mathbbm E_{z} \hat{β} \hat{R}^{1 - γ})^{1/ γ}

\overset{s}{ˉ} := (z \in Z max \mathbbm E_{z} \hat{β} \hat{R}^{1 - γ})^{1/ γ}

a_{t + 1}

a_{t + 1}

Z_{t + 1}

\mathbbm P {Y_{t} \in A ∣ Z_{t} = z} = \int_{A} f (y ∣ z) d y

\mathbbm P {Y_{t} \in A ∣ Z_{t} = z} = \int_{A} f (y ∣ z) d y

\overset{ˉ}{h} (a, z) := h (a, z) - \mathbbm E h (a_{t}, Z_{t})

\overset{ˉ}{h} (a, z) := h (a, z) - \mathbbm E h (a_{t}, Z_{t})

γ_{h}^{2} := \mathbbm E [\overset{ˉ}{h}^{2} (a_{0}, Z_{0})] + 2 t = 1 \sum \infty \mathbbm E [\overset{ˉ}{h} (a_{0}, Z_{0}) \overset{ˉ}{h} (a_{t}, Z_{t})],

γ_{h}^{2} := \mathbbm E [\overset{ˉ}{h}^{2} (a_{0}, Z_{0})] + 2 t = 1 \sum \infty \mathbbm E [\overset{ˉ}{h} (a_{0}, Z_{0}) \overset{ˉ}{h} (a_{t}, Z_{t})],

Q^{t} ((a, z), \cdot) - ψ_{\infty}_{T V} \leq λ^{t} M V (a, z) for all (a, z) \in \SS .

Q^{t} ((a, z), \cdot) - ψ_{\infty}_{T V} \leq λ^{t} M V (a, z) for all (a, z) \in \SS .

\mathbbm P_{a, z} {T \to \infty lim \frac{1}{T} t = 1 \sum T h (a_{t}, Z_{t}) = \mathbbm E h (a_{t}, Z_{t})} = 1.

\mathbbm P_{a, z} {T \to \infty lim \frac{1}{T} t = 1 \sum T h (a_{t}, Z_{t}) = \mathbbm E h (a_{t}, Z_{t})} = 1.

\frac{1}{T γ _{h}^{2}} t = 1 \sum T \overset{ˉ}{h} (a_{t}, Z_{t}) \to d N (0, 1) as T \to \infty.

\frac{1}{T γ _{h}^{2}} t = 1 \sum T \overset{ˉ}{h} (a_{t}, Z_{t}) \to d N (0, 1) as T \to \infty.

\mathbbm P_{\overset{z}{ˉ}} {R (\overset{z}{ˉ}, \hat{ζ}) (1 - α (\overset{z}{ˉ})) > 1} > 0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic theories and models · Economic Theory and Policy · Financial Literacy, Pension, Retirement Analysis

Full text

Abstract.

We analyze the household savings problem in a general setting where returns on assets, non-financial income and impatience are all state dependent and fluctuate over time. All three processes can be serially correlated and mutually dependent. Rewards can be bounded or unbounded and wealth can be arbitrarily large. Extending classic results from an earlier literature, we determine conditions under which (a) solutions exist, are unique and are globally computable, (b) the resulting wealth dynamics are stationary, ergodic and geometrically mixing, and (c) the wealth distribution has a Pareto tail. We show how these results can be used to extend recent studies of the wealth distribution. Our conditions have natural economic interpretations in terms of asymptotic growth rates for discounting and return on savings.

Keywords: Income fluctuation, optimality, stochastic stability, wealth distribution.

The Income Fluctuation Problem and the

Evolution of Wealth111We thank the editors and two anonymous referees for many valuable comments and suggestions. This paper has also benefited from discussion with many colleagues. We particularly thank Fedor Iskhakov, Larry Liu and Chung Tran for their insightful feedback and suggestions. The second author gratefully acknowledges financial support from ARC grant FT160100423.

Email addresses: [email protected], [email protected], [email protected].

Qingyin Maa, John Stachurskib and Alexis Akira Todac

aInternational School of Economics and Management,

Capital University of Economics and Business

bResearch School of Economics, Australian National University

cDepartment of Economics, University of California San Diego

January 30, 2020

1. Introduction

It has been observed that, in the US and several other large economies, the wealth distribution is heavy tailed and wealth inequality has risen sharply over the last few decades.222For example, in a study based on capital income data, Saez and Zucman (2016) find that, in the case of the US, the share of total household wealth held by the top 0.1% increased from 7 percent to 22 percent between 1978 and 2012. For a discussion of the heavy-tailed property of the wealth distribution, see Pareto (1896), Davies and Shorrocks (2000), Benhabib and Bisin (2018), Vermeulen (2018) or references therein. This matters not only for its direct impact on taxation and redistribution policies, but also for potential flow-on effects for productivity growth, business cycles and fiscal policy, as well as for the political environment that shapes these and other economic outcomes.333One analysis of the two-way interactions between inequality and political decision making can be found in Acemoglu and Robinson (2002). Glaeser et al. (2003) show how inequality can alter economic and social outcomes through subversion of institutions. The same study contains references on linkages between inequality and growth. Regarding fiscal policy, Brinca et al. (2016) find strong correlations between wealth inequality and the magnitude of fiscal multipliers, while Bhandari et al. (2018) study the connection between fiscal-monetary policy, business cycles and inequality. Ahn et al. (2018) discuss the impact of distributional properties on macroeconomic aggregates.

At present, our understanding of these phenomena is hampered by the fact that standard tools of analysis—such as those used for heterogeneous agent models—are not well adapted to studying the wealth distribution as it stands. For example, while we have sound understanding of the household problem when returns on savings and rates of time discount are constant (see, e.g., Schechtman (1976), Schechtman and Escudero (1977), Deaton and Laroque (1992), Carroll (1997), or Açıkgöz (2018)), our knowledge is far more limited in settings where these values are stochastic. This is problematic, since injecting such features into the household problem is essential for accurately representing the joint distribution of income and wealth (e.g., Benhabib et al. (2015), Benhabib et al. (2017), Stachurski and Toda (2019)).444Also related is the recent experimental study of Epper et al. (2018), which finds a strong positive connection between dispersion in subjective rates of time discounting across the population and realized dispersion in the wealth distribution. This in turn is consistent with earlier empirical studies such as Lawrance (1991). Moreover, models with time-varying discount rates and returns on assets are at the forefront of recent quantitative analysis of wealth and inequality.555For a recent quantitative study see, for example, Hubmer et al. (2018), where returns on savings and discount rates are both state dependent (as is labor income). Kaymak et al. (2018) find that asset return heterogeneity is required to match the upper tail of the wealth distribution.

While it might be hoped that the analysis of the income fluctuation problem (or household consumption and savings problem) changes little when we shift from constant to state dependent asset returns and rates of time discount, this turns out not to be the case. Effectively modeling these features and the way they map to the wealth distribution requires significant advances in our understanding of choice and stochastic dynamics in the setting of optimal savings.

One difficulty is that state-dependent discounting takes us beyond the bounds of traditional dynamic programming theory. This matters little if there exists some constant $\bar{\beta}<1$ such that the discount process $\{\beta_{t}\}$ satisfies $\beta_{t}\leq\bar{\beta}$ for all $t$ with probability one, since, in this case, a standard contraction mapping argument can still be applied (see, e.g., Miao (2006) or Cao (2020)). However, recent quantitative studies extend beyond such settings. For example, AR(1) specifications are increasingly common, in which case the support of $\beta_{t}$ is unbounded above at every point in time.666See, for example, Hills and Nakata (2018), Hubmer et al. (2018) or Schorfheide et al. (2018). Even if discretization is employed, the outcome $\beta_{t}\geq 1$ can occur with positive probability when the approximation is sufficiently fine. Moreover, such outcomes are not inconsistent with empirical and experimental evidence, at least for some households in some states of the world.777See, for example, Loewenstein and Prelec (1991) and Loewenstein and Sicherman (1991). Do there exist conditions on $\{\beta_{t}\}$ that allow for $\beta_{t}\geq 1$ in some states and yet imply existence of optimal polices and practical computational techniques?

Another source of complexity for the income fluctuation problem in the general setting considered here is that the set of possible values for household assets is typically unbounded above. For example, when returns on assets are stochastic, a sufficiently long sequence of favorable returns can compound one another to project a household to arbitrarily high levels of wealth. This model feature is desirable: We wish to analyze these kinds of outcomes rather than rule them out. Indeed, Benhabib et al. (2015) and other related studies argue convincingly that such outcomes are a key causal mechanism behind the heavy tail of the current distribution of wealth.888One related study is Benhabib et al. (2011), who show that capital income risk is the driving force of the heavy-tail properties of the stationary wealth distribution. In Blanchard-Yaari style economies, Toda (2014), Toda and Walsh (2015) and Benhabib et al. (2016) show that idiosyncratic investment risk generates a double Pareto stationary wealth distribution. Gabaix et al. (2016) point out that a positive correlation of returns with wealth (“scale dependence”) in addition to persistent heterogeneity in returns (“type dependence”) can well explain the speed of changes in the tail inequality observed in the data. However, if we accept this logic, then stationarity and ergodicity of the wealth process—which are fundamental both for estimation and for simulation-based numerical methods—must now be established in a setting where the wealth distribution has unbounded support. In such a scenario, what conditions on preferences and financial and labor income are necessary for these properties to hold?

A final and related example of the need for deeper analysis is as follows: To understand the upper tail of the wealth distribution, we must avoid unnecessarily truncating the upper tail of the set of possible asset values in quantitative work. While truncation is convenient because finite or compact state spaces are easier to handle computationally, we can attain greater accuracy in modeling the wealth distribution if truncation at the upper tail can be replaced locally by a parameterized savings function, such as a linear function (Gouin-Bonenfant and Toda, 2018). However, any such approximation must be justified by theory. What conditions can be imposed on primitives to generate such properties while still maintaining realistic assumptions for asset returns and non-financial income?

In this paper we address all of these questions, along with other key properties of the income fluctuation problem, such as continuity and monotonicity of the optimal consumption policy. Our setting admits capital income risk, labor earnings shocks and time-varying discount rates, driven by a combination of iid innovations and an exogenous Markov chain $\{Z_{t}\}$ . The supports of the innovations can be unbounded, so we admit practical innovation sequences such as normal and lognormal. As a whole, this environment allows for a range of realistic features, such as stochastic volatility in returns on asset holdings, or correlation in the shocks impacting asset returns and non-financial income. The utility function can be unbounded both above and below, with no specific structure imposed beyond differentiability, concavity and the usual slope (Inada) conditions.999While the assumption that the exogenous state process $\{Z_{t}\}$ is a (finite state) Markov chain might appear restrictive, it fits most practical settings and avoids a host of technical issues that tend to obscure the key ideas. Moreover, the innovation shocks are not restricted to be discrete, and the same is true for assets and consumption.

To begin, when considering optimality in the household problem, we require a condition on the state dependent discount process $\{\beta_{t}\}$ that generalizes the classical condition $\beta<1$ from the constant case and, for reasons discussed above, permits $\beta_{t}>1$ with positive probability. To this end, we introduce the restriction101010Here and below we set $\beta_{0}\equiv 1$ , so $\prod_{t=1}^{n}\beta_{t}=\prod_{t=0}^{n}\beta_{t}$ .

[TABLE]

Condition (1) clearly generalizes the classical condition $\beta<1$ for the constant discount case. In the stochastic case, $\ln G_{\beta}$ can be understood as the asymptotic growth rate of the probability weighted average discount factor. Indeed, if $B_{n}:=\mathbbm{E}\prod_{t=1}^{n}\beta_{t}$ is the average $n$ -period discount factor, then, from the definition of $G_{\beta}$ and some straightforward analysis, we obtain $\ln(B_{n+1}/B_{n})\to\ln G_{\beta}$ , so the condition $G_{\beta}<1$ implies that the asymptotic growth rate of the average $n$ -period discount factor is negative, drifting down from its initial condition $\beta_{0}\equiv 1$ at the rate $\ln G_{\beta}$ . This does not, of course, preclude the possibility that $\beta_{t}>1$ at any given $t$ .

We show that condition (1) is in fact a necessary condition in those settings where the classical condition is necessary for finite lifetime values. In this sense it cannot be further weakened for the income fluctuation problem apart from special cases. At the same time, it admits the use of convenient specifications such as the discretized AR(1) process from Hubmer et al. (2018). In addition, we prove that $G_{\beta}$ can be represented as the spectral radius of a nonnegative matrix, and hence can be computed by numerical linear algebra (as discussed below).

We also generalize the standard condition $\beta R<1$ , where $R$ is the gross interest rate in the constant case, which is used to ensure stability of the asset path and finiteness of lifetime valuations, as well as existence of stationary Markov policies (see, e.g., Deaton and Laroque (1992), Chamberlain and Wilson (2000) or Li and Stachurski (2014)). Analogous to (1), we introduce the generalized condition

[TABLE]

Here $\{R_{t}\}$ is a stochastic capital income process. Analogous to the case of $G_{\beta}$ , the value $\ln G_{\beta R}$ can be understood as the asymptotic growth rate of average gross payoff on assets, discounted to present value.

We show that, when Conditions (1)–(2) hold and non-financial income satisfies two moment conditions, a unique optimal consumption policy exists. We also show that the policy can be computed by successive approximations and analyze its properties, such as monotonicity and asymptotic linearity. This asymptotic linearity can be used to successfully model wealth inequality by accurately representing asset path dynamics for very high wealth households (Gouin-Bonenfant and Toda, 2018).

One important feature of Conditions (1)–(2) is that they take into account the autocorrelation structure of preference shocks and asset returns. For example, if these processes depend only on iid innovations, then (1) reduces to $\mathbbm{E}\beta_{t}<1$ and (2) reduces to $\mathbbm{E}\beta_{t}R_{t}<1$ . But returns on assets are typically not iid, since both mean returns and volatility are, in general, time varying, and preference shocks are typically modeled as correlated (see, e.g., Hubmer et al. (2018) or Schorfheide et al. (2018)). This dependence must be and is accounted for in (2), since long upswings in $\{\beta_{t}\}$ and $\{R_{t}\}$ can lead to explosive paths for valuations and assets.

Next we study asymptotic stability, stationarity and ergodicity of wealth. Such properties are essential to existence of stationary equilibria in heterogeneous agent models (e.g., Huggett (1993), Aiyagari (1994) or Cao (2020)), as well as standard estimation, calibration and simulation techniques that connect time series averages with cross-sectional moments.111111A well-known example of a computational technique that uses ergodicity can be found in Krusell and Smith (1998). On the estimation side see, for example, Hansen and West (2002). These properties require an additional restriction, placed on the asymptotic growth rate of mean returns. Analogous to (1) and (2), this is defined as

[TABLE]

We show that if $G_{R}$ is sufficiently restricted and a degree of social mobility is present, then there exists a unique stationary distribution for the state process, the distributional path of the state process under the optimal path converges globally to the stationary distribution, and the stationary distribution is ergodic. We also show that, under some mild additional conditions, the rate of convergence of marginal distributions to the stationary distribution is geometric, and that a version of the Central Limit Theorem is valid. Finally, under some mild additional conditions, we prove that the stationary distribution of assets is Pareto tailed, consistent with the data.

Our study is related to Benhabib et al. (2015), who prove the existence of a heavy-tailed wealth distribution in an infinite horizon heterogeneous agent economy with capital income risk. In the process, they show that households facing a stochastic return on savings possess a unique optimal consumption policy characterized by the (boundary constraint-contingent) Euler equation, and that a unique and unbounded stationary distribution exists for wealth under this consumption policy. They assume isoelastic utility, constant discounting, and mutually independent, iid returns and labor income processes, both supported on bounded closed intervals with strictly positive lower bounds. We relax all of these assumptions. Apart from allowing more general utility and state dependent discounting, this permits such realistic features for household income as positive correlations between labor earnings and wealth returns (an extension that was suggested by Benhabib et al. (2015)), or time varying volatility in returns.121212Empirical motivation for these kinds of extensions can be found in numerous studies, including Guvenen and Smith (2014) and Fagereng et al. (2016a, b).

Another related paper is Chamberlain and Wilson (2000), which studies an income fluctuation problem with stochastic income and asset returns and obtains many significant results on asymptotic properties of consumption. Their study imposes relatively few restrictions on the wealth return and labor income processes. Our paper extends their work by allowing for random discounting, as well as dropping their boundedness restriction on the utility, which prevents their work from being used in many standard settings such as constant relative risk aversion. We also develop a set of new results on stability and ergodicity, as well as asymptotic normality of the wealth process.

Our optimality theory draws on techniques found in Li and Stachurski (2014), who show that the time iteration operator is a contraction mapping with respect to a metric that evaluates consumption differences in terms of marginal utility, while assuming a constant discount factor and constant rate of return on assets.131313Coleman (1990) introduced the time iteration operator as a constructive method for solving stochastic growth models. It has since been used in Datta et al. (2002), Morand and Reffett (2003) and many other studies. We show that these ideas extend to a setting where both returns and discount rates are stochastic and time varying. Our results on dynamics under the optimal policy have no counterparts in Li and Stachurski (2014).

In a similar vein, our work is related to several other papers that treat the standard income fluctuation problems with constant rates of return on assets and constant discount rates, such as Rabault (2002), Carroll (2004) and Kuhn (2013). While Carroll (2004) constructs a weighted supremum norm contraction and works with the Bellman operator, the other two papers focus on time iteration. In particular, Rabault (2002) exploits the monotonicity structure, while Kuhn (2013) applies a version of the Tarski fixed point theorem. Our techniques for studying optimality are close to those in Li and Stachurski (2014), as discussed above.141414Our paper is also related to Cao and Luo (2017), who study wealth inequality in a continuous-time framework with heterogeneous returns following a two-state Markov chain. While we do not pursue the connection here, the generality of our setup, including a persistent shock structure to wealth returns, might permit a study of the continuous-time limit that yields the tail results of Cao and Luo (2017) in a general framework.

The rest of this paper is structured as follows. Section 2 formulates the problem and establishes optimality results. Sufficient conditions for the existence and uniqueness of optimal policies are discussed. Section 3 focuses on stochastic stability. Section 4 discusses our key conditions and how they can be checked. Section 5 provides a set of applications and Section 6 concludes. All proofs are deferred to the appendix. Code that generates our figures can be found at https://github.com/jstac/ifp_public.

2. The Income Fluctuation Problem and Optimality Results

This section formulates the income fluctuation problem we consider, establishes the existence, uniqueness and computability of a solution, and derives its properties.

2.1. Problem Statement

We consider a general income fluctuation problem, where a household chooses a consumption-asset path $\{(c_{t},a_{t})\}$ to solve

[TABLE]

Here $u$ is the utility function, $\{\beta_{t}\}_{t\geq 0}$ is discount factor process with $\beta_{0}=1$ , $\{R_{t}\}_{t\geq 1}$ is the gross rate of return on wealth, and $\{Y_{t}\}_{t\geq 1}$ is non-financial income. These stochastic processes obey

[TABLE]

where $\beta$ , $R$ and $Y$ are measurable nonnegative functions and $\{Z_{t}\}_{t\geq 0}$ is an irreducible time-homogeneous $\mathsf{Z}$ -valued Markov chain taking values in finite set $\mathsf{Z}$ . Let $P(z,\hat{z})$ be the probability of transitioning from $z$ to $\hat{z}$ in one step. The innovation processes $\{\varepsilon_{t}\}$ , $\{\zeta_{t}\}$ and $\{\eta_{t}\}$ are iid independent and their supports can be continuous and vector-valued.

The function $u$ maps $\mathbbm{R}_{+}$ to $\{-\infty\}\cup\mathbbm{R}$ , is twice differentiable on $(0,\infty)$ , satisfies $u^{\prime}>0$ and $u^{\prime\prime}<0$ everywhere on $(0,\infty)$ , and that $u^{\prime}(c)\to\infty$ as $c\to 0$ and $u^{\prime}(c)<1$ as $c\to\infty$ . We define

[TABLE]

The next period value of a random variable $X$ is typically denoted $\hat{X}$ . Expectation without a subscript refers to the stationary process, where $Z_{0}$ is drawn from its (necessarily unique) stationary distribution.

2.2. Key Conditions

Our conditions for optimality are listed below. In what follows, $G_{\beta}$ is the asymptotic growth rate of the discount process as defined in (1).

Assumption 2.1.

The discount factor process satisfies $G_{\beta}<1$ .

Assumption 2.1 is a natural extension of the standard condition $\beta<1$ from the constant discount case. If $\beta_{t}\equiv\beta$ for all $t$ , then $G_{\beta}=\beta$ , as follows immediately from the definition. It is weaker than the obvious sufficient condition $\beta_{t}\leq\bar{\beta}$ with probability one for some constant $\bar{\beta}<1$ , since in such a setting we have $G_{\beta}\leq\bar{\beta}<1$ . In fact it cannot be significantly weakened, as the proposition shows.

Proposition 2.1 (Necessity of the discount condition).

Let $\beta_{t}$ and $u(Y_{t})$ be positive with probability one for all $t$ and all initial states $z$ in $\mathsf{Z}$ . If, in this setting, we have $G_{\beta}\geq 1$ , then the objective in (2.1) is infinite at every initial state $(a,z)$ .

The positivity assumed here may or may not hold in applications, but Proposition 2.1 shows that special conditions will have to be imposed on preferences if Assumption 2.1 fails. Put differently, allowing $G_{\beta}\geq 1$ is tantamount to allowing $\beta\geq 1$ in the case when the discount rate is constant.

Next, we need to ensure that the present discounted value of wealth does not grow too quickly, which requires a joint restriction on asset returns and discounting. When $\{R_{t}\}$ and $\{\beta_{t}\}$ are constant at values $R$ and $\beta$ , the standard restriction from the existing literature is $\beta R<1$ . A generalization using $G_{\beta R}$ as defined in (2) is

Assumption 2.2.

The discount factor and return processes satisfy $G_{\beta R}<1$ .

Finally, we impose routine technical restrictions on non-financial income. The second restriction is needed to exploit first order conditions.

Assumption 2.3.

$\mathbbm{E}\,Y<\infty$ and $\mathbbm{E}\,u^{\prime}(Y)<\infty$ .

Next we provide one example where Assumptions 2.1–2.3 are easily verified. More complex examples are deferred to Sections 4 and 5.

Example 2.1.

Suppose, as in Benhabib et al. (2015), that there is a constant discount factor $\beta<1$ , utility is CRRA with $\gamma\geq 1$ , $\left\{R_{t}\right\}$ and $\left\{Y_{t}\right\}$ are iid, mutually independent, supported on bounded closed intervals of strictly positive real numbers, and, moreover,

[TABLE]

Assumptions 2.1–2.3 are all satisfied in this case. To see this, observe that $G_{\beta}=\beta<1$ in the constant discount case, so Assumption 2.1 holds. Since $x\mapsto x^{1-\gamma}$ is convex when $\gamma\geq 1$ , Jensen’s inequality implies that $\mathbbm{E}R_{t}^{1-\gamma}\geq(\mathbbm{E}R_{t})^{1-\gamma}$ . Multiplying both sides of the last inequality by $\beta(\mathbbm{E}R_{t})^{\gamma}$ yields

[TABLE]

By the second condition of (7), Assumption 2.2 holds. Assumption 2.3 also holds because $Y_{t}$ is restricted to a compact subset of the positive reals.

2.3. Optimality: Definitions and Fundamental Properties

To consider optimality, we temporarily assume that $a_{0}>0$ and set the asset space to $(0,\infty)$ .151515Assumption 2.3 combined with $u^{\prime}(0)=\infty$ implies that $\mathbbm{P}\{Y_{t}>0\}=1$ for all $t\geq 1$ . Hence, $\mathbbm{P}\{a_{t}>0\}=1$ for all $t\geq 1$ and excluding zero from the asset space makes no difference to optimality. The state space for $\{(a_{t},Z_{t})\}_{t\geq 0}$ is then $\SS_{0}:=(0,\infty)\times\mathsf{Z}$ . A feasible policy is a Borel measurable function $c\colon\SS_{0}\to\mathbbm{R}$ with $0\leq c(a,z)\leq a$ for all $(a,z)\in\SS_{0}$ . A feasible policy $c$ and initial condition $(a,z)\in\SS_{0}$ generate an asset path $\{a_{t}\}_{t\geq 0}$ via (2.1) when $c_{t}=c(a_{t},Z_{t})$ and $(a_{0},Z_{0})=(a,z)$ . The lifetime value of policy $c$ is

[TABLE]

where $\{a_{t}\}$ is the asset path generated by $(c,(a,z))$ . In the Appendix we show that $V_{c}$ is well-defined on $\SS_{0}$ . A feasible policy $c^{*}$ is called optimal if $V_{c}\leq V_{c^{*}}$ on $\SS_{0}$ for any feasible policy $c$ . A feasible policy is said to satisfy the first order optimality condition if

[TABLE]

for all $(a,z)\in\SS_{0}$ , and equality holds when $c(a,z)<a$ . Noting that $u^{\prime}$ is decreasing, the first order optimality condition can be compactly stated as

[TABLE]

for all $(a,z)\in\SS_{0}$ . A feasible policy is said to satisfy the transversality condition if, for all $(a,z)\in\SS_{0}$ ,

[TABLE]

Theorem 2.1 (Sufficiency of first order and transversality conditions).

If Assumptions 2.1–2.3 hold, then every feasible policy satisfying the first order and transversality conditions is an optimal policy.

2.4. Existence and Computability of Optimal Consumption

Let $\mathscr{C}$ be the space of continuous functions $c\colon\SS_{0}\to\mathbbm{R}$ such that $c$ is increasing in the first argument, $0<c(a,z)\leq a$ for all $(a,z)\in\SS_{0}$ , and

[TABLE]

To compare two consumption policies, we pair $\mathscr{C}$ with the distance

[TABLE]

which evaluates the maximal difference in terms of marginal utility. While elements of $\mathscr{C}$ are not generally bounded, $\rho$ is a valid metric on $\mathscr{C}$ . In particular, $\rho$ is finite on $\mathscr{C}$ since $\rho(c,d)\leq\left\|u^{\prime}\circ c-u^{\prime}\right\|+\left\|u^{\prime}\circ d-u^{\prime}\right\|$ , and the last two terms are finite by (12). In Appendix B, we show that $(\mathscr{C},\rho)$ is a complete metric space. The following proposition shows that, for any policy in $\mathscr{C}$ , the first order optimality condition (10) implies the transversality condition.

Proposition 2.2 (Sufficiency of first order condition).

Let Assumptions 2.1–2.3 hold. If $c\in\mathscr{C}$ and the first order optimality condition (10) holds for all $(a,z)\in\SS_{0}$ , then $c$ satisfies the transversality condition. In particular, $c$ is an optimal policy.

We aim to characterize the optimal policy as the fixed point of the time iteration operator $T$ defined as follows: for fixed $c\in\mathscr{C}$ and $(a,z)\in\SS_{0}$ , the value of the image $Tc$ at $(a,z)$ is defined as the $\xi\in(0,a]$ that solves

[TABLE]

where $\psi_{c}$ is the function on

[TABLE]

defined by

[TABLE]

The following theorem shows that the time iteration operator is an $n$ -step contraction mapping on a complete metric space of candidate policies and its fixed point is the unique optimal policy.

Theorem 2.2 (Existence, uniqueness and computability of optimal policies).

If Assumptions 2.1–2.3 hold, then there exists an $n$ in $\mathbbm{N}$ such that $T^{n}$ is a contraction mapping on $(\mathscr{C},\rho)$ . In particular,

(1)

$T$ * has a unique fixed point $c^{*}\in\mathscr{C}$ .* 2. (2)

The fixed point $c^{*}$ is the unique optimal policy in $\mathscr{C}$ . 3. (3)

For all $c\in\mathscr{C}$ we have $\rho(T^{k}c,c^{*})\to 0$ as $k\to\infty$ .

Part (3) shows that, under our conditions, the familiar time iteration algorithm is globally convergent, provided one starts with some policy in the candidate class $\mathscr{C}$ .

2.5. Properties of Optimal Consumption

In this section we study the properties of the optimal consumption function obtained in Theorem 2.2. Assumptions 2.1–2.3 are held to be true throughout. The following two propositions show the monotonicity of the consumption function, which is intuitive.

Proposition 2.3 (Monotonicity with respect to wealth).

The optimal consumption and savings functions $c^{*}(a,z)$ and $i^{*}(a,z):=a-c^{*}(a,z)$ are increasing in $a$ .

Proposition 2.4 (Monotonicity with respect to income).

If $\{Y_{1t}\}$ and $\{Y_{2t}\}$ are two income processes satisfying $Y_{1t}\leq Y_{2t}$ for all $t$ and $c_{1}^{*}$ and $c_{2}^{*}$ are the corresponding optimal consumption functions, then $c_{1}^{*}\leq c_{2}^{*}$ pointwise on $\SS_{0}$ .

Under further assumptions we can show that the optimal policy is concave and asymptotically linear with respect to the wealth level.

Proposition 2.5 (Concavity and asymptotic linearity of consumption function).

If for each $z\in\mathsf{Z}$ and $c\in\mathscr{C}$ that is concave in its first argument,

[TABLE]

then

(1)

$a\mapsto c^{*}(a,z)$ * is concave, and* 2. (2)

there exists $\alpha(z)\in[0,1]$ such that $\lim_{a\to\infty}[c^{*}(a,z)/a]=\alpha(z)$ .

Remark 2.1.

Condition (17) imposes some concavity structure on utility. It holds for the constant relative risk aversion (CRRA) utility function

[TABLE]

as shown in Appendix B.

Proposition 2.5 states that $c^{*}(a,z)\approx\alpha(z)a+b(z)$ for some function $b(z)$ when $a$ is large. This provides justification for linearly extrapolating the policy functions when computing them at high wealth levels.

Together, parts (1) and (2) of Proposition 2.5 imply the linear lower bound $c^{*}(a,z)\geq\alpha(z)a$ , although they do not provide a concrete number for $\alpha(z)$ . The following proposition establishes an explicit linear lower bound.

Proposition 2.6 (Linear lower bound on consumption).

If there exists a nonnegative constant $\bar{s}$ such that

[TABLE]

then $c^{*}(a,z)\geq(1-\bar{s})a$ for all $(a,z)\in\SS_{0}$ .161616We adopt the convention $0\cdot\infty=0$ , so condition (19) does not rule out the case $\mathbbm{P}\{R_{t}=0\mid Z_{t-1}=z\}>0$ . Indeed, as shown in the proofs, the conclusions still hold if we replace this condition by the weaker alternative $\mathbbm{E}_{z}\hat{\beta}\hat{R}\,u^{\prime}[\hat{R}\bar{s}a+(1-\bar{s})\hat{Y}]\leq u^{\prime}(a)$ for all $(a,z)\in\SS_{0}$ .

The second inequality in (19) restricts marginal utility derived from transferring wealth to the next period and then consuming versus consuming wealth today. The value $\bar{s}$ can be clarified once primitives are specified, as the next example illustrates.

Example 2.2.

Suppose that utility is CRRA, as in (18). If we now take

[TABLE]

and $\bar{s}<1$ , then the conditions of Proposition 2.6 hold. In particular, the second inequality in (19) holds, as follows directly from the definition of $\bar{s}$ and $u^{\prime}(x)=x^{-\gamma}$ . In the case of Benhabib et al. (2015), where the discount rate is constant and returns are iid, the expression in (20) reduces to $\bar{s}:=(\beta\mathbbm{E}R_{t}^{1-\gamma})^{1/\gamma}$ . The requirement $\bar{s}<1$ then reduces to $\beta\mathbbm{E}R_{t}^{1-\gamma}<1$ , which is one of their assumptions (see Example 2.1).

3. Stationarity, Ergodicity, and Tail Behavior

This section focuses on stationarity, ergodicity and tail behavior of wealth under the unique optimal policy $c^{*}$ obtained in Theorem 2.2. So that this policy exists, Assumptions 2.1–2.3 are always taken to be valid. We extend $c^{*}$ to $\SS$ by setting $c^{*}(0,z)=0$ for all $z\in\mathsf{Z}$ and consider dynamics of $(a_{t},Z_{t})$ on $\SS:=\mathbbm{R}_{+}\times\mathsf{Z}$ , the law of motion for which is

[TABLE]

Let $Q$ be the joint stochastic kernel of $(a_{t},Z_{t})$ on $\SS$ . See Appendix A for this and related definitions.

3.1. Stationarity

To obtain existence of a stationary distribution we need to restrict the asymptotic growth rate for asset returns $G_{R}$ defined in (3).

Assumption 3.1.

There exists a constant $\bar{s}$ such that (19) holds and $\bar{s}\,G_{R}<1$ .

Below is one straightforward example of a setting where this holds, with more complex applications deferred to Sections 4–5.

Example 3.1.

Assumption 3.1 holds in the setting of Benhabib et al. (2015). As shown in Example 2.2, with $\bar{s}:=(\beta\mathbbm{E}R_{t}^{1-\gamma})^{1/\gamma}$ and the assumptions of Benhabib et al. (2015) in force, the conditions of (19) hold. Moreover, in their iid setting we have $G_{R}=\mathbbm{E}R_{t}$ , so $\bar{s}\,G_{R}<1$ reduces to $(\beta\mathbbm{E}R_{t}^{1-\gamma})^{1/\gamma}\mathbbm{E}R_{t}<1$ . This is one of their conditions, as discussed in Example 2.1.

By Proposition 2.6, the value $\bar{s}$ in Assumption 3.1 is an upper bound on the rate of savings. $G_{R}$ is an asymptotic growth rate for each unit of savings invested. If the product of these is less than one, then probability mass contained in the wealth distribution will not drift to $+\infty$ , which allows us to obtain the following result.171717Assumption 3.1 is weaker than any restriction implying wealth is bounded from above—a common device for compactifying the state space and thereby obtaining a stationary distribution. Indeed, under many specifications of $\{Y_{t}\}$ and $\{R_{t}\}$ that fall within our framework, wealth of a given household can and will, over an infinite horizon, exceed any finite bound with probability one. See, for example, Benhabib et al. (2015), Proposition 6.

Theorem 3.1 (Existence of a stationary distribution).

If Assumption 3.1 holds, then $Q$ admits at least one stationary distribution on $\SS$ .

Stationarity of the form obtained in Theorem 3.1 is required to establish existence of stationary recursive equilibria in heterogeneous agent models with idiosyncratic risk, such as Huggett (1993) or Aiyagari (1994).181818For models with aggregate shocks, such as Krusell and Smith (1998), a fully specified recursive equilibrium requires that households take the wealth distribution as one component of the state in their savings problem, and that stationarity holds for the entire joint distribution (defined over a product space encompassing both the wealth distribution and the exogenous state process). These problems fall outside the scope of Theorem 3.1, since $\{Z_{t}\}$ is finite-valued. For a careful treatment of stationary recursive equilibrium in Krusell–Smith type models, see Cao (2020).

3.2. Ergodicity

While Assumption 3.1 implies existence of a stationary distribution, it is not in general sufficient for uniqueness or stability. For these additional properties to hold, we must impose sufficient mixing. In doing so, we consider the following two cases:

(Y1)

The support of $\{Y_{t}\}$ is finite. 2. (Y2)

The process $\{Y_{t}\}$ admits a density representation.

Condition (Y2) means that there exists a function $f$ from $\mathbbm{R}_{+}\times\mathsf{Z}$ to $\mathbbm{R}_{+}$ such that

[TABLE]

for all Borel sets $A\subset\mathbbm{R}_{+}$ and all $z$ in $\mathsf{Z}$ .

Assumption 3.2.

There exists a $\bar{z}$ in $\mathsf{Z}$ such that $P(\bar{z},\bar{z})>0$ . Moreover, with $y_{\ell}\geq 0$ defined as the greatest lower bound of the support of $\{Y_{t}\}$ , either

•

(Y1) holds and $\mathbbm{P}\{Y_{t}=y_{\ell}\mid Z_{t}=\bar{z}\}>0$ , or

•

(Y2) holds and there exists a $\delta>y_{\ell}$ such that $f\left(\cdot\mid\bar{z}\right)>0$ on $(y_{\ell},\delta)$ .

Assumption 3.2 requires that there is a positive probability of receiving low labor income at some relatively persistent state of the world $\bar{z}$ . This is a mixing condition that enforces social mobility. The reason is that $\{Z_{t}\}$ is already assumed to be irreducible, so $\bar{z}$ is eventually visited by each household. For any such household, there is a positive probability of low labor income over a long period. Wealth then declines. In other words, currently rich households or dynasties will not be rich forever. This guarantees sufficient social mobility between rich and poor, generating ergodicity.

To state our uniqueness and stability results, let $Q^{t}$ be the $t$ -step stochastic kernel, let $\|\cdot\|_{TV}$ be total variation norm and let $V(a,z):=a+m_{V}$ , where $m_{V}$ is a constant to be defined in the proof. For any integrable real-valued function $h$ on $\SS$ , let

[TABLE]

and

[TABLE]

where, here and in the theorem below, $\mathbbm{E}$ indicates expectation under stationarity.

Theorem 3.2 (Uniqueness, stability, ergodicity and mixing).

If Assumptions 3.1 and 3.2 hold, then

(1)

the stationary distribution $\psi_{\infty}$ of $Q$ is unique and there exist constants $\lambda<1$ and $M<\infty$ such that,

[TABLE] 2. (2)

For all $(a,z)\in\SS$ and real-valued function $h$ on $\SS$ such that $\mathbbm{E}|h(a_{t},Z_{t})|<\infty$ ,

[TABLE] 3. (3)

$Q$ * is $V$ -geometrically mixing. Moreover, if $\gamma_{h}^{2}>0$ and $h^{2}/V$ is bounded,*

[TABLE]

Part 1 of Theorem 3.2 states that the stationary distribution $\psi_{\infty}$ is unique and asymptotically attracting at a geometric rate. Part 2 states that the state process is ergodic, and hence long-run sample moments for individual households coincide with cross-sectional moments. The notion of mixing discussed in Part 3 is defined in the appendix. It states that social mobility holds asymptotically and mixing occurs at a geometric rate, although the rate may be arbitrarily slow. This mixing is enough to provide a Central Limit Theorem for the state process, which is the second claim in Part 3.

3.3. Tail Behavior

Having established the stationarity and ergodicity of wealth, we now study the tail behavior of the wealth distribution. We show that the wealth distribution is either bounded or (unbounded and) heavy-tailed under mild conditions. To prove this result we introduce the following assumption.

Assumption 3.3.

The assumptions of Proposition 2.5 are satisfied, so the optimal policy $a\mapsto c^{*}(a,z)$ is concave and asymptotically linear: $\lim_{a\to\infty}c^{*}(a,z)/a=\alpha(z)\in[0,1]$ . Furthermore, there exists $\bar{z}\in\mathsf{Z}$ such that $P(\bar{z},\bar{z})>0$ and

[TABLE]

Remark 3.1.

Condition (23) implies that wealth grows with nonzero probability when it is large. Indeed, using the law of motion (21a) and noting that $Y\geq 0$ , if $Z_{t}=Z_{t+1}=\bar{z}$ , then by (23) we have

[TABLE]

with positive probability if $a_{t}$ is large enough.

To state our result on tail behavior, we introduce the following notation. For any nonnegative function $A(z,\hat{z},\hat{\zeta})$ , define the $\mathsf{Z}\times\mathsf{Z}$ matrix-valued function $M_{A}$ by

[TABLE]

Elements of $M_{A}(s)$ are conditional moment generating functions of $\log A$ . In the statement below, $\odot$ denotes the Hadamard (entry-wise) product, and $r(\cdot)$ returns the spectral radius of a matrix. Also $a_{\infty}$ is a random variable with distribution $\psi_{\infty}(\cdot,\mathsf{Z})$ .

Theorem 3.3 (Tail behavior).

Let Assumptions 3.1–3.3 hold and define

[TABLE]

Then $\lambda$ is convex in $s\geq 0$ . Assume that there exists $s>0$ in the interior of the domain of $\lambda$ such that $1<\lambda(s)<\infty$ and let

[TABLE]

If $a_{\infty}$ has unbounded support, then it is heavy-tailed. In particular, for any $\varepsilon>0$ ,

[TABLE]

Remark 3.2.

The assumption $1<\lambda(s)<\infty$ for some $s>0$ is weak. Because the $(\bar{z},\bar{z})$ -th element of $P\odot M_{A}(s)$ is

[TABLE]

by the definition of $G$ in (25a) and condition (23), we always have $\lambda(s)\to\infty$ as $s\to\infty$ . Hence there exists $s>0$ such that $\lambda(s)\in(1,\infty)$ if, for example, $\hat{\zeta}$ has a compact support.

Condition (27) implies that for any $\varepsilon>0$ , there exists a constant $C(\varepsilon)>0$ such that

[TABLE]

for large enough $a$ , so the upper tail of the wealth distribution is at least Pareto.

Remark 3.3.

Toda (2019) constructs an example of a Huggett (1993) economy with Pareto-tailed wealth distribution when discount factors are random. Theorem 3.3 is significantly more general as we allow for stochastic returns and income. Stachurski and Toda (2019) prove that with constant discount factor, constant asset return, and light-tailed income, the wealth distribution is always light-tailed. Theorem 3.3 shows that sufficient heterogeneity in discount factor or returns generates heavy tails.

Example 3.2.

The CRRA-iid setting of Benhabib et al. (2015) satisfies the assumptions of Theorem 3.3. When utility is CRRA, by Proposition 5 of Benhabib et al. (2015), condition (23) holds if $R(\bar{z},\hat{\zeta})>1/\bar{s}$ with positive probability, where $\bar{s}$ is given in Example 2.2. In the iid case, this condition reduces to $\mathbbm{P}\{(\beta\mathbbm{E}R_{t}^{1-\gamma})^{1/\gamma}R_{t}>1\}>0$ , which holds under the conditions of Benhabib et al. (2015).191919Benhabib et al. (2015) assume that $\mathbbm{P}\{\beta R_{t}>1\}>0$ , so it suffices to show that $(\beta\mathbbm{E}R_{t}^{1-\gamma})^{1/\gamma}\geq\beta$ or, equivalently, $\mathbbm{E}(\beta R_{t})^{1-\gamma}\geq 1$ . By Jensen’s inequality and their restriction $\gamma\geq 1$ , the last bound is true whenever $(\mathbbm{E}\beta R_{t})^{1-\gamma}\geq 1$ . But this must hold because, under their conditions, we have $\beta\mathbbm{E}R_{t}<1$ , as shown in Example 2.1. Thus, Assumption 3.3 holds. The existence of $s>0$ with $\lambda(s)\in(1,\infty)$ follows from Remark 3.2 and the assumption that $R_{t}$ has a compact support.

4. Testing the Growth Conditions

The three key conditions in the paper are the restrictions on the growth rates $G_{\beta}$ , $G_{\beta R}$ and $G_{R}$ , with the first two required for optimality and the last for stationarity (see Assumptions 2.1, 2.2 and 3.1 respectively). In this section we explore the restrictions implied by these conditions. We begin with the following result, which yields a straightforward method for computing these growth rates.

Lemma 4.1 (Long-run growth rates and spectral radii).

Let $\varphi_{t}=\varphi(Z_{t},\xi_{t})$ , where $\varphi$ is a nonnegative measurable function and $\{\xi_{t}\}$ is an iid sequence with marginal distribution $\pi$ . In this setting we have

[TABLE]

and $r(L_{\varphi})$ is the spectral radius of the matrix defined by

[TABLE]

The matrix $L_{\varphi}$ is expressed as a function on $\mathsf{Z}\times\mathsf{Z}$ in (29) but can be represented in traditional matrix notation by enumerating $\mathsf{Z}$ .202020Specifically, if $\mathsf{Z}:=\{z_{1},\dots,z_{N}\}$ , then $L_{\varphi}=PD_{\varphi}$ where $P$ is, as before, the transition matrix for the exogenous state, and $D_{\varphi}:=\operatorname{diag}\left(\mathbbm{E}_{z_{1}}\varphi,\dots,\mathbbm{E}_{z_{N}}\varphi\right)$ when $\mathbbm{E}_{z}\varphi:=\mathbbm{E}_{z}\varphi(z,\hat{\xi})$ . In what follows, $D_{\beta}$ , $D_{R}$ and $D_{\beta R}$ are defined analogously to $D_{\varphi}$ .

What factors determine the long-run average growth rates embedded in our assumptions, such as $G_{\beta}$ or $G_{R}$ ? Lemma 4.1 tells us how to compute these values for a given specification of dynamics, but how should we understand them intuitively and what factors determine their size? To address these questions, let us consider an AR(1) discount factor process, which has been adopted in several recent quantitative studies (see, e.g., Hubmer et al. (2018) or Hills and Nakata (2018)). In particular, suppose that the state process follows a discretized version of

[TABLE]

and $\beta_{t}=Z_{t}$ . (The discretization implies that $\beta_{t}$ is always positive.) To simplify interpretation, the process (30) is structured so that the stationary distribution of $\{Z_{t}\}$ is $N(\mu,\sigma^{2})$ . We use Rouwenhorst (1995)’s method to discretize $\{Z_{t}\}$ and then calculate $G_{\beta}$ using Lemma 4.1, studying how $G_{\beta}$ is affected by the parameters in (30).

Since $\beta_{t}=Z_{t}$ for all $t$ , the structure of (30) implies that $\mu$ is the long-run unconditional mean of $\{\beta_{t}\}$ . It can therefore be set to standard calibrated value for the discount factor, such as $0.99$ from Krusell and Smith (1998). What we wish to understand is how the remaining parameters $\rho$ and $\sigma$ affect the value of $G_{\beta}$ . While no closed form expression is available in this case, Figure 1 sheds some light by providing a contour plot of $G_{\beta}$ over a set of $(\rho,\sigma)$ pairs. The figure shows that $G_{\beta}$ grows with both the persistence term $\rho$ and volatility term $\sigma$ . In particular, the condition $G_{\beta}<1$ fails when the persistence and volatility of the discount factor process are sufficiently high. This is because $G_{\beta}$ is the limit of $\left(\mathbbm{E}\prod_{t=1}^{n}\beta_{t}\right)^{1/n}$ and, for positive random variables, sequence of large outcomes have a strong compounding effect on their product. High volatility and high persistence reinforce this effect.

This discussion has focused on $G_{\beta}$ but similar intuition applies to both $G_{R}$ and $G_{\beta R}$ . If $\beta_{t}$ and $R_{t}$ are both increasing functions of the state process, then these asymptotic growth rates also increase with greater persistence and volatility in the state process, as well as higher unconditional mean. The next section further illustrates these points.

5. Application: Stochastic Volatility and Mean Persistence

We showed in Examples 2.1, 2.2 and 3.1 that, in the setting of Benhabib et al. (2015), where the discount factor is constant and returns and labor income are iid, Assumptions 2.1–2.3 and Assumption 3.1 are all satisfied. Hence, by Theorems 2.2 and 3.1, the household optimization problem has a unique optimal policy and the wealth process under this policy has a stationary solution. If, in addition, the support of $Y_{t}$ is finite or $Y_{t}$ has a positive density, say, then the conditions of Theorem 3.2 also hold and the stationary solution is ergodic, geometrically mixing and its time series averages are asymptotically normal.

Let us now bring the model closer to the data by relaxing the iid restrictions on financial and non-financial returns, introducing both mean persistence and time varying volatility in returns on assets.212121The importance of these features for wealth dynamics was highlighted in Fagereng et al. (2016a). In particular, we set

[TABLE]

where $\{\zeta_{t}\}$ is iid and standard normal and $\{\mu_{t}\}$ and $\{\sigma_{t}\}$ are finite-state Markov chains, discretized from

[TABLE]

Innovations are iid and standard normal. Using the data in Fagereng et al. (2016b) on Norwegian financial returns over 1993–2003, we estimate these AR(1) models to obtain $\bar{\mu}=0.0281$ , $\rho_{\mu}=0.5722$ , $\delta_{\mu}=0.0067$ , $\bar{\sigma}=-3.2556$ , $\rho_{\sigma}=0.2895$ and $\delta_{\sigma}=0.1896$ . Based on this calibration, the stationary mean and standard deviation of $\{R_{t}\}$ are around $1.03$ and $4\%$ , respectively.

To distinguish the effects of stochastic volatility and mean persistence, we consider two subsidiary models. The first reduces $\{\mu_{t}\}$ to its stationary mean $\bar{\mu}$ , while the second reduces $\{\sigma_{t}\}$ to its stationary mean $\tilde{\sigma}:=\mathrm{e}^{\bar{\sigma}+\delta_{\sigma}^{2}/2(1-\rho_{\sigma}^{2})}$ . In summary,

[TABLE]

We set $\beta=0.95$ and $\gamma=1.5$ . To test the stability properties of Model @slowromancapi@, we explore a neighborhood of the calibrated $(\rho_{\sigma},\delta_{\sigma})$ values, while in Model @slowromancapii@, we do likewise for $(\rho_{\mu},\delta_{\mu})$ pairs. In each scenario, other parameters are fixed to the benchmark. The results are shown in Figures 2 and 3.

In part (a) of each figure, we see that $G_{\beta R}$ is increasing in the persistence and volatility parameters of the state process. The intuition behind this feature was explained in Section 4 for the case of $G_{\beta}$ and is similar here. (Note that $G_{\beta R}=\beta G_{R}$ in the present case, since $\beta_{t}\equiv\beta$ is a constant, so $G_{\beta R}$ has the same shape as $G_{R}$ in terms of contours.) The dots in the figures show that $G_{\beta R}<1$ at the estimated parameter values.

Part (b) of each figure shows the set of parameters under which the model is globally stable and ergodic. The stability threshold is the boundary of the set of parameter pairs that produce $\max\{G_{\beta R},\bar{s},\bar{s}G_{R}\}<1$ , where $\bar{s}$ is given by (20). For such pairs, Assumptions 2.2 and 3.1 both hold, so the conditions of Theorems 3.1–3.2 are satisfied. (We are continuing to suppose that $Y_{t}$ is finite or has a positive density, so that Assumption 3.2 holds. Assumptions 2.1 and 2.3 are always valid in the current setting). Observe that the estimated parameter values (dot points) lie inside the stable set.

6. Conclusion

We studied an updated version of the income fluctuation problem, the “common ancestor” of modern macroeconomic theory (Ljungqvist and Sargent (2012), p. 3.) Working in a setting where returns on financial assets, non-financial income and impatience are all state dependent and fluctuate over time, we obtained conditions under which the household savings problem has a unique solution that can be computed by successive approximations and the wealth process under the optimal savings policy has a unique stationary distribution with Pareto right tail. We also obtained conditions under which wealth is ergodic and exhibits geometric mixing and asymptotic normality. We investigated the nature of our conditions and provided methods for testing them in applications. While our work was motivated by the desire to better understand the joint distribution of income and wealth, the income fluctuation problem also has applications in asset pricing, life-cycle choice, fiscal policy, monetary policy, optimal taxation, and social security. The ideas contained in this paper should be helpful for those fields after suitable modifications or extensions.

Appendix A Preliminaries

Given a topological space $\mathsf{T}$ , let $\mathscr{B}(\mathsf{T})$ be the Borel $\sigma$ -algebra and $\mathscr{P}(\mathsf{T})$ be the probability measures on $\mathscr{B}(\mathsf{T})$ . A stochastic kernel $\Pi$ on $\mathsf{T}$ is a map $\Pi\colon\mathsf{T}\times\mathscr{B}(\mathsf{T})\to[0,1]$ such that $x\mapsto\Pi(x,B)$ is $\mathscr{B}(\mathsf{T})$ -measurable for each $B\in\mathscr{B}(\mathsf{T})$ and $B\mapsto\Pi(x,B)$ is a probability measure on $\mathscr{B}(\mathsf{T})$ for each $x\in\mathsf{T}$ . For all $t\in\mathbbm{N}$ , $x,y\in\mathsf{T}$ and $B\in\mathscr{B}(\mathsf{T})$ , we define $\Pi^{1}:=\Pi$ and $\Pi^{t}(x,B):=\int\Pi^{t-1}(y,B)\Pi(x,\mathop{}\!\mathrm{d}y)$ . Furthermore, for all $\mu\in\mathscr{P}(\mathsf{T})$ , let $(\mu\Pi^{t})(B):=\int\Pi^{t}(x,B)\mu(\mathop{}\!\mathrm{d}x)$ . $\Pi$ is called Feller if $x\mapsto\int h(y)\Pi(x,\mathop{}\!\mathrm{d}y)$ is continuous on $\mathsf{T}$ whenever $h$ is bounded and continuous on $\mathsf{T}$ . We call $\psi\in\mathscr{P}(\mathsf{T})$ stationary for $\Pi$ if $\psi\Pi=\psi$ .

A sequence $\{\mu_{n}\}\subset\mathscr{P}(\mathsf{T})$ is called tight, if, for all $\varepsilon>0$ , there exists a compact $K\subset\mathsf{T}$ such that $\mu_{n}(\mathsf{T}\backslash K)\leq\varepsilon$ for all $n$ . A stochastic kernel $\Pi$ is called bounded in probability if the sequence $\{Q^{t}(x,\cdot)\}_{t\geq 0}$ is tight for all $x\in\mathsf{T}$ . Given $\mu\in\mathscr{P}(\mathsf{T})$ , we define the total variation norm $\|\mu\|_{TV}:=\sup_{g:|g|\leq 1}\left|\int g\mathop{}\!\mathrm{d}\mu\right|$ . Given any measurable map $V\colon\mathsf{T}\to[1,\infty)$ , we say that $\Pi$ is $V$ -geometrically mixing if there exist constants $M<\infty$ and $\lambda<1$ such that, for all $x\in\mathsf{T}$ and $t\geq 0$ , the corresponding Markov process $\{X_{t}\}$ satisfies $\sup_{k\geq 0;\,h^{2},\,g^{2}\leq V}\left|\mathbbm{E}_{x}g(X_{t})h(X_{t+k})-\left[\mathbbm{E}_{x}g(X_{t})\right]\left[\mathbbm{E}_{x}h(X_{t+k})\right]\right|\leq\lambda^{t}MV(x)$ .

Below we use $(\Omega,\mathscr{F},\mathbbm{P})$ to denote a fixed probability space on which all random variables are defined. $\mathbbm{E}$ is expectations with respect to $\mathbbm{P}$ . The state process $\{Z_{t}\}$ and the innovation processes $\{\varepsilon_{t}\}$ , $\{\zeta_{t}\}$ and $\{\eta_{t}\}$ introduced in (5) live on this space. In what follows, $\{Z_{t}\}$ is a stationary version of the chain, where $Z_{0}$ is drawn from its unique stationary distribution—henceforth denoted $\pi_{Z}$ . The marginal distributions of the innovations are denoted by $\pi_{\varepsilon}$ , $\pi_{\zeta}$ and $\pi_{\eta}$ respectively. We let $\{\mathscr{F}_{t}\}$ be the natural filtration generated by $\{Z_{t}\}$ and the three innovation processes. $\mathbbm{P}_{z}$ conditions on $Z_{0}=z$ and $\mathbbm{E}_{z}$ is expectation under $\mathbbm{P}_{z}$ .

We first prove Lemma 4.1, since its implications will be used immediately below. In the proof, we consider the matrix $L_{\varphi}$ as a linear operator on $\mathbbm{R}^{\mathsf{Z}}$ and identify vectors in $\mathbbm{R}^{\mathsf{Z}}$ with real-valued functions on $\mathsf{Z}$ .

Proof of Lemma 4.1.

A proof by induction confirms that, for any function $h\in\mathbbm{R}^{\mathsf{Z}}$ ,

[TABLE]

where $L_{\varphi}^{n}$ is the $n$ -th composition of the operator $L_{\varphi}$ with itself (or, equivalently, the $n$ -th power of the matrix $L_{\varphi}$ ). The positivity of $L_{\varphi}$ and Theorem 9.1 of Krasnosel’skii et al. (2012) imply that $r(L_{\varphi})=\lim_{n\to\infty}\|L_{\varphi}^{n}\,h\|^{1/n}$ when $\|\cdot\|$ is any norm on $\mathbbm{R}^{\mathsf{Z}}$ and $h$ is everywhere positive on $\mathsf{Z}$ . With $h\equiv 1$ and $\|f\|=\mathbbm{E}|f(Z_{0})|$ , this becomes

[TABLE]

where the second equality is due to (32) and $h=\mathbbm{1}$ and the third is by the law of iterated expectations. ∎

Lemma A.1.

Let $\{\varphi_{t}\}$ and $G_{\varphi}$ be as defined in Lemma 4.1. If $G_{\varphi}<1$ , then there exists an $N$ in $\mathbbm{N}$ and a $\delta<1$ such that $\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}\prod_{t=1}^{n}\varphi_{t}<\delta^{n}$ whenever $n\geq N$ .

Proof.

Recalling from the proof of Lemma 4.1 that $r(L_{\varphi})=\lim_{n\to\infty}\|L_{\varphi}^{n}\,h\|^{1/n}$ when $\|\cdot\|$ is any norm on $\mathbbm{R}^{\mathsf{Z}}$ and $h$ is everywhere positive on $\mathsf{Z}$ , we can again take $h\equiv 1$ but now switch to $\|f\|=\max_{z\in\mathsf{Z}}|f(z)|$ , so that (33) becomes

[TABLE]

Since $r(L_{\varphi})=G_{\varphi}$ and $G_{\varphi}<1$ , the claim in Lemma A.1 now follows. ∎

Appendix B Proof of Section 2 Results

Proof of Proposition 2.1.

Pick any $a\geq 0$ and $z\in\mathsf{Z}$ . Since $c_{t}=Y_{t}$ for all $t$ is dominated by a feasible consumption path, monotonicity of $u$ and the law of iterated expectations give

[TABLE]

where $h(Z_{t}):=\mathbbm{E}_{Z_{t}}u(Y)$ and the monotone convergence theorem has been employed to pass the expectation through the sum. In view of (32) and $\beta_{0}=1$ , we then have

[TABLE]

By the assumed almost sure positivity of $\beta_{t}$ and the irreducibility of $P$ , the matrix $L_{\beta}$ is irreducible. Hence, by the Perron–Frobenius theorem, we can choose an everywhere positive eigenfunction $e$ such that $L_{\beta}e=r(L_{\beta})e$ . By the everywhere positivity of $u(Y_{t})$ , the function $h$ is everywhere positive on $\mathsf{Z}$ , and hence we can choose $\alpha>0$ such that $e_{\alpha}:=\alpha e$ is less than $h$ pointwise on $\mathsf{Z}$ . We then have

[TABLE]

By lemma 4.1 we know that $r(L_{\beta})\geq 1$ , and since $\alpha$ and $e$ are positive, this expression is infinite. Returning to (35), we see that the value function is infinite at our arbitrarily chosen pair $(a,z)$ . ∎

For the rest of this section we suppose that Assumptions 2.1–2.3 hold.

Lemma B.1.

$M_{1}:=\sum_{t=0}^{\infty}\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}\prod_{i=1}^{t}\beta_{i}$ * and $M_{2}:=\sum_{t=0}^{\infty}\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}\prod_{i=1}^{t}\beta_{i}R_{i}$ , are finite, as are the constants $M_{3}=\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}Y$ and $M_{4}=\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}u^{\prime}(Y)$ .*

Proof.

That $M_{1}$ and $M_{2}$ are finite follows directly from Lemma A.1, with $\varphi_{t}=\beta_{t}$ and $\varphi_{t}=\beta_{t}R_{t}$ respectively. Regarding $M_{3}$ , Assumption 2.3 states that $\mathbbm{E}Y<\infty$ . By the Law of Iterated Expectations, we can write this as $\sum_{z\in\mathsf{Z}}\mathbbm{E}_{z}Y\pi_{Z}(z)<\infty$ . As $\{Z_{t}\}$ is irreducible, we know that $\pi_{Z}$ is positive everywhere on $\mathsf{Z}$ . Hence, $M_{3}<\infty$ must hold. The proof of $M_{4}<\infty$ is similar. ∎

Lemma B.2.

For the maximal asset path $\{\tilde{a}_{t}\}$ defined by

[TABLE]

we have, for each $(a,z)\in\SS_{0}$ , that $M(a,z):=\sum_{t=0}^{\infty}\mathbbm{E}_{a,z}\prod_{i=0}^{t}\beta_{i}\,\tilde{a}_{t}<\infty$ .

Proof.

Iterating backward on (36), we can show that $\tilde{a}_{t}=\prod_{i=1}^{t}R_{i}\,a+\sum_{j=1}^{t}Y_{j}\prod_{i=j+1}^{t}R_{i}$ . Taking expectation yields

[TABLE]

Then the Monotone Convergence Theorem and the Markov property imply that

[TABLE]

By Lemma B.1, we now have, for all $(a,z)\in\SS_{0}$ ,

[TABLE]

Applying Lemma B.1 again gives $M(a,z)<\infty$ , as was to be shown. ∎

Proposition B.1.

The value $V_{c}(a,z)$ in (8) is well-defined in $\{-\infty\}\cup\mathbbm{R}$ .

Proof.

By the assumptions on the utility function, there exists a constant $B\in\mathbbm{R}_{+}$ such that $u(c)\leq c+B$ , and hence $V_{c}(a,z)\leq\mathbbm{E}_{a,z}\sum_{t=0}^{\infty}\prod_{i=0}^{t}\beta_{i}\,u(\tilde{a}_{t})\leq M(a,z)+B\sum_{t=0}^{\infty}\mathbbm{E}_{z}\prod_{i=0}^{t}\beta_{i}$ . The last term is finite by Lemma A.1. ∎

Proof of Thoerem 2.1.

The proof is a long but relatively straightforward extension of Theorem 1 of Benhabib et al. (2015) and thus omitted. A full proof is available from the authors upon request. ∎

Proposition B.2.

$(\mathscr{C},\rho)$ * is a complete metric space.*

Proof.

The proof is a straightforward extension of Proposition 4.1 of Li and Stachurski (2014) and thus omitted. A full proof is available from the authors upon request. ∎

Proof of Proposition 2.2.

Let $c$ be a policy in $\mathscr{C}$ satisfying (10). To show that any asset path generated by $c$ satisfies the transversality condition (11), observe that, by condition (12), we have

[TABLE]

Regarding the first term on the right hand side of (38), fix $A>0$ and observe that

[TABLE]

with probability one, where $\tilde{a}_{t}$ is the maximal path defined in (36). We then have

[TABLE]

By Lemma B.1, we have

[TABLE]

and the last expression converges to zero as $t\to\infty$ by Lemma A.1. The second term in (39) also converges to zero by Lemma B.2. Hence $\mathbbm{E}_{a,z}\prod_{i=0}^{t}\beta_{i}\,u^{\prime}(a_{t})a_{t}\to 0$ as $t\to\infty$ , which, combined with (38) and another application of Lemma B.2, gives our desired result. ∎

Proposition B.3.

For all $c\in\mathscr{C}$ and $(a,z)\in\SS_{0}$ , there exists a unique $\xi\in(0,a]$ that solves (14).

Proof.

Fix $c\in\mathscr{C}$ and $(a,z)\in\SS_{0}$ . Because $c\in\mathscr{C}$ , the map $\xi\mapsto\psi_{c}(\xi,a,z)$ is increasing. Since $\xi\mapsto u^{\prime}(\xi)$ is strictly decreasing, the equation (14) can have at most one solution. Hence uniqueness holds.

Existence follows from the intermediate value theorem provided we can show that

(a)

$\xi\mapsto\psi_{c}(\xi,a,z)$ is a continuous function, 2. (b)

$\exists\xi\in(0,a]$ such that $u^{\prime}(\xi)\geq\psi_{c}(\xi,a,z)$ , and 3. (c)

$\exists\xi\in(0,a]$ such that $u^{\prime}(\xi)\leq\psi_{c}(\xi,a,z)$ .

For part (a), it suffices to show that

[TABLE]

is continuous on $(0,a]$ . To this end, fix $\xi\in(0,a]$ and $\xi_{n}\to\xi$ . By (37) we have

[TABLE]

The last term is integrable, as follows easily from Lemma B.1. Hence the dominated convergence theorem applies. From this fact and the continuity of $c$ , we obtain $g(\xi_{n})\to g(\xi)$ . Hence, $\xi\mapsto\psi_{c}(\xi,a,z)$ is continuous.

Part (b) clearly holds, since $u^{\prime}(\xi)\to\infty$ as $\xi\to 0$ and $\xi\mapsto\psi_{c}(\xi,a,z)$ is increasing and always finite (since it is continuous as shown in the previous paragraph). Part (c) is also trivial (just set $\xi=a$ ). ∎

Proposition B.4.

We have $Tc\in\mathscr{C}$ for all $c\in\mathscr{C}$ .

Proof.

Fix $c\in\mathscr{C}$ and let $g\left(\xi,a,z\right):=\mathbbm{E}_{z}\hat{\beta}\hat{R}\left(u^{\prime}\circ c\right)[\hat{R}\left(a-\xi\right)+\hat{Y},\,\hat{Z}]$ .

Step 1. We show that $Tc$ is continuous. To apply a standard fixed point parametric continuity result such as Theorem B.1.4 of Stachurski (2009), we first show that $\psi_{c}$ is jointly continuous on the set $G$ defined in (15). This will be true if $g$ is jointly continuous on $G$ . For any $\{(\xi_{n},a_{n},z_{n})\}$ and $(\xi,a,z)$ in $G$ with $(\xi_{n},a_{n},z_{n})\to(\xi,a,z)$ , we need to show that $g(\xi_{n},a_{n},z_{n})\to g(\xi,a,z)$ . To that end, we define

[TABLE]

where $\hat{\beta}:=\beta(\hat{Z},\hat{\varepsilon})$ , $\hat{R}:=R(\hat{Z},\hat{\zeta})$ and $\hat{Y}:=Y(\hat{Z},\hat{\eta})$ as defined in (5). Then $h_{1}$ and $h_{2}$ are continuous in $(\xi,a,\hat{Z})$ by the continuity of $c$ and nonnegative by (40).

By Fatou’s lemma and Theorem 1.1 of Feinberg et al. (2014),

[TABLE]

This implies that

[TABLE]

The function $g$ is then continuous, since the above inequality is equivalent to the statement $\liminf_{n\to\infty}g(\xi_{n},a_{n},z_{n})\geq g(\xi,a,z)\geq\limsup_{n\to\infty}g(\xi_{n},a_{n},z_{n})$ . Hence, $\psi_{c}$ is continuous on $G$ , as was to be shown. Moreover, since $\xi\mapsto\psi_{c}(\xi,a,z)$ takes values in the closed interval $I(a,z):=[u^{\prime}(a),u^{\prime}(a)+\mathbbm{E}_{z}\hat{\beta}\hat{R}(u^{\prime}(\hat{Y})+M)]$ , and the correspondence $(a,z)\mapsto I(a,z)$ is nonempty, compact-valued and continuous, Theorem B.1.4 of Stachurski (2009) then implies that $Tc$ is continuous on $\SS_{0}$ .

Step 2. We show that $Tc$ is increasing in $a$ . Suppose that for some $z\in\mathsf{Z}$ and $a_{1},a_{2}\in(0,\infty)$ with $a_{1}<a_{2}$ , we have $\xi_{1}:=Tc(a_{1},z)>Tc(a_{2},z)=:\xi_{2}$ . Since $c$ is increasing in $a$ by assumption, $\psi_{c}$ is increasing in $\xi$ and decreasing in $a$ . Then $u^{\prime}(\xi_{1})<u^{\prime}(\xi_{2})=\psi_{c}(\xi_{2},a_{2},z)\leq\psi_{c}(\xi_{1},a_{1},z)=u^{\prime}(\xi_{1})$ . This is a contradiction.

Step 3. We have shown in Proposition B.3 that $Tc(a,z)\in(0,a]$ for all $(a,z)\in\SS_{0}$ .

Step 4. We show that $\|u^{\prime}\circ(Tc)-u^{\prime}\|<\infty$ . Since $u^{\prime}[Tc(a,z)]\geq u^{\prime}(a)$ , we have

[TABLE]

for all $(a,z)\in\SS_{0}$ . The right hand side is easily shown to be finite via Lemma B.1. ∎

To prove Theorem 2.2, let $\mathscr{H}$ be all continuous functions $h:\SS_{0}\to\mathbbm{R}$ that is decreasing in its first argument and $(a,z)\mapsto h(a,z)-u^{\prime}(a)$ is bounded and nonnegative. Given $h\in\mathscr{H}$ , let $\tilde{T}h$ be the function mapping $(a,z)\in\SS_{0}$ into the $\kappa$ that solves

[TABLE]

Moreover, consider the bijection $H:\mathscr{C}\to\mathscr{H}$ defined by $Hc:=u^{\prime}\circ c$ .

Lemma B.3.

The operator $\tilde{T}\colon\mathscr{H}\to\mathscr{H}$ and satisfies $\tilde{T}H=HT$ on $\mathscr{C}$ .

Proof.

Pick any $c\in\mathscr{C}$ and $(a,z)\in\SS_{0}$ . Let $\xi:=Tc(a,z)$ , then $\xi$ solves

[TABLE]

We need to show that $HTc$ and $\tilde{T}Hc$ evaluate to the same number at $(a,z)$ . In other words, we need to show that $u^{\prime}(\xi)$ is the solution to

[TABLE]

But this is immediate from (42). Hence, we have shown that $\tilde{T}H=HT$ on $\mathscr{C}$ . Since $H\colon\mathscr{C}\to\mathscr{H}$ is a bijection, we have $\tilde{T}=HTH^{-1}$ . Since in addition $T\colon\mathscr{C}\to\mathscr{C}$ by Proposition B.4, we have $\tilde{T}\colon\mathscr{H}\to\mathscr{H}$ . This concludes the proof. ∎

Lemma B.4.

$\tilde{T}$ * is order preserving on $\mathscr{H}$ . That is, $\tilde{T}h_{1}\leq\tilde{T}h_{2}$ for all $h_{1},h_{2}\in\mathscr{H}$ with $h_{1}\leq h_{2}$ .*

Proof.

Let $h_{1},h_{2}$ be functions in $\mathscr{H}$ with $h_{1}\leq h_{2}$ . Suppose to the contrary that there exists $(a,z)\in\SS_{0}$ such that $\kappa_{1}:=\tilde{T}h_{1}(a,z)>\tilde{T}h_{2}(a,z)=:\kappa_{2}$ . Since functions in $\mathscr{H}$ are decreasing in the first argument, we have

[TABLE]

This is a contradiction. Hence, $\tilde{T}$ is order preserving. ∎

Lemma B.5.

There exists an $n\in\mathbbm{N}$ and $\theta<1$ such that $\tilde{T}^{n}$ is a contraction mapping of modulus $\theta$ on $(\mathscr{H},d_{\infty})$ .

Proof.

Since $\tilde{T}$ is order preserving and $\mathscr{H}$ is closed under the addition of nonnegative constants, based on Blackwell (1965), it remains to verify the existence of $n\in\mathbbm{N}$ and $\theta<1$ such that $\tilde{T}^{n}(h+\gamma)\leq\tilde{T}^{n}h+\theta\gamma$ for all $h\in\mathscr{H}$ and $\gamma\geq 0$ . By Lemma A.1 and Assumption 2.2, it suffices to show that for all $k\in\mathbbm{N}$ and $(a,z)\in\SS_{0}$ , we have

[TABLE]

Fix $h\in\mathscr{H}$ , $\gamma\geq 0$ , and let $h_{\gamma}(a,z):=h(a,z)+\gamma$ . By the definition of $\tilde{T}$ , we have

[TABLE]

Here, the first inequality is elementary and the second is due to the fact that $h\leq h_{\gamma}$ and $\tilde{T}$ is order preserving. Hence, $\tilde{T}(h+\gamma)(a,z)\leq\tilde{T}h(a,z)+\gamma\mathbbm{E}_{z}\beta_{1}R_{1}$ and (43) holds for $k=1$ . Suppose (43) holds for arbitrary $k$ . It remains to show that it holds for $k+1$ . For $z\in\mathsf{Z}$ , define $f(z):=\gamma\mathbbm{E}_{z}\beta_{1}R_{1}\cdots\beta_{k}R_{k}$ . By the induction hypothesis, the monotonicity of $\tilde{T}$ and the Markov property,

[TABLE]

Hence, (43) is verified by induction. This concludes the proof. ∎

Proof of Theorem 2.2.

Let $n$ and $\theta$ be as in Lemma B.5. In view of Propositions 2.2, B.2 and B.4, to show that $T^{n}$ is a contraction and verify claims (1)–(3) of Theorem 2.2, based on the Banach contraction mapping theorem, it suffices to show that $\rho(T^{n}c,T^{n}d)\leq\theta\rho(c,d)$ for all $c,d\in\mathscr{C}$ . To this end, pick any $c,d\in\mathscr{C}$ . Note that the topological conjugacy result established in Lemma B.3 implies that $\tilde{T}=HTH^{-1}$ . Hence, $\tilde{T}^{n}=(HTH^{-1})\cdots(HTH^{-1})=HT^{n}H^{-1}$ and $\tilde{T}^{n}H=HT^{n}$ . By the definition of $\rho$ and the contraction property established in Lemma B.5,

[TABLE]

Hence, $T^{n}$ is a contraction and claims (1)–(3) are verified. ∎

Our next goal is to prove Proposition 2.3. To begin with, we define

[TABLE]

Lemma B.6.

$\mathscr{C}_{0}$ * is a closed subset of $\mathscr{C}$ , and $Tc\in\mathscr{C}_{0}$ for all $c\in\mathscr{C}_{0}$ .*

Proof.

To see that $\mathscr{C}_{0}$ is closed, for a given sequence $\{c_{n}\}$ in $\mathscr{C}_{0}$ and $c\in\mathscr{C}$ with $\rho(c_{n},c)\to 0$ , we need to show that $c\in\mathscr{C}_{0}$ . This obviously holds since $a\mapsto a-c_{n}(a,z)$ is increasing for all $n$ , and, in addition, $\rho(c_{n},c)\to 0$ implies that $c_{n}(a,z)\to c(a,z)$ for all $(a,z)\in\SS_{0}$ .

Fix $c\in\mathscr{C}_{0}$ . We now show that $\xi:=Tc\in\mathscr{C}_{0}$ . Since $\xi\in\mathscr{C}$ by Proposition B.4, it remains to show that $a\mapsto a-\xi(a,z)$ is increasing. Suppose the claim is false, then there exist $z\in\mathsf{Z}$ and $a_{1},a_{2}\in(0,\infty)$ such that $a_{1}<a_{2}$ and $a_{1}-\xi(a_{1},z)>a_{2}-\xi(a_{2},z)$ . Since $a_{1}-\xi(a_{1},z)\geq 0$ , $a_{2}-\xi(a_{2},z)\geq 0$ and $\xi(a_{1},z)\leq\xi(a_{2},z)$ by Proposition B.4, we have $\xi(a_{1},z)<a_{1}$ and $\xi(a_{1},z)<\xi(a_{2},z)$ . However, based on the property of the time iteration operator, we then have

[TABLE]

which implies that $\xi(a_{1},z)\geq\xi(a_{2},z)$ . This is a contradiction. Hence, $a\mapsto a-\xi(a,z)$ is increasing, and $T$ is a self-map on $\mathscr{C}_{0}$ . ∎

Proof of Proposition 2.3.

Since $T$ maps elements of the closed subset $\mathscr{C}_{0}$ into itself by Lemma B.6, Theorem 2.2 implies that $c^{*}\in\mathscr{C}_{0}$ . Hence, the stated claims hold. ∎

Proof of Proposition 2.4.

Let $T_{j}$ be the time iteration operator for the income process $j$ established in Proposition B.4. It suffices to show $T_{1}c\leq T_{2}c$ for all $c\in\mathscr{C}$ . To see this, note that by Lemma B.4, we have $T_{j}c_{1}\leq T_{j}c_{2}$ whenever $c_{1}\leq c_{2}$ . Therefore if $T_{1}c\leq T_{2}c$ for all $c\in\mathscr{C}$ , we obtain $T_{1}c_{1}\leq T_{1}c_{2}\leq T_{2}c_{2}$ . Iterating this starting from any $c\in\mathscr{C}$ , by Theorem 2.2, it follows that $c_{1}^{*}=\lim_{n\to\infty}(T_{1})^{n}c\leq\lim_{n\to\infty}(T_{2})^{n}c=c_{2}^{*}$ , completing the proof.

To show that $T_{1}c\leq T_{2}c$ for any $c\in\mathscr{C}$ , take any $(a,z)\in\SS_{0}$ and define $\xi_{j}=(T_{j}c)(a,z)$ . To show $\xi_{1}\leq\xi_{2}$ , suppose on the contrary that $\xi_{1}>\xi_{2}$ . Since $c$ is increasing in $a$ and $u^{\prime\prime}<0$ (hence $u^{\prime}$ is decreasing), it follows from the definition of the time iteration operator in (14)–(16), $Y_{1}\leq Y_{2}$ , $u^{\prime\prime}<0$ and the monotonicity of $c\in\mathscr{C}$ that

[TABLE]

which is a contradiction. ∎

To prove Proposition 2.5, we need several lemmas.

Lemma B.7.

For all $c\in\mathscr{C}_{0}$ , there exists a threshold $\bar{a}_{c}(z)$ such that $Tc(a,z)=a$ if and only if $a\leq\bar{a}_{c}(z)$ . In particular, there exists a threshold $\bar{a}(z)$ such that $c^{*}(a,z)=a$ if and only if $a\leq\bar{a}(z)$ .

Proof.

Recall that, for all $c\in\mathscr{C}_{0}$ , $\xi(a,z):=Tc(a,z)$ solves

[TABLE]

For each $z\in\mathsf{Z}$ and $c\in\mathscr{C}_{0}$ , define

[TABLE]

To prove the first claim, by Lemma B.6, it suffices to show that $\xi(a,z)<a$ implies $a>\bar{a}_{c}(z)$ . This obviously holds since in view of (44), the former implies that

[TABLE]

which then yields $a>\bar{a}_{c}(z)$ . The second claim follows immediately from the first claim and the fact that $c^{*}\in\mathscr{C}_{0}$ is the unique fixed point of $T$ in $\mathscr{C}$ . ∎

Consider a subset $\mathscr{C}_{1}$ defined by $\mathscr{C}_{1}:=\left\{c\in\mathscr{C}_{0}\colon a\mapsto c(a,z)\text{ is concave for all }z\in\mathsf{Z}\right\}$ .

Lemma B.8.

$\mathscr{C}_{1}$ * is a closed subset of $\mathscr{C}_{0}$ and $\mathscr{C}$ , and, $Tc\in\mathscr{C}_{1}$ for all $c\in\mathscr{C}_{1}$ .*

Proof.

The first claim is immediate because limits of concave functions are concave. To prove the second claim, fix $c\in\mathscr{C}_{1}$ . We have $Tc\in\mathscr{C}_{0}$ by Lemma B.6. It remains to show that $a\mapsto\xi(a,z):=Tc(a,z)$ is concave for all $z\in\mathsf{Z}$ . Given $z\in\mathsf{Z}$ , Lemma B.7 implies that $\xi(a,z)=a$ for $a\leq\bar{a}_{c}(z)$ and that $\xi(a,z)<a$ for $a>\bar{a}_{c}(z)$ . Since in addition $a\mapsto\xi(a,z)$ is continuous and increasing, to show the concavity of $\xi$ with respect to $a$ , it suffices to show that $a\mapsto\xi(a,z)$ is concave on $(\bar{a}_{c}(z),\infty)$ .

Suppose there exist some $z\in\mathsf{Z}$ , $\alpha\in[0,1]$ , and $a_{1},a_{2}\in(\bar{a}_{c}(z),\infty)$ such that

[TABLE]

Let $h(a,z,\hat{\omega}):=\hat{R}\left[a-\xi(a,z)\right]+\hat{Y}$ , where $\hat{\omega}:=(\hat{R},\hat{Y})$ . Then by Lemma B.7 and noting that consumption is interior, we have

[TABLE]

Using condition (17) then yields

[TABLE]

which contradicts (46). Hence, $a\mapsto\xi(a,z)$ is concave for all $z\in\mathsf{Z}$ . ∎

Proof of Proposition 2.5.

By Theorem 2.2, $T\colon\mathscr{C}\to\mathscr{C}$ is a contraction mapping with unique fixed point $c^{*}$ . Since $\mathscr{C}_{1}$ is a closed subset of $\mathscr{C}$ and $T\mathscr{C}_{1}\subset\mathscr{C}_{1}$ by Lemma B.8, we know that $c^{*}\in\mathscr{C}_{1}$ . The first claim is verified. Regarding the second claim, note that $c^{*}\in\mathscr{C}_{1}$ implies that $a\mapsto c^{*}(a,z)$ is increasing and concave for all $z\in\mathsf{Z}$ . Hence, $a\mapsto c^{*}(a,z)/a$ is a decreasing function for all $z\in\mathsf{Z}$ . Since $0\leq c^{*}(a,z)\leq a$ for all $(a,z)\in\SS_{0}$ , $\alpha(z):=\lim_{a\to\infty}c^{*}(a,z)/a$ is well-defined and $\alpha(z)\in[0,1]$ . ∎

Proof of Remark 2.1.

For each $c$ in $\mathscr{C}$ concave in its first argument, let $h_{c}(x,\hat{\omega}):=c(\hat{R}x+\hat{Y},\hat{z})$ , where $\hat{\omega}:=(\hat{R},\hat{Y},\hat{z})$ . Then $x\mapsto h_{c}(x,\hat{\omega})$ is concave. Based on the generalized Minkowski’s inequality (see, e.g., Hardy et al. (1952), page 146, theorem 198), we have

[TABLE]

Since $u^{\prime}(c)=c^{-\gamma}$ , the above inequality implies that condition (17) holds. ∎

To prove Proposition 2.6, let $\bar{s}$ be as in (19) and define

[TABLE]

Lemma B.9.

$\mathscr{C}_{2}$ * is a closed subset of $\mathscr{C}$ , and $Tc\in\mathscr{C}_{2}$ for all $c\in\mathscr{C}_{2}$ .*

Proof.

To see that $\mathscr{C}_{2}$ is closed, for a given sequence $\{c_{n}\}$ in $\mathscr{C}_{2}$ and $c\in\mathscr{C}$ with $\rho(c_{n},c)\to 0$ , we need to verify that $c\in\mathscr{C}_{2}$ . This obviously holds since $c_{n}(a,z)/a\geq 1-\bar{s}$ for all $n$ and $(a,z)\in\SS_{0}$ , and, on the other hand, $\rho(c_{n},c)\to 0$ implies that $c_{n}(a,z)\to c(a,z)$ for all $(a,z)\in\SS_{0}$ .

We next show that $T$ is a self-map on $\mathscr{C}_{2}$ . Fix $c\in\mathscr{C}_{2}$ . We have $Tc\in\mathscr{C}$ since $T$ is a self-map on $\mathscr{C}$ . It remains to show that $\xi:=Tc$ satisfies $\xi(a,z)\geq(1-\bar{s})a$ for all $(a,z)\in\SS_{0}$ . Suppose $\xi(a,z)<(1-\bar{s})a$ for some $(a,z)\in\SS_{0}$ . Then

[TABLE]

Since $u^{\prime}((1-\bar{s})a)>u^{\prime}(a)$ and $c\in\mathscr{C}_{2}$ , this implies that

[TABLE]

which contradicts (19) since $((1-\bar{s})a,z)\in\SS_{0}$ . As a result, $\xi(a,z)\geq(1-\bar{s})a$ for all $(a,z)\in\SS_{0}$ and we conclude that $Tc\in\mathscr{C}_{2}$ . ∎

Proof of Proposition 2.6.

We have shown in Theorem 2.2 that $T$ is a contraction mapping on the complete metric space $(\mathscr{C},\rho)$ , with unique fixed point $c^{*}$ . Since in addition $\mathscr{C}_{2}$ is a closed subset of $\mathscr{C}$ and $T\mathscr{C}_{2}\subset\mathscr{C}_{2}$ by Lemma B.9, we know that $c^{*}\in\mathscr{C}_{2}$ . The stated claim is verified. ∎

Appendix C Proof of Section 3 Results

As before, Assumptions 2.1–2.3 are in force. Notice that Assumption 2.2, Assumption 3.1 and Lemma A.1 imply existence of an $n$ in $\mathbbm{N}$ such that

[TABLE]

Lemma C.1.

For all $(a,z)\in\SS$ , we have $\sup_{t\geq 0}\mathbbm{E}_{a,z}\,a_{t}<\infty$ .

Proof.

Since $c^{*}(0,z)=0$ , Proposition 2.6 implies that $c^{*}(a,z)\geq(1-\bar{s})a$ for all $(a,z)\in\SS$ . For all $t\geq 1$ , we have $t=kn+j$ in general, where the integers $k\geq 0$ and $j\in\{0,1,\dots,n-1\}$ . Using these facts and (2.1), we have:

[TABLE]

with probability one. Taking expectations of the above while noting that $M_{0}:=\max_{1\leq\ell\leq n,\,z\in\mathsf{Z}}\mathbbm{E}_{z}\prod_{t=1}^{\ell}R_{t}<\infty$ by Assumption 3.1 and Lemma A.1, we have

[TABLE]

or all $(a,z)\in\SS$ and $t\geq 0$ . Here we have used $M_{3}$ in Lemma B.1 and the Markov property. Hence, $\sup_{t\geq 0}\mathbbm{E}_{a,z}\,a_{t}<\infty$ for all $(a,z)\in\SS$ , as was claimed. ∎

A function $w^{*}\colon\SS\to\mathbbm{R}_{+}$ is called norm-like if all its sublevel sets (i.e., sets of the form $\{x\in\SS\colon w(x)\leq b\},b\in\mathbbm{R}_{+}$ ) are precompact in $\SS$ (i.e., any sequence in a given sublevel set has a subsequence that converges to a point of $\SS$ ).

Proof of Theorem 3.1.

Based on Lemma D.5.3 of Meyn and Tweedie (2009), a stochastic kernel $Q$ is bounded in probability if and only if for all $x\in\SS$ , there exists a norm-like function $w_{x}^{*}\colon\SS\to\mathbbm{R}_{+}$ such that the $(Q,x)$ -Markov process $\{X_{t}\}_{t\geq 0}$ satisfies $\limsup_{t\to\infty}\mathbbm{E}_{x}\left[w_{x}^{*}(X_{t})\right]<\infty$ . Fix $(a,z)\in\SS$ . Since $\mathsf{Z}$ is finite, $P$ is bounded in probability. Hence, there exists a norm-like function $w\colon\mathsf{Z}\to\mathbbm{R}_{+}$ such that $\limsup_{t\to\infty}\mathbbm{E}_{z}w(Z_{t})<\infty$ . Then $w^{*}\colon\SS\to\mathbbm{R}_{+}$ defined by $w^{*}(a_{0},Z_{0}):=a_{0}+w(Z_{0})$ is a norm-like function on $\SS$ . The stochastic kernel $Q$ is then bounded in probability since Lemma C.1 implies that $\limsup_{t\to\infty}\mathbbm{E}_{a,z}\,w^{*}(a_{t},Z_{t})\leq\sup_{t\geq 0}\mathbbm{E}_{a,z}\,a_{t}+\limsup_{t\to\infty}\mathbbm{E}_{z}\,w(Z_{t})<\infty$ . Regarding existence of stationary distribution, since $P$ is Feller (due to the finiteness of $\mathsf{Z}$ ), whenever $z_{n}\to z$ , the product measure satisfies

[TABLE]

Since in addition $c^{*}$ is continuous, a simple application of the generalized Fatou’s lemma of Feinberg et al. (2014) (Theorem 1.1) shows that the stochastic kernel $Q$ is Feller. Moreover, since $Q$ is bounded in probability, based on the Krylov-Bogolubov theorem (see, e.g., Meyn and Tweedie (2009), Proposition 12.1.3 and Lemma D.5.3), $Q$ admits at least one stationary distribution. ∎

Lemma C.2.

The borrowing constraint binds in finite time with positive probability. That is, for all $(a,z)\in\SS$ , we have $\mathbbm{P}_{a,z}\left(\cup_{t\geq 0}\{c_{t}=a_{t}\}\right)>0$ .

Proof.

The claim holds trivially when $a=0$ . Suppose the claim does not hold on $\SS_{0}$ (recall that $\SS_{0}=\SS\backslash\{0\}$ ), then $\mathbbm{P}_{a,z}\left(\cap_{t\geq 0}\{c_{t}<a_{t}\}\right)=1$ for some $(a,z)\in\SS_{0}$ , i.e., the borrowing constraint never binds with probability one. Hence,

[TABLE]

for all $t\geq 0$ . Then we have

[TABLE]

for all $t\geq 1$ . Let $n$ and $\theta$ be defined by (48). Let $t=kn+1$ . Based on the Markov property and Lemma B.1, as $k\to\infty$ ,

[TABLE]

Similarly, as $k\to\infty$ ,

[TABLE]

Letting $k\to\infty$ . (C) then implies that $\left(u^{\prime}\circ c^{*}\right)(a,z)\leq 0$ , contradicted with the fact that $u^{\prime}>0$ . Thus, we must have $\mathbbm{P}_{a,z}\left(\cup_{t\geq 0}\{c_{t}=a_{t}\}\right)>0$ for all $(a,z)\in\SS$ . ∎

Our next goal is to prove Theorem 3.2. In proofs we apply the theory of Meyn and Tweedie (2009). Important definitions (their information in the textbook) include: $\psi$ -irreducibility (Section 4.2), small set (page 102), strong aperiodicity (page 114), petite set (page 117), Harris chain (page 199), and positivity (page 230).

Recall that $\mathbbm{R}^{m}$ paired with its Euclidean topology is a second countable topological space (i.e., its topology has a countable base). Since $\mathbbm{R}_{+}$ and $\mathsf{Z}$ are respectively Borel subsets of $\mathbbm{R}$ and $\mathbbm{R}^{m}$ paired with the relative topologies, they are also second countable. Hence, $\SS:=\mathbbm{R}_{+}\times\mathsf{Z}$ satisfies $\mathscr{B}(\SS)=\mathscr{B}(\mathbbm{R}_{+})\otimes\mathscr{B}(\mathsf{Z})$ (see, e.g., page 149, Theorem 4.44 of Aliprantis and Border (2006)). Recall (22). With slight abuse of notation, in proofs, we use $f$ to denote the density of $\{Y_{t}\}$ in both cases (Y1) and (Y2) and write $\mathop{}\!\mathrm{d}y=\nu(\mathop{}\!\mathrm{d}y)$ , where $\nu$ is the related measure. Specifically, $\nu$ is the Lebesgue measure when (Y2) holds. Moreover, Let $\vartheta$ be the counting measure.

Recall $\bar{z}\in\mathsf{Z}$ and the greatest lower bound $y_{\ell}\geq 0$ of the support of $\{Y_{t}\}$ given by Assumption 3.2. Let $\bar{p}:=P(\bar{z},\bar{z})$ . Then $\bar{p}>0$ by Assumption 3.2.

Lemma C.3.

$\mathbbm{P}_{(a,\bar{z})}\left\{\cup_{t\geq 0}\left[\{c_{t}=a_{t}\}\cap\left(\cap_{i=0}^{t}\{Z_{i}=\bar{z}\}\right)\right]\right\}>0$ * for all $a\in(0,\infty)$ .*

Proof.

Fix $a\in(0,\infty)$ . If $a\leq\bar{a}(\bar{z})$ , the claim holds trivially by Lemma B.7. Now consider the case $a>\bar{a}(\bar{z})$ . Suppose $\mathbbm{P}_{(a,\bar{z})}\left\{\cup_{t\geq 0}\left[\{c_{t}=a_{t}\}\cap\left(\cap_{i=0}^{t}\{Z_{i}=\bar{z}\}\right)\right]\right\}=0$ . Then, based on the De Morgan’s law, we have

[TABLE]

Note that the set $\triangle(t):=\left(\cap_{i=0}^{t}\{c_{i}<a_{i}\}\right)\cup\left(\cup_{i=0}^{t}\{Z_{i}\neq\bar{z}\}\right)$ can be written as

[TABLE]

Assumption 3.2 then implies that, for all $t\geq 0$ ,

[TABLE]

Let $n$ and $\theta$ be defined by (48) and let $t=kn+1$ . Similar to the proof of Lemma B.7, we can show that, with probability $\bar{p}^{t}>0$ ,

[TABLE]

for some constant $M\in\mathbbm{R}_{+}$ . Since $\theta\in(0,1)$ and $(u^{\prime}\circ c^{*})(a,\bar{z})>0$ , Lemma B.1 implies that there exists $N\in\mathbbm{N}$ such that

[TABLE]

As a result, we have $(u^{\prime}\circ c^{*})(a,\bar{z})<(u^{\prime}\circ c^{*})(a,\bar{z})$ with probability $\bar{p}^{Nn+1}>0$ . This is a contradiction. Hence the stated claim is verified. ∎

Let $F(\mathop{}\!\mathrm{d}a_{t+1}\mid a_{t},Z_{t},Z_{t+1})$ be defined such that $\mathbbm{P}\{a_{t+1}\in A\mid(a_{t},Z_{t},Z_{t+1})=(a,z,z^{\prime})\}=\int\mathbbm{1}\{a^{\prime}\in A\}F(\mathop{}\!\mathrm{d}a^{\prime}\mid a,z,z^{\prime})$ at $A\in\mathscr{B}(\mathbbm{R}_{+})$ .

Lemma C.4.

Let $h:\SS\to\mathbbm{R}_{+}$ be an integrable map such that $a\mapsto h(a,z)$ is decreasing for all $z\in\mathsf{Z}$ . Then, for all $t\in\mathbbm{N}$ and $z\in\mathsf{Z}$ , the map $a\mapsto\ell(a,z,t):=\int h(a^{\prime},z^{\prime})Q^{t}((a,z),\mathop{}\!\mathrm{d}(a^{\prime},z^{\prime}))$ is decreasing.

Proof.

Fix $z\in\mathsf{Z}$ . When $t=1$ , (21a) implies that

[TABLE]

Since $a\mapsto h(a,z)$ is decreasing, and by Proposition 2.3 and (21a), the optimal asset accumulation path $a_{t+1}$ is increasing in $a_{t}$ with probability one, we know that $a\mapsto\int h(a^{\prime},z^{\prime})F(\mathop{}\!\mathrm{d}a^{\prime}\mid a,z,z^{\prime})$ is decreasing for all $z^{\prime}\in\mathsf{Z}$ . Thus, $a\mapsto\ell(a,z,1)$ is decreasing. The claim holds for $t=1$ . Suppose this claim holds for arbitrary $t$ , it remains to show that it holds for $t+1$ . Note that

[TABLE]

Since $a^{\prime}\mapsto\ell(a^{\prime},z^{\prime},t)$ is decreasing for all $z^{\prime}\in\mathsf{Z}$ , based on the induction argument, $a\mapsto\ell(a,z,t+1)$ is decreasing. The stated claim then follows. ∎

Lemma C.5.

The Markov process $\{(a_{t},Z_{t})\}_{t\geq 0}$ is $\psi$ -irreducible.

Proof.

Recall $\delta>y_{\ell}$ given by Assumption 3.2. Let $\mathsf{D}\in\mathscr{B}(\SS)$ be defined by $\mathsf{D}:=\{y_{\ell}\}\times\{\bar{z}\}$ if (Y1) holds and $\mathsf{D}:=(y_{\ell},\delta)\times\{\bar{z}\}$ if (Y2) holds. We define the measure $\varphi$ on $\mathscr{B}(\SS)$ by $\varphi(A):=(\nu\times\vartheta)(A\cap\mathsf{D})$ for $A\in\mathscr{B}(\SS)$ . Clearly $\varphi$ is a nontrivial measure. In particular, $\vartheta(\{\bar{z}\})=1$ as $\vartheta$ is the counting measure. Moreover, since $y_{\ell}$ is the greatest lower bound of the support of $\{Y_{t}\}$ , it must be the case that $\nu(\{y_{\ell}\})>0$ if (Y1) holds and that $\nu((y_{\ell},\delta))>0$ if (Y2) holds. As a result, $\varphi(\SS)=\nu(\{y_{\ell}\})\times\vartheta(\{\bar{z}\})>0$ when (Y1) holds and $\varphi(\SS)=\nu((y_{\ell},\delta))\times\vartheta(\{\bar{z}\})>0$ when (Y2) holds.

We first show that $\{(a_{t},Z_{t})\}$ is $\varphi$ -irreducible. Let $A$ be an element of $\mathscr{B}(\SS)$ such that $\varphi(A)>0$ . Fix $(a,z)\in\SS$ . We need to show that $\{(a_{t},Z_{t})\}$ visits set $A$ in finite time with positive probability.

Since $\{z_{t}\}$ is irreducible, $\mathbbm{P}_{z}\{Z_{N_{0}}=\bar{z}\}>0$ for some integer $N_{0}\geq 0$ . By Lemma C.1, there exists $\tilde{a}<\infty$ such that $\mathbbm{P}_{(a,z)}\{a_{N_{0}}<\tilde{a},Z_{N_{0}}=\bar{z}\}>0$ . By Lemma C.3, there exists $T\in\mathbbm{N}$ such that $\mathbbm{P}_{(\tilde{a},\bar{z})}\left\{c_{T}=a_{T},\,Z_{T}=\bar{z}\right\}\geq\mathbbm{P}_{(\tilde{a},\bar{z})}\left\{c_{T}=a_{T},\,\cap_{i=0}^{T}\{Z_{i}=\bar{z}\}\right\}>0$ . Lemma B.7 and Lemma C.4 then imply that $\mathbbm{P}_{(a^{\prime},\bar{z})}\left\{c_{T}=a_{T},\,Z_{T}=\bar{z}\right\}>0$ for all $a^{\prime}\in(0,\tilde{a})$ . Hence, for $N:=N_{0}+T$ and $E:=\left\{c_{N}=a_{N},\,Z_{N}=\bar{z}\right\}$ , we have

[TABLE]

based on the Markov property. By (21a), we have

[TABLE]

Note that, by Assumption 3.2, $f(y^{\prime\prime}\mid z^{\prime\prime})P(\bar{z},z^{\prime\prime})>0$ whenever $(y^{\prime\prime},z^{\prime\prime})\in\mathsf{D}$ . Since in addition $\varphi(A)=(\nu\times\vartheta)(A\cap\mathsf{D})>0$ , we have

[TABLE]

Let $\triangle:=\mathbbm{P}_{(a,z)}\{(a_{N+1},Z_{N+1})\in A\}$ . Then (50) and (C) imply that

[TABLE]

Therefore, we have shown that any measurable subset with positive $\varphi$ measure can be reached in finite time with positive probability, i.e., $\{(a_{t},Z_{t})\}$ is $\varphi$ -irreducible. Based on Proposition 4.2.2 of Meyn and Tweedie (2009), there exists a maximal probability measure $\psi$ on $\mathscr{B}(\SS)$ such that $\{(a_{t},Z_{t})\}$ is $\psi$ -irreducible. ∎

Lemma C.6.

Let the function $\bar{a}$ be defined as in (45). Then $\bar{a}(\bar{z})\geq y_{\ell}$ if (Y1) holds, while $\bar{a}(\bar{z})>y_{\ell}$ if (Y2) holds.

Proof.

Suppose (Y1) holds and $\bar{a}(\bar{z})<y_{\ell}$ . Then, by Lemma B.7, for all $t\in\mathbbm{N}$ ,

[TABLE]

Hence, for all $a\in(0,\infty)$ and $t\in\mathbbm{N}$ ,

[TABLE]

where the last equality follows from (21a), which implies that $a_{t}\geq Y_{t}\geq y_{\ell}$ with probability one. This is contradicted with Lemma C.3.

Suppose (Y2) holds and $\bar{a}(\bar{z})\leq y_{\ell}$ . By definition, $\mathbbm{P}_{z}\{Y_{t}\leq y_{\ell}\}=0$ for all $z\in\mathsf{Z}$ and $t\in\mathbbm{N}$ . Since $a_{t}\geq Y_{t}$ with probability one, we have $\mathbbm{P}_{(a,z)}\{a_{t}\leq y_{\ell}\}=0$ for all $(a,z)\in\SS$ and $t\in\mathbbm{N}$ . Via similar analysis to (C), Lemma B.7 implies that $\left[\{c_{t}=a_{t}\}\cap\left(\cap_{i=0}^{t}\{Z_{i}=\bar{z}\}\right)\right]\,\subset\,\{a_{t}\leq y_{\ell}\}$ for all $t\in\mathbbm{N}$ . Hence, for all $a\in(0,1)$ and $t\in\mathbbm{N}$ , we have $\mathbbm{P}_{(a,\bar{z})}\left[\{c_{t}=a_{t}\}\cap\left(\cap_{i=0}^{t}\{Z_{i}=\bar{z}\}\right)\right]\leq\mathbbm{P}_{(a,\bar{z})}\{a_{t}\leq y_{\ell}\}=0$ . Again, this contradicts Lemma C.3. ∎

Lemma C.7.

The Markov process $\{(a_{t},Z_{t})\}_{t\geq 0}$ is strongly aperiodic.

Proof.

By the definition of strong aperiodicity, we need to show that there exists a $v_{1}$ -small set $\mathsf{D}_{1}$ with $v_{1}(\mathsf{D}_{1})>0$ , i.e., there exists a nontrivial measure $v_{1}$ on $\mathscr{B}(\SS)$ and a subset $\mathsf{D}_{1}\in\mathscr{B}(\SS)$ such that $v_{1}(\mathsf{D}_{1})>0$ and

[TABLE]

For $\delta>0$ given by Assumption 3.2, let $\mathsf{C}:=\left(y_{\ell},\min\left\{\delta,\,\bar{a}(\bar{z})\right\}\right)$ and let $\mathsf{D}_{1}:=\{y_{\ell}\}\times\{\bar{z}\}$ if (Y1) holds and $\mathsf{D}_{1}:=\mathsf{C}\times\{\bar{z}\}$ if (Y2) holds. We now show that $\mathsf{D}_{1}$ satisfies the above conditions. Define $r(a^{\prime},z^{\prime}):=f(a^{\prime}\,|\,z^{\prime})P(\bar{z},z^{\prime})$ and note that $r(a^{\prime},z^{\prime})>0$ on $\mathsf{D}_{1}$ . Define the measure $v_{1}$ on $\mathscr{B}(\SS)$ by $v_{1}(A):=\int_{A}r(a^{\prime},z^{\prime})(\nu\times\vartheta)[\mathop{}\!\mathrm{d}(a^{\prime},z^{\prime})]$ . If (Y1) holds, then $\nu(\{y_{\ell}\})>0$ as shown above, and, if (Y2) holds, Lemma C.6 implies that $\nu(\mathsf{C})>0$ . Since in addition $\vartheta(\{\bar{z}\})>0$ , it always holds that $(\nu\times\vartheta)(\mathsf{D}_{1})>0$ . Moreover, since $r(a^{\prime},z^{\prime})>0$ on $\mathsf{D}_{1}$ , we have $v_{1}(\mathsf{D}_{1})>0$ and $v_{1}$ is a nontrivial measure.

For all $(a,z)\in\mathsf{D}_{1}$ and $A\in\mathscr{B}(S)$ , Lemma B.7 implies that

[TABLE]

Hence, $\mathsf{D}_{1}$ satisfies (53) and $\{(a_{t},Z_{t})\}_{t\geq 0}$ is strongly aperiodic. ∎

Lemma C.8.

The set $[0,d]\times\mathsf{Z}$ is a petite set for all $d\in\mathbbm{R}_{+}$ .

Proof.

Fix $d\in(0,\infty)$ and $z\in\mathsf{Z}$ . Let $B:=[0,d]\times\{z\}$ . By Lemma C.3,

[TABLE]

We start by showing that there exists a nontrivial measure $v_{N}$ on $\mathscr{B}(\SS)$ such that

[TABLE]

In other words, $B$ is a $v_{N}$ -small set. Fix $A\in\mathscr{B}(\SS)$ . For all $z^{\prime}\in\mathsf{Z}$ , define

[TABLE]

Note that for all $(a,z)\in B$ , Lemma B.7 implies that

[TABLE]

Since $a^{\prime}\mapsto m(z^{\prime})\mathbbm{1}\{a^{\prime}\leq\bar{a}(z^{\prime}),z^{\prime}=\bar{z}\}$ is decreasing for all $z^{\prime}\in\mathsf{Z}$ , by Lemma C.4,

[TABLE]

Note that $v_{N}$ is a nontrivial measure on $\mathscr{B}(\SS)$ since (54) implies that $v_{N}(\SS)>0$ . Furthermore, since $(a,z)$ is chosen arbitrarily, the above inequality implies that (55) holds. We have shown that $B$ is a $v_{N}$ -small set, and hence a petite set. Since finite union of petite sets is petite for $\psi$ -irreducible chains (see, e.g., Proposition 5.5.5 of Meyn and Tweedie (2009)), the set $[0,d]\times\mathsf{Z}$ must also be petite. ∎

Recall $\bar{s}\in[0,1)$ in Assumption 3.1, $n\in\mathbbm{N}$ and $\gamma\in(0,1)$ in (48). Let $B:=[0,d]\times\mathsf{Z}$ .

Lemma C.9.

There exist constants $b\in\mathbbm{R}_{+}$ , $\rho\in(0,1)$ and a measurable map $V\colon\SS\to[n/\rho,\infty)$ that is bounded on $B$ , such that, for sufficiently large $d\in\mathbbm{R}_{+}$ and all $(a,z)\in\SS$ , we have $\mathbbm{E}_{a,z}V(a_{n},Z_{n})-V(a,z)\leq-\rho V(a,z)+b\mathbbm{1}\{(a,z)\in B\}$ .

Proof.

Since $c^{*}(a,z)\geq(1-\bar{s})a$ by Proposition 2.6 and $M_{0}:=\max_{z\in\mathsf{Z}}\mathbbm{E}_{z}\hat{R}<\infty$ by Assumption 3.1 and Lemma A.1, by Lemma B.1 and the Markov property,

[TABLE]

Define $b_{0}:=\sum_{t=1}^{n}{\bar{s}}^{n-t}M_{0}^{n-t}M_{3}$ . Note that $b_{0}<\infty$ . Choose $\rho\in(0,1-\gamma)$ , $m_{V}\geq n/\rho$ and $d\in\mathbbm{R}_{+}$ such that $(1-\gamma-\rho)d\geq b_{0}+\rho m_{V}$ . Then, for $V(a,z):=a+m_{V}$ ,

[TABLE]

In particular, if $(a,z)\notin B$ , then $a>d$ and (C) implies that

[TABLE]

Let $b:=b_{0}+\rho m_{V}$ . Then the stated claim follows from (C)–(57) and the fact that $V$ is bounded on $B$ . ∎

Proof of Theorem 3.2.

Claim (1) can be proved by applying Theorem 19.1.3 (or a combination of Proposition 5.4.5 and Theorem 15.0.1) of Meyn and Tweedie (2009). The required conditions in those theorems have been established by Lemmas C.5, C.7, C.8 and C.9 above. Regarding claim (2), Lemmas C.8 and C.9 imply that $\mathbbm{E}_{a,z}V(a_{n},Z_{n})-V(a,z)\leq-n+b\mathbbm{1}\{(a,z)\in B\}$ for all $(a,z)\in\SS$ , where $B:=[0,d]\times\mathsf{Z}$ is petite. Since in addition $\{(a_{t},Z_{t})\}$ is $\psi$ -irreducible by Lemma C.5, Theorem 19.1.2 of Meyn and Tweedie (2009) implies that $\{(a_{t},Z_{t})\}$ is a positive Harris chain. Claim (2) then follows from Theorem 17.1.7 of Meyn and Tweedie (2009).

To verify claim (3), since we have shown that $\Phi:=\{(a_{t},Z_{t})\}$ is positive Harris with stationary distribution $\psi_{\infty}$ , based on Theorem 16.1.5 and Theorem 17.5.4 of Meyn and Tweedie (2009), it suffices to show that $Q$ is $V$ -uniformly ergodic. Let $\Phi^{n}$ be the $n$ -skeleton of $\Phi$ (see page 62 of Meyn and Tweedie (2009)). Then $\Phi^{n}$ is $\psi$ -irreducible and aperiodic by Proposition 5.4.5 of Meyn and Tweedie (2009). Theorem 16.0.1 of Meyn and Tweedie (2009) and Lemmas C.8 and C.9 then imply that $\Phi^{n}$ is $V$ -uniformly ergodic, and, there exists $N\in\mathbbm{N}$ such that $|||Q^{nN}-1\otimes\psi_{\infty}|||_{V}<1$ , where $\|\mu\|_{V}:=\sup_{g:|g|\leq V}|\int g\mathop{}\!\mathrm{d}\mu|$ for $\mu\in\mathscr{P}(\SS)$ and, for all $t\in\mathbbm{N}$ ,

[TABLE]

To show that $Q$ is $V$ -uniformly ergodic, by Theorem 16.0.1 of Meyn and Tweedie (2009), it remains to verify: $|||Q^{t}-1\otimes\psi_{\infty}|||_{V}<\infty$ for $t\leq nN$ . This obviously holds since, by the proof of Lemma C.9, there exist $L_{0},L_{1}\in\mathbbm{R}$ such that, for all $t\in\mathbbm{N}$ ,

[TABLE]

Hence, $Q$ is $V$ -uniformly ergodic and claim (3) follows. The proof is now complete. ∎

Proof of Theorem 3.3.

Take an arbitrarily large constant $k<1$ such that

[TABLE]

which is possible by Assumption 3.3 and the definition of $G$ in (25a). For this $k$ , since $\lim_{a\to\infty}c^{*}(a,z)/a=\alpha(z)$ and $\mathsf{Z}$ is a finite set, we can take $\bar{a}>0$ such that

[TABLE]

for all $z\in\mathsf{Z}$ and $a\geq\bar{a}$ . Multiplying both sides by $R(\hat{z},\hat{\zeta})\geq 0$ , it follows from the law of motion (21a), $Y(\hat{z},\hat{\eta})\geq 0$ , and the definition of $G$ in (25a) that for $a\geq\bar{a}$ ,

[TABLE]

Let $\tilde{A}(z,\hat{z},\hat{\zeta}):=kG(z,\hat{z},\hat{\zeta})\mathbbm{1}\{kG(z,\hat{z},\hat{\zeta})>1\}$ . Then for all $z,\hat{z},\hat{\zeta},\hat{\eta}$ and all $a\geq\bar{a}$ ,

[TABLE]

Start the wealth accumulation process $a_{t}$ from $a_{0}\geq\bar{a}$ . Consider the following process:

[TABLE]

where $S_{0}=a_{0}$ . We now show that $a_{t}\geq S_{t}$ with probability one for all $t$ by induction. Since $S_{0}=a_{0}$ , the case $t=0$ is trivial. Suppose the claim holds up to $t$ . Because $a_{t}\geq 0$ and $S_{t}$ remains 0 once it becomes 0, without loss of generality we may assume $S_{0},\dots,S_{t}$ are all positive. Hence $\tilde{A}_{1},\dots,\tilde{A}_{t}>0$ . By the definition of $\tilde{A}$ , we have $\tilde{A}>1$ whenever $\tilde{A}>0$ . Therefore

[TABLE]

Hence applying (58), we get

[TABLE]

Now take any $p\in(0,1)$ and let $T$ be a geometric random variable with mean $1/p$ that is independent of everything. Define

[TABLE]

where $M_{\tilde{A}}(s)$ is as in (24). Since clearly $A\geq\tilde{A}$ and $p>0$ , we have $\lambda>\tilde{\lambda}$ . By Lemma 3.1 of Beare and Toda (2017), $\lambda,\tilde{\lambda}$ are convex, and hence continuous in the interior of their domains. Therefore $\lambda(\kappa)=1$ and $\lambda(s)>1$ for small enough $s>\kappa$ . Hence, for any $\varepsilon>0$ , we can take small enough $p\in(0,1)$ and large enough $k<1$ such that $\tilde{\lambda}(\kappa)<1<\tilde{\lambda}(\kappa+\varepsilon)<\infty$ . By Lemma 3.1 of Beare and Toda (2017), there exists a unique $\tilde{\kappa}\in(\kappa,\kappa+\varepsilon)$ such that $\tilde{\lambda}(\tilde{\kappa})=1$ . Theorem 3.4 of Beare and Toda (2017) then implies that

[TABLE]

for all $(a_{0},z_{0})\in\SS$ . In particular, for any initial $(a_{0},z_{0})\in\SS$ with $a_{0}\geq\bar{a}$ ,

[TABLE]

Now suppose that we draw $a_{0}$ from the ergodic distribution. Then $a_{t}$ has the same distribution as $a_{\infty}$ , and so does $a_{T}$ . Therefore

[TABLE]

If the ergodic distribution of $\{a_{t}\}$ has unbounded support, then $\mathbbm{P}\{a_{0}\geq\bar{a}\}>0$ . As we have seen above, conditional on $a_{0}\geq\bar{a}$ , we have $a_{t}\geq S_{t}$ for all $t$ . Therefore

[TABLE]

by (59), and so (27) follows from (60) and (61). ∎

Bibliography61

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Acemoglu and Robinson (2002) Acemoglu, D. and J. A. Robinson (2002): “The Political Economy of the Kuznets curve,” Review of Development Economics , 6, 183–203.
2Açıkgöz (2018) Açıkgöz, Ö. T. (2018): “On the Existence and Uniqueness of Stationary Equilibrium in Bewley Economies with Production,” Journal of Economic Theory , 173, 18–55.
3Ahn et al. (2018) Ahn, S., G. Kaplan, B. Moll, T. Winberry, and C. Wolf (2018): “When Inequality Matters for Macro and Macro Matters for Inequality,” NBER Macroeconomics Annual , 32, 1–75.
4Aiyagari (1994) Aiyagari, S. R. (1994): “Uninsured Idiosyncratic Risk and Aggregate Saving,” Quarterly Journal of Economics , 109, 659–684.
5Aliprantis and Border (2006) Aliprantis, C. D. and K. C. Border (2006): Infinite Dimensional Analysis: A Hitchhiker’s Guide , Springer.
6Beare and Toda (2017) Beare, B. K. and A. A. Toda (2017): “Geometrically Stopped Markovian Random Growth Processes and Pareto Tails,” Tech. rep., UC San Diego.
7Benhabib and Bisin (2018) Benhabib, J. and A. Bisin (2018): “Skewed Wealth Distributions: Theory and Empirics,” Journal of Economic Literature , 56, 1261–1291.
8Benhabib et al. (2017) Benhabib, J., A. Bisin, and M. Luo (2017): “Earnings Inequality and Other Determinants of Wealth Inequality,” American Economic Review: Papers and Proceedings , 107, 593–597.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Abstract.

1. Introduction

2. The Income Fluctuation Problem and Optimality Results

2.1. Problem Statement

2.2. Key Conditions

Assumption 2.1**.**

Proposition 2.1** (Necessity of the discount condition).**

Assumption 2.2**.**

Assumption 2.3**.**

Example 2.1**.**

2.3. Optimality: Definitions and Fundamental Properties

Theorem 2.1** (Sufficiency of first order and transversality conditions).**

2.4. Existence and Computability of Optimal Consumption

Proposition 2.2** (Sufficiency of first order condition).**

Theorem 2.2** (Existence, uniqueness and computability of optimal policies).**

2.5. Properties of Optimal Consumption

Proposition 2.3** (Monotonicity with respect to wealth).**

Proposition 2.4** (Monotonicity with respect to income).**

Proposition 2.5** (Concavity and asymptotic linearity of consumption function).**

Remark 2.1**.**

Proposition 2.6** (Linear lower bound on consumption).**

Example 2.2**.**

3. Stationarity, Ergodicity, and Tail Behavior

3.1. Stationarity

Assumption 3.1**.**

Example 3.1**.**

Theorem 3.1** (Existence of a stationary distribution).**

3.2. Ergodicity

Assumption 3.2**.**

Theorem 3.2** (Uniqueness, stability, ergodicity and mixing).**

3.3. Tail Behavior

Assumption 3.3**.**

Remark 3.1**.**

Theorem 3.3** (Tail behavior).**

Remark 3.2**.**

Remark 3.3**.**

Example 3.2**.**

4. Testing the Growth Conditions

Lemma 4.1** (Long-run growth rates and spectral radii).**

5. Application: Stochastic Volatility and Mean Persistence

6. Conclusion

Appendix A Preliminaries

Proof of Lemma 4.1.

Lemma A.1**.**

Proof.

Appendix B Proof of Section 2 Results

Proof of Proposition 2.1.

Lemma B.1**.**

Proof.

Lemma B.2**.**

Proof.

Proposition B.1**.**

Proof.

Proof of Thoerem 2.1.

Proposition B.2**.**

Proof.

Proof of Proposition 2.2.

Proposition B.3**.**

Proof.

Proposition B.4**.**

Proof.

Lemma B.3**.**

Proof.

Lemma B.4**.**

Proof.

Lemma B.5**.**

Proof.

Proof of Theorem 2.2.

Lemma B.6**.**

Proof.

Proof of Proposition 2.3.

Proof of Proposition 2.4.

Lemma B.7**.**

Assumption 2.1.

Proposition 2.1 (Necessity of the discount condition).

Assumption 2.2.

Assumption 2.3.

Example 2.1.

Theorem 2.1 (Sufficiency of first order and transversality conditions).

Proposition 2.2 (Sufficiency of first order condition).

Theorem 2.2 (Existence, uniqueness and computability of optimal policies).

Proposition 2.3 (Monotonicity with respect to wealth).

Proposition 2.4 (Monotonicity with respect to income).

Proposition 2.5 (Concavity and asymptotic linearity of consumption function).

Remark 2.1.

Proposition 2.6 (Linear lower bound on consumption).

Example 2.2.

Assumption 3.1.

Example 3.1.

Theorem 3.1 (Existence of a stationary distribution).

Assumption 3.2.

Theorem 3.2 (Uniqueness, stability, ergodicity and mixing).

Assumption 3.3.

Remark 3.1.

Theorem 3.3 (Tail behavior).

Remark 3.2.

Remark 3.3.

Example 3.2.

Lemma 4.1 (Long-run growth rates and spectral radii).

Lemma A.1.

Lemma B.1.

Lemma B.2.

Proposition B.1.

Proposition B.2.

Proposition B.3.

Proposition B.4.

Lemma B.3.

Lemma B.4.

Lemma B.5.

Lemma B.6.

Lemma B.7.

Lemma B.8.

Lemma B.9.

Lemma C.1.

Lemma C.2.

Lemma C.3.

Lemma C.4.

Lemma C.5.

Lemma C.6.

Lemma C.7.

Lemma C.8.

Lemma C.9.