Inferential results for a new measure of inequality

Youri Davydov; Francesca Greselin

arXiv:1706.05576·math.ST·June 20, 2017

Inferential results for a new measure of inequality

Youri Davydov, Francesca Greselin

PDF

TL;DR

This paper introduces a new inequality index tailored to detect significant changes in both tails of income distributions, providing estimators with proven asymptotic properties and applying them to real Italian income data.

Contribution

It proposes a novel inequality measure, develops two estimators, and establishes their asymptotic properties, including consistency and normality, with application to real income data.

Findings

01

Estimators are asymptotically equivalent.

02

The proposed estimator is consistent and asymptotically normal.

03

Application to Italian income data demonstrates practical relevance.

Abstract

In this paper we derive inferential results for a new index of inequality, specifically defined for capturing significant changes observed both in the left and in the right tail of the income distributions. The latter shifts are an apparent fact for many countries like US, Germany, UK, and France in the last decades, and are a concern for many policy makers. We propose two empirical estimators for the index, and show that they are asymptotically equivalent. Afterwards, we adopt one estimator and prove its consistency and asymptotic normality. Finally we introduce an empirical estimator for its variance and provide conditions to show its convergence to the finite theoretical value. An analysis of real data on net income from the Bank of Italy Survey of Income and Wealth is also presented, on the base of the obtained inferential results.

Figures8

Click any figure to enlarge with its caption.

Equations248

μ_{F} = \int_{0}^{\infty} x F (d x) = \int_{0}^{1} F^{- 1} (p) d p .

μ_{F} = \int_{0}^{\infty} x F (d x) = \int_{0}^{1} F^{- 1} (p) d p .

l_{F} (p) = \frac{1}{μ _{F}} \int_{0}^{p} F^{- 1} (t) d t .

l_{F} (p) = \frac{1}{μ _{F}} \int_{0}^{p} F^{- 1} (t) d t .

D_{F} (p) = \frac{m _{F} ( p ) - l _{F} ( p )}{m _{F} ( p )} .

D_{F} (p) = \frac{m _{F} ( p ) - l _{F} ( p )}{m _{F} ( p )} .

D_{F} = \int_{0}^{1} D_{F} (p) d p .

D_{F} = \int_{0}^{1} D_{F} (p) d p .

l_{n} (p) = \frac{1}{\vbox \hrule height=0.5pt X _{n}} \int_{0}^{p} F_{n}^{- 1} (s) d s, and its dual m_{n} (p) = \frac{1}{\vbox \hrule height=0.5pt X _{n}} \int_{1 - p}^{1} F_{n}^{- 1} (s) d s,

l_{n} (p) = \frac{1}{\vbox \hrule height=0.5pt X _{n}} \int_{0}^{p} F_{n}^{- 1} (s) d s, and its dual m_{n} (p) = \frac{1}{\vbox \hrule height=0.5pt X _{n}} \int_{1 - p}^{1} F_{n}^{- 1} (s) d s,

D_{n}

D_{n}

:= 1 - \int_{0}^{1} G_{n} (p) d p

D_{n}

D_{n}

:= 1 - \frac{1}{n} i = 1 \sum n G_{n} (i / n)

D_{n}

D_{n}

n (D_{n} - D) = \frac{1}{n} i = 1 \sum n h (X_{i}) + o_{P} (1),

n (D_{n} - D) = \frac{1}{n} i = 1 \sum n h (X_{i}) + o_{P} (1),

\displaystyle h(X_{i})=\int_{0}^{+\infty}\left[\mathds{1}_{[0,x]}\left(X_{i}\right)-F(x)\right]\omega\big{(}F(x)\big{)}dx

\displaystyle h(X_{i})=\int_{0}^{+\infty}\left[\mathds{1}_{[0,x]}\left(X_{i}\right)-F(x)\right]\omega\big{(}F(x)\big{)}dx

ω_{1} (t) = \int_{t}^{1} \frac{1}{M ( s )} d s and ω_{2} (t) = \int_{0}^{t} \frac{L ( 1 - s )}{[ M ( 1 - s ) ] ^{2}} d s .

ω_{1} (t) = \int_{t}^{1} \frac{1}{M ( s )} d s and ω_{2} (t) = \int_{0}^{t} \frac{L ( 1 - s )}{[ M ( 1 - s ) ] ^{2}} d s .

n (D_{n} - D) n \to \infty ⟹ N (0, σ_{F}^{2}),

n (D_{n} - D) n \to \infty ⟹ N (0, σ_{F}^{2}),

\sigma_{F}^{2}=\int_{0}^{\infty}\left[\int_{0}^{y}F(x)\,\omega\big{(}F(x)\big{)}\,dx\right]\big{(}1-F(y)\big{)}\,\omega\big{(}F(y)\big{)}\,dy.

\sigma_{F}^{2}=\int_{0}^{\infty}\left[\int_{0}^{y}F(x)\,\omega\big{(}F(x)\big{)}\,dx\right]\big{(}1-F(y)\big{)}\,\omega\big{(}F(y)\big{)}\,dy.

n (D_{n} - D)

n (D_{n} - D)

= - n \int_{0}^{1} \frac{L _{n} ( t ) - L ( t )}{M ( t )} d t + n \int_{0}^{1} \frac{L ( t ) [ M _{n} ( t ) - M ( t ) ]}{[ M ( t ) ] ^{2}} d t + R_{n} (t)

R_{n}^{(1)}

R_{n}^{(1)}

R_{n}^{(2)}

\displaystyle V_{n}(p)=\int_{0}^{p}\big{(}F_{n}^{-1}(t)-F^{-1}(t)\big{)}dt+\int_{0}^{F^{-1}(p)}\big{(}F_{n}(x)-F(x)\big{)}dx

\displaystyle V_{n}(p)=\int_{0}^{p}\big{(}F_{n}^{-1}(t)-F^{-1}(t)\big{)}dt+\int_{0}^{F^{-1}(p)}\big{(}F_{n}(x)-F(x)\big{)}dx

\displaystyle V^{*}_{n}(p)=\int_{p}^{1}\big{(}F_{n}^{-1}(t)-F^{-1}(t)\big{)}dt+\int_{F^{-1}(p)}^{\infty}\big{(}F_{n}(x)-F(x)\big{)}dx

\displaystyle V^{*}_{n}(p)=\int_{p}^{1}\big{(}F_{n}^{-1}(t)-F^{-1}(t)\big{)}dt+\int_{F^{-1}(p)}^{\infty}\big{(}F_{n}(x)-F(x)\big{)}dx

n V_{n} (p) \leq ∣ e_{n} (p) ∣∣ F_{n}^{- 1} (p) - F^{- 1} (p) ∣,

n V_{n} (p) \leq ∣ e_{n} (p) ∣∣ F_{n}^{- 1} (p) - F^{- 1} (p) ∣,

-\sqrt{n}\displaystyle{\int_{0}^{1}\frac{L_{n}(t)-L(t)}{M(t)}\,\,dt=\sqrt{n}\int_{0}^{1}\frac{1}{M(t)}\left[\int_{0}^{F^{-1}(t)}\big{(}F_{n}(x)-F(x)\big{)}dx\right]dt+\mathcal{O}(R_{n}^{(3)})}

-\sqrt{n}\displaystyle{\int_{0}^{1}\frac{L_{n}(t)-L(t)}{M(t)}\,\,dt=\sqrt{n}\int_{0}^{1}\frac{1}{M(t)}\left[\int_{0}^{F^{-1}(t)}\big{(}F_{n}(x)-F(x)\big{)}dx\right]dt+\mathcal{O}(R_{n}^{(3)})}

R_{n}^{(3)}

R_{n}^{(3)}

\leq \int_{0}^{1} \frac{1}{M ( t )} ∣ e_{n} (t) ∣ F_{n}^{- 1} (t) - F^{- 1} (t) d t .

n \int_{0}^{1}

n \int_{0}^{1}

\displaystyle=-\sqrt{n}\int_{0}^{1}\frac{L(t)}{\left[M(t)\right]^{2}}\left[\int_{F^{-1}(1-t)}^{+\infty}\big{(}F_{n}(x)-F(x)\big{)}dx\right]dt+\mathcal{O}(R_{n}^{4}),

R_{n}^{(4)}

R_{n}^{(4)}

\leq \int_{0}^{1} ∣ e_{n} (t) ∣ F_{n}^{- 1} (t) - F^{- 1} (t) \frac{L ( t )}{[ M ( t ) ] ^{2}} d t .

n (D_{n} - D)

n (D_{n} - D)

\displaystyle\displaystyle{-\sqrt{n}\int_{0}^{1}\frac{L(t)}{\left[M(t)\right]^{2}}\left[\int_{F^{-1}(1-t)}^{+\infty}\big{(}F_{n}(x)-F(x)\big{)}dx\right]dt+o_{\textbf{P}}(1).}

n \int_{0}^{1} \frac{1}{M ( t )}

n \int_{0}^{1} \frac{1}{M ( t )}

\displaystyle=\displaystyle{\sqrt{n}\int_{0}^{1}\left(\int_{0}^{+\infty}\big{(}F_{n}(x)-F(x)\big{)}\mathds{1}_{[0,F^{-1}(t)]}(x)dx\right)\frac{1}{M(t)}dt}

\displaystyle=\displaystyle{\sqrt{n}\int_{0}^{+\infty}\left(\int_{0}^{1}\big{(}F_{n}(x)-F(x)\big{)}\mathds{1}_{[F(x),1]}(t)\frac{1}{M(t)}dt\right)dx}

\displaystyle=\displaystyle{\sqrt{n}\int_{0}^{+\infty}\big{(}F_{n}(x)-F(x)\big{)}\left(\int_{F(x)}^{1}\frac{1}{M(t)}dt\right)dx}

= \frac{1}{n} i = 1 \sum n h_{1} (X_{i})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Inferential results

for a new measure of inequality

Youri Davydov ${}^{\textrm{a}}$ , Francesca Greselin ${}^{\textrm{b}}$

${}^{\textrm{a}}$ *Laboratoire Paul Painlevé, Université des Sciences et Technologies de Lille (Lille 1), Lille, France

and Saint Petersburg State University, Saint Petersburg, Russia*

${}^{\textrm{b}}$ Dipartimento di Statistica e Metodi Quantitativi, Università di Milano Bicocca, Milan, Italy

Abstract. In this paper we derive inferential results for a new index of inequality, specifically defined for capturing significant changes observed both in the left and in the right tail of the income distributions. The latter shifts are an apparent fact for many countries like US, Germany, UK, and France in the last decades, and are a concern for many policy makers. We propose two empirical estimators for the index, and show that they are asymptotically equivalent. Afterwards, we adopt one estimator and prove its consistency and asymptotic normality. Finally we introduce an empirical estimator for its variance and provide conditions to show its convergence to the finite theoretical value. An analysis of real data on net income from the Bank of Italy Survey of Income and Wealth is also presented, on the base of the obtained inferential results.

Keywords and phrases: Income inequality, Lorenz curve, Gini Index, consistency, asymptotic normality, economic inequality, confidence interval, nonparametric estimator.

1 Introduction

In view of measuring economic inequality in a society, suppose that we are interested, for instance, in incomes. Let $X$ be an ’income’ random variable with non negatively supported cdf $F(x)$ .

Next, define $Q(p)=F^{-1}(p)=\inf\{x\,:\,F(x)\geq p,\;\;p\in[0,1]\}$ as the $p$ -th quantile of $X$ , and suppose that $X$ possesses a finite mean

[TABLE]

The Lorenz curve, introduced by Lorenz (1905), is an irreplaceable tool in this domain. It is defined by

[TABLE]

The curve $l_{F}(p)$ expresses the share of income possessed by the $p$ % poorer part of population. It has been expressed firstly by Pietra (1915, with English translation now available as Pietra, 2014), and mathematically formulated as in (1.1) by Gastwirth (1971).

In the following we will also employ $m_{F}(p)=1-l_{F}(1-p),$ which provides the share of income owned by the richer $p$ % of the population. Obviously, $m_{F}(p)$ is the curve obtained by applying a central symmetry to $l_{F}(p)$ , with respect to the center of the unit square, as shown in Figure 1.1 and allows us also to rephrase the Gini into $G_{F}=\int_{0}^{1}(m_{F}(p)-l_{F}(p))dp$ .

We recall that the Gini can be rephrased as the weighted average of all comparisons made among the mean income of the poorest and the overall mean (Greselin et al. 2012, Greselin 2014). When dealing with skewed distributions, as it is the case for many economic size distributions, the median should be preferred to the mean, in such a way that Gastwirth (2014) proposed to modify the Gini accordingly.

Very recently, motivated by the observed shifts toward the extreme values in income distributions, a new focus is introduced in Gastwirth (2016), almost contemporarily to Davydov and Greselin (2016). Policy makers are nowadays interested in understanding what happens in the more critical portions of the population, as significant changes have been observed both in the left and in the right tail of the income distributions in countries like US, Germany, UK, France in the last decades. Notice that the classical Lorenz curve provides useful pointwise information with reference to poorest people, while on the other hand, as $L(p)$ approaches 1 as $p$ approaches 1, it does not display the variation within the upper end (f.i., top 5% or 1%) of the distribution clearly. The novel approach is to consider equally sized and opposite groups of population, and compare their mean income. Aiming at contrasting the economic position of the group of the poorer $p$ % to the one of the $p$ % of the richest, the following inequality curve has been introduced

[TABLE]

In the case of perfect equality, each fraction $p$ of population has same mean income, hence $D_{F}(p)=0$ for all $p\in[0,1]$ . While the income distribution moves toward more variability, the mean income $\frac{\mu_{F}m_{F}(p)}{p}$ of the $p\%$ of richest people will be moving far from the mean income $\frac{\mu_{F}l_{F}(p)}{p}$ of the $p\%$ of poorest part of the population, and $D_{F}(p)$ raises toward $1$ . Hence, we can represent the pointwise measure of inequality in the population by plotting $D_{F}(p)$ .

Naturally, we can summarize all the information given by the curve $D_{F}(p)$ in a single measure of inequality $D_{F}$ , by taking the expected value

[TABLE]

Notice that $D_{F}$ is the area between the observed inequality curve $D_{F}(p)$ and the curve of perfect equality, which is the horizontal line passing through the origin of the axes.

The structure of the paper is as follows. Section 2 introduces two estimators for the new inequality measure, and provide reasons for selecting them in view of their main purpose. The third section, which is the core of the paper, states the main inferential results, in more detail we will show the consistency of the estimators, state their asymptotical distribution, and the asymptotic negligibility of their difference. We also introduce an empirical estimator for the variance, and establish its convergence to the finite variance of the estimator. Some lemmas useful for the inferential theory have been presented in Section 4, along with their proof. Section 5 shows how the inferential results can be employed to develop an analysis on real income data. Final considerations are given in Section 6.

2 Estimators

Economic data on the entire (or complete) population is rarely available, so most studies are based on data obtained from well-designed sample surveys. Hence we usually have to estimate summary measures from samples. We introduce here two empirical estimators, say $\widehat{D}_{n}$ and $\widetilde{D}_{n}$ for estimating $D_{F}$ . The first one is derived, in a very natural way, by replacing the population cdf $F(x)$ and mean value $\mu_{F}$ in (1.3) by their empirical counterparts $F_{n}(x)=\frac{1}{n}\sum_{i=1}^{n}\mathds{1}_{[0,x]}\big{(}X_{i}\big{)}$ , and $\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}=\frac{1}{n}\sum_{i=1}^{n}X_{i}$ , and then considering the empirical Lorenz curve, say

[TABLE]

as follows

[TABLE]

where we set $G_{n}(p)={l_{n}(p)}/{m_{n}(p)}$ .

Then, the second estimator is defined in terms of the order statistics $X_{1:n}\leq\cdots\leq X_{n:n}$ of the i.i.d. sample $X_{1},X_{2},...,X_{n}$ drawn from $X$ , therefore we define

[TABLE]

where $G_{n}(i/n)$ expresses the ratio between the mean income of the poorest $i$ and of the richest $i$ elements in the sample.

We will show later, in Theorem 3.25, that the two estimators $\widehat{D}_{n}$ and $\widetilde{D}_{n}$ are asymptotically equivalent. While the estimator $\widehat{D}_{n}$ is suitable for developing inferential results, $\widetilde{D}_{n}$ is much simpler when it comes to implement code for the analysis of real data.

3 Inferential results

In this Section we will present our main results, starting from the consistency of the estimator $\widehat{D}_{n}$ , next we state its asymptotic normal distribution, and then we deal with its variance estimation. Finally, we will show the asymptotic equivalence of the two estimators $\widetilde{D}_{n}$ and $\widehat{D}_{n}$ .

Unless explicitly stated otherwise, we assume throughout that the cdf $F(x)$ of $X$ is a continuous function. This is a natural choice when modeling income or wealth distributions, and for many other economic size distributions.

3.1 Consistency of $\widehat{D}_{n}$

Theorem 3.1.

$\widehat{D}_{n}$ * is a consistent estimator for $D_{F}$ .*

Proof.

From the normalized definition of the empirical Lorenz curve and its dual, say $l_{n}(p)$ and $m_{n}(p)$ , it is useful here to introduce their absolute versions, given by $L_{n}(p)=\int_{0}^{p}F_{n}^{-1}(s)ds$ and $M_{n}(p)=\int_{1-p}^{1}F_{n}^{-1}(s)ds$ . We may rephrase $\widehat{D}_{n}$ as

[TABLE]

For all $p\in[0,1]$ , we have that $L_{n}(p)$ converges, with probability 1, uniformly to $L(p)=\int_{0}^{p}F^{-1}(s)ds$ (see Goldie, 1977). With the same approach, we have that $M_{n}(p)$ converges, with probability 1, uniformly to $M(p)=\int_{1-p}^{1}F^{-1}(s)ds$ . As $L(p)\leq M(p)$ , due to the Lebesgue dominated convergence theorem we get the thesis. ∎

3.2 Asympthotic normality of the estimator $\widehat{D}_{n}$

Theorem 3.2.

If the moment $\mathbb{E}|X|^{2+\delta}$ is finite for some $\delta>0$ , then we have the asymptotic representation

[TABLE]

where $o_{\textbf{P}}(1)$ denotes a random variable that converges to 0 in probability when $n\to\infty$ , and

[TABLE]

with the weight function $\omega(t)=\omega_{1}(t)+\omega_{2}(t)$ , where

[TABLE]

Corollary 3.3.

Under the conditions of Theorem 3.2, we have that $\widehat{D}_{n}$ is asymptotically normally distributed, that is

[TABLE]

where

[TABLE]

The proof follows immediately from (3.2) by applying the Central Limit Theorem of P. Lévy.

*Proof of Theorem 3.2

*From the definition of $\widehat{D}_{n}$ and $D$ , we get

[TABLE]

where the remainder term is given by $R_{n}(t)=R_{n}^{(1)}+R_{n}^{(2)}$ and

[TABLE]

We will later show (Lemma 4.1 and 4.2, respectively) that $R_{n}^{(1)}$ and $R_{n}^{(2)}$ are of order $o_{\textbf{P}}(1).$ The proof follows the approach of Greselin, Pasquazzi and Zitikis (2010), to state the asymptotic normality for the Zenga inequality index (Zenga, 2007). Hence we now proceed our analysis of the first two terms in (3.4), by using the Vervaat process

[TABLE]

and its dual,

[TABLE]

for which we know that $V^{*}_{n}(p)=-V_{n}(p)$ . For mathematical and historical details on the Vervaat process, see Zitikis (1998), Davydov and Zitikis (2004), and Greselin et al. (2009). Now, denoting the uniform on $[0,1]$ empirical process by $e_{n}(p)=\sqrt{n}(F_{n}(F^{-1}(p))-p)$ and using one property of the Vervaat process, namely

[TABLE]

we find a bound for the first term in (3.4) as follows

[TABLE]

where

[TABLE]

We will later show (Lemma 4.3) that $R_{n}^{(3)}=o_{\textbf{P}}(1).$

Now we deal with the second term in (3.4), and we obtain, using similar arguments as before

[TABLE]

where

[TABLE]

In Lemma 4.4 we show that $R_{n}^{(4)}=o_{\textbf{P}}(1),$ therefore we have

[TABLE]

We notice that the first term in the right hand side of equation (3.10) could be rewritten as

[TABLE]

where

[TABLE]

and

[TABLE]

For the second term in the right hand side of equation (3.10), using the change of variable $t=1-s$ , we obtain:

[TABLE]

where

[TABLE]

and

[TABLE]

This completes the proof of Theorem 3.2. ∎

3.3 Convergence of the empirical variance

We deal here with the theoretical variance $Var\big{(}h(X_{1})\big{)}$ , that is

[TABLE]

and its empirical counterpart

[TABLE]

Let $x_{0}\geq 0$ be the minimum value in the support of $F(x)$ , i.e. the value such that $F(y)=0$ for $y<x_{0}$ and $F(y)>0$ if $y>x_{0}$ . Analogously, let $T_{0}$ be maximum value in the support of $F(x)$ , i.e. such that $F(x)<1$ $\forall x<T_{0}$ and $F(T_{0})=1$ . Notice that we may have $T_{0}=+\infty$ . Then we have

•

$F_{n}(x)=0\quad\forall x<x_{0}$ because $X\geq x_{0}$ a.s.,

•

$\sigma_{F}^{2}=\int_{x_{0}}^{\infty}\left[\ldots\right]\ldots dy$ ,

•

$\sigma_{n}^{2}=\int_{x_{0}}^{\infty}\left[\ldots\right]\ldots dy$ .

Therefore, without loss of generality, we can take $x_{0}=0$ .

Theorem 3.4.

Assume that $\mathbb{E}|X|^{2+\delta}<+\infty$ for some $\delta>0$ . Then, we have a.s.

[TABLE]

Proof.

The proof is composed by three steps.

Step 1:

For all $\epsilon,T$ such that $0<\epsilon<T<T_{0}$ , we will show that, with probability 1, for almost all $y\in[\epsilon,T]$ , and with $\omega(t)=\omega_{1}(t)+\omega_{2}(t)$ given by (3.3) we have

[TABLE]

We begin by the study of the first part of (3.14) related to $\omega_{1}(t)$ , i.e.

[TABLE]

We know that

•

as $F_{n}(x)\to F(x)$ a.s. (uniformly), we have the convergence $\mathds{1}_{[F_{n}(x),1]}(s)\to\mathds{1}_{[F(x),1]}(s)$ with probability 1 for almost all $s\in[0,1]$ and $x\in R^{+}$ ,

•

$\forall s\in[0,1]$ and with probability 1, we have that

[TABLE]

•

$M_{n}(s)\geq s\,\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}\quad\forall s\in[0,1]$ .

As $\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}\to\mu_{F}$ a.s., with probability 1 there exists a constant $C>0$ such that

[TABLE]

Hence, Lebesgue theorem gives (3.14), with $\omega$ replaced by $\omega_{1}$ .

Now we consider the second part of (3.14), where $\omega_{2}(t)$ takes the place of $\omega$ :

[TABLE]

and observe that

•

$\mathds{1}_{[0,F_{n}(x)]}(s)\to\mathds{1}_{[0,F_{(}x)]}(s)$ with probability 1 for almost all $s\in[0,1]$ and $x\in R^{+}$ ,

•

$\forall s\in[0,1]$ , with probability 1, we have that $L_{n}(1-s)\to L(1-s)$ , and $M_{n}(1-s)\to M(1-s)$ ,

•

$L_{n}(1-s)\leq M(1-s)\quad\forall s\in[0,1]$ .

Therefore

[TABLE]

Once more using Lebesgue theorem we get $\forall y\leq T$

[TABLE]

which completes the proof of (3.14).

Step 2:

For all $\epsilon,T$ such that $0<\epsilon<T<T_{0}$ and given

[TABLE]

where

[TABLE]

we will show that, with probability 1,

[TABLE]

Due to the previous step, for every $y$ we know that $\Psi_{n}(y)$ converges, with probability 1, to

[TABLE]

We have shown that, with probability 1,

[TABLE]

and using

[TABLE]

it follows that we have a.s.

[TABLE]

Hence

[TABLE]

Observing now that the function at the right hand side in (3.16) is integrable on $[\epsilon,T]$ , we can apply Lebesgue’s dominated convergence theorem and prove (3.15).

Step 3:

To complete the proof of Theorem 3.13 we need to obtain a bound for the integrals $\sigma_{n}^{2}[0,\epsilon]$ and $\sigma_{n}^{2}[T,\infty]$ . We use the following more delicate estimation of $\omega(t)$ , for all $\gamma>0$ : there exists a positive constant $C_{\gamma}$ such that

[TABLE]

Indeed, as $L(t)\leq t\mu_{F}\leq M(t)$ , for $t\in[0,1]$ , we have

[TABLE]

and

[TABLE]

Hence, for every $\gamma>0$ , there exists a constant $0<C_{\gamma}<+\infty$ , depending only on $\gamma$ , such that for all $t\in(0,1)$

[TABLE]

which jointly give the estimation (3.17).

For $\gamma\leq 1/2$ , we have

[TABLE]

We have similarly

[TABLE]

Let us introduce a new probability space $(\widetilde{\Omega},\widetilde{\mathcal{F}},\widetilde{\mathbb{P}})$ and a new random variable $Y$ , taking the values $X_{i}$ , for $i=1,\ldots,n$ , such that $\widetilde{\mathbb{P}}\big{(}Y=X_{i}\big{)}=\frac{1}{n}$ . Then

[TABLE]

If $p\leq\delta$ then, due to the strong law of large numbers,

[TABLE]

hence, with probability 1, we have

[TABLE]

Observing now that, for $\gamma<\delta/[2(2+\delta)]$ , the integral $\int_{T}^{\infty}y^{\left(1-(1-2\gamma)(2+\delta)\right)}dy$ converges, and defining $\beta$ such that $\big{(}1-(1-2\gamma)(2+\delta)\big{)}=-(1+\beta)$ , then (3.20) takes the form

[TABLE]

Evidently, replacing $F_{n}(x)$ by $F(x)$ in (3.18) and (3.21), we obtain their theoretical counterparts

[TABLE]

and

[TABLE]

Now, collecting the bounds (3.18) and (3.22), the convergence stated in (3.15), and finally bounds (3.21) and (3.23) from the three steps

[TABLE]

Taking $\epsilon\to 0$ and $T\to\infty$ in (3.24), we arrive at (3.13). ∎

Having established the consistency and asymptotic normality for the estimator $\widehat{D}_{n}$ , we would like to prove similar properties for the second estimator $\widetilde{D}_{n}$ defined in (2.2). To do this, we will focus on their difference $\Delta_{n}:=\widehat{D}_{n}-\widetilde{D}_{n}$ and prove its asymptotic negligibility.

Theorem 3.5.

If the moment $\mathbb{E}|X|^{2+\delta}$ is finite for some $\delta>0$ , then we have

[TABLE]

Before proving Theorem 3.25, it is worth to state two useful Corollaries.

Corollary 3.6.

If the moment $\mathbb{E}|X|^{2+\delta}$ is finite for some $\delta>0$ , then we have

[TABLE]

where $\sigma_{F}^{2}=Var\big{(}h(X_{1})\big{)}$ is the theoretical variance.

Corollary 3.7.

*Under the same assumptions, we have also *

[TABLE]

where $\sigma_{n}^{2}$ is the empirical counterpart for $\sigma_{F}^{2}$ given by (3.12).

The same is true if we replace $\widetilde{D}_{n}$ by $\widehat{D}_{n}$ .

Proof.

(of Theorem 3.25). Let $\epsilon:=\epsilon_{n}=\frac{m_{n}}{n}\sim n^{-\alpha-1/2}$ , where $0<\alpha<\frac{\delta}{2(2+\delta)}$ . We have

[TABLE]

where $\omega_{G_{n}}^{[\epsilon,1]}$ is the modulus of continuity of $G_{n}$ on the interval $[\epsilon,1]$ given by

[TABLE]

Let $t\in\left[(i-1)/n,i/n\right]$ , then

[TABLE]

where we used $M_{n}(s)=1-L_{n}(1-s)$ and the inequalities $L_{n}(s)\leq M_{n}(s)$ , $s\,\,\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}\leq M_{n}(s)$ that hold true $\forall s\in[0,1]$ . From the bounds in (3.28) and (3.29) we get

[TABLE]

As for $t\in[(i-1)/n,i/n]$

[TABLE]

we get

[TABLE]

Therefore, due to (3.30), we obtain

[TABLE]

As $2/n^{\alpha}\rightarrow 0$ and $\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}\rightarrow\mu_{F}$ , it is sufficient to state the convergence in probability of $M_{n}/n^{1/2-\alpha}$ to 0. We have, for $t>0:$

[TABLE]

as we have chosen $\alpha<\frac{\delta}{2(2+\delta)}$ . ∎

4 Proofs

Lemma 4.1.

Under the conditions of Theorem 3.2, we have that

[TABLE]

Proof.

We estimate $R_{n}^{(1)}$ by splitting the integral in two parts, by choosing $\rho\in(0,1)$

[TABLE]

We now look for getting a bound for $R_{n}^{(1,1)}$ and initially deal with its first part, given by

[TABLE]

Provided that $t<\rho$ , we have that $M(t)\geq t\,F^{-1}(1-\rho)$ , and

[TABLE]

Now we consider the lefthand term in (4.3), and set

[TABLE]

where $e_{n}(t)=\sqrt{n}\big{(}F_{n}(F^{-1}(t))-t\big{)}$ . We know that (see (9.2) in Greselin et al. 2010)

[TABLE]

Therefore, employing the inequality in (3.9) related to the Vervaat process, and choosing $\epsilon$ such that $0<\epsilon<\frac{1}{2}$ , we obtain

[TABLE]

As $|F_{n}^{-1}(t)-F^{-1}(t)|\leq F_{n}^{-1}(\rho)+F^{-1}(\rho)\,{\buildrel a.s.\over{\longrightarrow}}\,2\,F^{-1}(\rho)$ , by Lebesgue dominated convergence theorem the integral

[TABLE]

Hence

[TABLE]

The righthand term in (4.3) may be estimated as follows

[TABLE]

for all $0<\epsilon<1/2$ , where we set $\phi(\rho)=\displaystyle{\int_{0}^{\rho}\ t^{-1/2-\epsilon}dt}$ , and $C_{1}=\int_{0}^{+\infty}\big{(}1-F(x)\big{)}^{1/2-\epsilon}dx<+\infty$ . The latter quantity is finite, due to the existence of the $2+\delta$ moment of $X$ .

To complete the analysis of $R_{n}^{(1,1)}$ we have to deal now with its second part, given by

[TABLE]

As $M_{n}(t)\geq tF_{n}^{-1}(1-\rho)$ for $t\in[0,1]$ and $F_{n}^{-1}(1-\rho)\,{\buildrel a.s.\over{\longrightarrow}}\,F^{-1}(1-\rho)$ , the bound for (4.7) can be found by following the same steps as for (4.2).

We continue our proof now by finding a bound for the second term $R_{n}^{(1,2)}$ in (4.1). As $\rho\leq t\leq 1$ , then

[TABLE]

and

[TABLE]

Therefore, setting $H_{n}:=\displaystyle{\sup_{s\in[1-\rho,1]}\,\,{\frac{1}{M_{n}(s)\,M(s)}}}=\mathcal{O}_{\textbf{P}}(1)$ , we have

[TABLE]

We observe that (4.8) is $o_{\textbf{P}}(1)$ if the following two equalities hold true:

[TABLE]

and

[TABLE]

by recalling that

[TABLE]

due to

[TABLE]

To get (4.9), remark that $\sqrt{n}\,\,\big{|}\hbox{\vbox{\hrule height=0.5pt\kern 2.15277pt\hbox{\kern-1.00006pt$ X $}}}_{n}-\mu_{F}\big{|}=O_{\textbf{P}}(1)$ , and

[TABLE]

Finally, to get (4.10), we begin with the inequality

[TABLE]

and use the following bound for the latter integrand

[TABLE]

Recalling that

[TABLE]

and exploiting (4.4) we get

[TABLE]

Integrating in $dt$ on $[\rho,1]$ we hence obtain the desired bound.

From the previous estimates it follows that $\forall\rho:\,\,0<\rho<1$ we have

[TABLE]

where

[TABLE]

Fixing $\epsilon>0$ , let $C>0$ be such that $\mathbb{P}\left\{|K_{n}|>C\right\}<\epsilon$ $\,\forall n$ , and let $\rho_{\epsilon}>0$ be such that for $\rho<\rho_{\epsilon}$ we have $\phi(\rho)<{\epsilon}/{2C_{1}C}$ . Then, having

[TABLE]

we get, for $\rho<\rho_{\epsilon}$ ,

[TABLE]

which finally gives $R_{n}^{(1)}=o_{\textbf{P}}(1).$ ∎

Lemma 4.2.

*Under the conditions of Theorem 3.2, we have that *

[TABLE]

Proof.

We start from the definition of $R_{n}^{(2)}$ in (3.6) here recalled for convenience

[TABLE]

Observing that $L(t)\leq M(t)$ for $t\in[0,1]$ and using (4.11) to rewrite $\big{(}M_{n}(t)-M(t)\big{)}$ , the proof can be established following the proof of Lemma (4.1) with minor modifications. ∎

Lemma 4.3.

Under the conditions of Theorem 3.2, we have that

[TABLE]

Proof.

We estimate $R_{n}^{(3)}$ by splitting it in two integrals as follows

[TABLE]

Let us consider the first term $R_{n}^{(3,1)}$ and observe that

[TABLE]

where we assume that $F^{-1}(\frac{1}{2})>0$ , otherwise we may replace $F^{-1}(\frac{1}{2})$ by $F^{-1}(a)>0$ , with $a\in(0,1)$ appropriately chosen. Hence, by choosing $\epsilon\leq\frac{1}{2}$ , and recalling that $e_{n}(t)=\sqrt{n}\left(F_{n}\big{(}F^{-1}(t)\big{)}-t\right)$ , we arrive at

[TABLE]

as $K_{n}=\mathcal{O}_{\textbf{P}}(1)$ and $F_{n}^{-1}(t)\,{\buildrel a.s.\over{\longrightarrow}}\,F^{-1}(t)$ for $t\in[0,1]$ .

Now we deal with $R_{n}^{(3,2)}$ , i.e. the second term in (4.12). Observing that $M(t)=\int_{1-t}^{1}F^{-1}(s)ds\geq\int_{1/2}^{1}F^{-1}(s)ds=c>0$ , we obtain

[TABLE]

∎

Lemma 4.4.

Under the conditions of Theorem 3.2, we have that

[TABLE]

Proof.

We will deal with $R_{n}^{(4)}$ , as for the previous Lemma, by splitting it as follows

[TABLE]

and we initially consider $R_{n}^{(4,1)}$ . Observing that $M(t)\geq tF^{-1}(1/2)$ for $t<1/2$ , we have

[TABLE]

Finally, the result on $R_{n}^{(4,2)}$ comes from observing that for $t\in(1/2,1)$ there exists a constant $C$ such that $M(t)\geq C$ , and that $\sup_{t\in(0,1)}e_{n}(1-t)\leq K_{n}$ , we have

[TABLE]

due to the assumption on the second (hence the first) moment finite on $X$ . ∎

5 The new inequality measure on real data

The purpose of this section is to show, through a real data application, the theoretical results obtained in the previous sections. We employ the Bank of Italy Survey on Household Income and Wealth (hereafter named by its acronym, SHIW) dataset, published in 2016. This survey contains information on household post-tax income and wealth in the year 2014, covering 8,156 households, and 19,366 individuals. The sample is representative of the Italian population, which is composed of about 24,7 million households and 60,8 million individuals. The SHIW provides information on each individual’s Personal Income Tax net income, but does not contain the corresponding gross income. We employ an updated version of the microsimulation model described in Morini and Pellegrino (2016) to estimate the latter for each taxpayer. A comparison of the results from the microsimulation model with the official statistics published by the Italian Ministry of Finance (2016) shows that the distribution of gross income and of net tax, according to bands of gross income and type of employment, are close to each other. The empirical analysis we develop here is based on the observed net income from the SHIV, while tax data and gross income arise from the microsimulation model.

To appreciate the asymptotic results of Section 3 on the empirical estimator $\widetilde{D}_{n}$ , we calculate four types of confidence intervals: the normal, the basic, the percentile and the BCa confidence intervals. After drawing the bootstrap samples, the empirical estimator is evaluated at each sample, and Figure 5.1 show the histograms of the obtained values when considering Gross Income in panel (a), Net Income in panel (b) and Taxes in panel (c). While inequality estimators have a skewed distribution in case of low sample size, here the accuracy of the normal approximation is apparent, due to the large sample size. As a further check of the quality of the first order approximation, Figure 5.2 shows the Q-Q plots obtained for the three cases.

Finally, from Figure 5.3 we observe that the four methods for constructing Confidence Intervals have a substantial agreement. They all agree in assuring that there is a statistically significant increase in inequality when passing from Net Income to Gross Income (the redistributive effect of taxation) and from Gross Income to Taxes. We recall that adjusted percentile methods (also named BCa) for calculating confidence limits are inherently more accurate than the basic bootstrap and percentile methods (Davison and Hinkley, 1997).

6 Concluding remarks

Moved from the considerations that nowadays, in many developed countries, the more critical (i.e the extremes) portions of the population are facing great reshaping of their economic situation, a new index for measuring inequality have been proposed in Davydov and Greselin (2016). In the cited paper, a discussion of the properties of the index has been given, to motivate its introduction in the literature and to show its descriptive features. Inferential results for the index were still missing, and this paper is a first contribution to fill the gap. After proposing two empirical estimators, we have shown their asymptotic equivalence. Then, consistency and asymptotic normality for the first estimator have been derived. We also proved the convergence of the empirical estimator for the variance to its finite theoretical value. Finally, we used the new statistical inferential results to analyze data on Net Income from the Bank of Italy Survey on Household Income and Wealth, and to compare them with Gross Income and Taxes.

References

Atkinson A. B. 1970. On the Measurement of Inequality, Journal of Economic Theory, 2, 244–263.

Cobham A., Sumner A. 2013. Putting the Gini back in the bottle? The Palma as a Policy-Relevant Measure of Inequality, Technical report, Kings College London.

Davydov Yu., Zitikis R. 2004. Convex rearrangements of random elements in Asymptotic Methods in Stochastics, vol. 44 of Fields Institute Communications, pp. 141–171, American Mathematical Society, Providence, RI, USA.

Davydov Yu., Greselin F. 2016. Comparisons between poorest and richest to measure inequality, (under revision)

Davison A. C., Hinkley D. V. 1997. Bootstrap methods and their application (Vol. 1). Cambridge University Press.

Gastwirth J.L. 2016. Measures of Economic Inequality Focusing on the Status of the Lower and Middle Income Groups, Statistics and Public Policy, 3:1, 1-9.

Gastwirth J.L. 2014. Median-based measures of inequality: reassessing the increase in income inequality in the U.S. and Sweden, Journal of the International Association for Official Statistics, 30, 311–320.

Gini C. 1914. Sulla misura della concentrazione e della variabilità dei caratteri, Atti del Reale Istituto Veneto di Scienze, Lettere ed Arti, 73, 1203–1248. (English translation in Gini, C. 2005. On the measurement of concentration and variability of characters. Metron, 63, 3–38.)

Greselin F. 2014. More equal and poorer, or richer but more unequal?, Economic Quality Control, 29, 2, 99–117.

Greselin F., Pasquazzi L., Zitikis R. 2010. Zenga’s new index of economic inequality, its estimation, and an analysis of incomes in Italy, Journal of Probability and Statistics, 26 pp., DOI:10.1155/2010/718905.

Greselin F., Pasquazzi L., Zitikis R. 2012. Contrasting the Gini and Zenga indices of economic inequality, Journal of Applied Statistics, 40, 2, 282–297.

Greselin F., Puri M. L., Zitikis R. 2009. L-functions, processes, and statistics in measuring economic inequality and actuarial risks, Statistics and Its Interface, 2, 2, pp. 227–245.

Goldie C. M. 1977. Convergence theorems for empirical Lorenz curves and their inverses

Advances in Applied Probability*, 9, 765–791.

Lorenz M.C. 1905. Methods of measuring the concentration of wealth, Journal of the American Statistical Association, 9, 209–219.

Morini M., Pellegrino S. 2016. Personal income tax reforms: A genetic algorithm approach, European Journal of Operational Research, first online, https://doi-org.proxy.unimib.it/10.1016/j.ejor.2016.07.059 Pietra G. 1915. Delle relazioni tra gli indici di variabilità. Note I. Atti del Reale Istituto Veneto di Scienze, Lettere ed Arti, 74, 775–792

Pietra G. 2014. On the relationships between variability indices. Note I. Metron, 72, 5–16

Zenga M. 2007. Inequality curve and inequality index based on the ratios between lower and upper arithmetic means, Statistica & Applicazioni, 5, 3–27.

Zitikis R. 1998. The Vervaat process, in Szyszkowicz, B. (ed.), Asymptotic Methods in Probability and Statistics, Elsevier Science, Amsterdam, pp. 667-694.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

1 Introduction

2 Estimators

3 Inferential results

3.1 Consistency of D^n\widehat{D}_{n}Dn​

Theorem 3.1**.**

Proof.

3.2 Asympthotic normality of the estimator D^n\widehat{D}_{n}Dn​

Theorem 3.2**.**

Corollary 3.3**.**

3.3 Convergence of the empirical variance

Theorem 3.4**.**

Proof.

Theorem 3.5**.**

Corollary 3.6**.**

Corollary 3.7**.**

Proof.

4 Proofs

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Lemma 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

5 The new inequality measure on real data

6 Concluding remarks

References

3.1 Consistency of $\widehat{D}_{n}$

Theorem 3.1.

3.2 Asympthotic normality of the estimator $\widehat{D}_{n}$

Theorem 3.2.

Corollary 3.3.

Theorem 3.4.

Theorem 3.5.

Corollary 3.6.

Corollary 3.7.

Lemma 4.1.

Lemma 4.2.

Lemma 4.3.

Lemma 4.4.