Unified and robust Lagrange multiplier type tests for cross-sectional   independence in large panel data models

Zhenhong Huang; Zhaoyuan Li; Jianfeng Yao

arXiv:2302.14387·econ.EM·March 1, 2023

Unified and robust Lagrange multiplier type tests for cross-sectional independence in large panel data models

Zhenhong Huang, Zhaoyuan Li, Jianfeng Yao

PDF

Open Access

TL;DR

This paper introduces a unified, robust Lagrange multiplier test for detecting cross-sectional dependence in large panel data models, applicable across various model types and error distributions, with theoretical validation and simulation support.

Contribution

It develops a unified test procedure and a power enhancement version for cross-sectional independence, valid under broad panel data settings and error distributions.

Findings

01

The tests are asymptotically valid under large panel asymptotics.

02

Monte Carlo experiments confirm robustness and power of the proposed tests.

03

The power enhancement technique improves detection capabilities.

Abstract

This paper revisits the Lagrange multiplier type test for the null hypothesis of no cross-sectional dependence in large panel data models. We propose a unified test procedure and its power enhancement version, which show robustness for a wide class of panel model contexts. Specifically, the two procedures are applicable to both heterogeneous and fixed effects panel data models with the presence of weakly exogenous as well as lagged dependent regressors, allowing for a general form of nonnormal error distribution. With the tools from Random Matrix Theory, the asymptotic validity of the test procedures is established under the simultaneous limit scheme where the number of time periods and the number of cross-sectional units go to infinity proportionally. The derived theories are accompanied by detailed Monte Carlo experiments, which confirm the robustness of the two tests and also suggest…

Tables8

Table 1. Table 1: Application scope of each test for cross-sectional independence

	$C D_{P}$	$L M_{a d j}$	$L M_{b c}$	$L M_{R M T}$	$R L M$	$R L M_{P E}$
Heterogeneous coefficients	✓	✓	$?$	✓	✓	✓
Fixed effects panels	*	*	✓	*	✓	✓
Dynamic panels	✓	*	✓	*	✓	✓
Weakly exogenous regressors	*	$?$	$?$	*	✓	✓
Non-normal errors	✓	*	*	*	✓	✓
SIM-L	✓	$?$	✓	✓	✓	✓

Table 2. Table 2: Empirical size of tests in DGP1

$k = 2$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	5.15	5.15	5.55	5.55	4.65	5.35	4.9	5.25	5.15
	$R L M_{P E}$	5.4	5.7	5.6	5.2	4.45	5.8	4.8	5	5.55
	$L M_{a d j}$	5.85	5.35	5.85	6	4.7	5.45	5.4	5.45	5.25
	$C D_{P}$	4.75	4.45	4.65	4.85	4.25	5.25	4.5	4.15	5.1
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	5.5	5.15	4.55	5.05	4.5	4.95	5.15	4.95	6
	$R L M_{P E}$	5.5	5.9	4.75	4.45	4.85	4.9	5.05	4.7	5.6
	$L M_{a d j}$	5.7	5.25	4.6	5.1	4.6	5	5.35	4.95	6
	$C D_{P}$	5.4	4.85	5.05	5.25	5.2	5.1	5.2	4.75	5.2
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	5.8	5.3	6.2	5.1	5.45	5	5.4	5.6	5.5
	$R L M_{P E}$	5.5	5	5.55	4.95	5.55	4.65	5.4	5.85	5.35
	$L M_{a d j}$	5.7	5.3	6.2	5.05	5.45	5	5.2	5.5	5.5
	$C D_{P}$	4.05	4.75	5.7	5	4.85	5.4	4.85	5.05	4.9
$k = 4$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	5.65	5.05	5.65	5.5	5.25	5.4	5.3	5.4	5.3
	$R L M_{P E}$	5.4	5.3	5.5	5.35	5.2	5	5.4	5.65	5.45
	$L M_{a d j}$	5.75	5.05	5.75	5.6	5.25	5.4	5.4	5.45	5.3
	$C D_{P}$	4.95	5.1	4.55	5.45	5.3	5.1	4.85	5.35	5.2
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	7.3	5	5.25	6.45	6	5.35	6.4	6.25	5.3
	$R L M_{P E}$	7.1	5.35	5.05	5.75	5.7	5.35	6.2	5.5	5.1
	$L M_{a d j}$	6.7	4.55	5.2	5.75	5.6	5.2	5.6	5.7	5.1
	$C D_{P}$	5.55	4.75	5	4.35	4.7	5.25	4.3	4.4	5.35
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	6.75	5.85	6.25	7.65	5.9	5.55	7.55	5.75	4.95
	$R L M_{P E}$	6.25	6.15	5.55	7.4	5.75	5.55	7.25	5.75	4.95
	$L M_{a d j}$	5	5	5.55	5.8	4.75	5.1	5.65	4.9	4.65
	$C D_{P}$	5.9	5.7	4.85	5.15	5.3	5.5	4.45	5.5	4.95

Table 3. Table 3: Empirical power of of tests in DGP1 for dense case

$k = 2, h = 3$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	91.45	98.9	99.85	99.55	98.75	100	99.6	99.75	99.8
	$R L M_{P E}$	95.95	99.85	100	99.85	99.75	100	99.85	100	99.95
	$L M_{a d j}$	92.3	98.95	99.85	99.55	98.8	100	99.6	99.8	99.8
	$C D_{P}$	5.15	4.9	4.45	5.29	5.05	5.1	5.35	4.75	4.7
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	75.45	91.45	96.7	81.25	80.55	95.45	79.95	86.1	98.1
	$R L M_{P E}$	84.2	97.4	99.9	89.6	93.1	99.85	89.65	95.2	100
	$L M_{a d j}$	75.95	91.6	96.8	81.85	80.8	95.45	80.4	86.25	98.1
	$C D_{P}$	5.05	11	5.2	4.9	4.95	4.85	4.3	4.7	4.35
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	60.4	59.8	74.75	43	60.25	72.9	62.7	55.55	62.6
	$R L M_{P E}$	73.85	78.55	91.35	55.35	78.35	91.75	75.9	73.85	84.7
	$L M_{a d j}$	60.2	59.8	74.75	42.8	60.2	72.9	62.6	55.5	62.45
	$C D_{P}$	7.2	4.75	4.7	4.65	4.6	5.4	4.9	5.6	5.05
$k = 4, h = 3$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	90.1	99.2	99.55	98.15	98.9	99.95	95.1	99.4	99.9
	$R L M_{P E}$	94.9	99.85	100	99.4	99.85	100	97.55	99.85	100
	$L M_{a d j}$	90.2	99.2	99.55	98.15	98.95	99.95	95.15	99.4	99.9
	$C D_{P}$	5.2	7.65	4.4	31.65	4.85	4.85	4.7	5.55	4.8
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	80.7	91.9	96.25	79.2	87.3	95.65	78.3	91.85	96.55
	$R L M_{P E}$	88.45	98.1	99.65	88.85	95.85	99.55	87.6	97.95	99.65
	$L M_{a d j}$	79.55	91.6	96.2	78.5	87	95.5	76.8	91.4	96.55
	$C D_{P}$	4.1	5.2	4.6	4.05	5	4.35	4	5.2	5.35
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	48.15	70.95	70.15	49.15	60.8	68.6	47.4	60.1	65.7
	$R L M_{P E}$	60.1	87.5	91.05	60.75	78.6	89.1	58.65	78.25	86.3
	$L M_{a d j}$	43.35	68.3	68.75	43.65	58.25	67.1	42.65	56.75	63.85
	$C D_{P}$	5.25	4.7	4.9	4.55	5.05	4.95	7.55	5.3	4.3

Table 4. Table 4: Empirical power of tests in DGP1 for sparse case

$k = 2$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	20.4	28.85	43.95	20.55	25.9	39.85	20.25	20.3	27.7
	$R L M_{P E}$	18.35	30.65	52.25	17.2	28.15	45.6	18.75	21.05	31.25
	$L M_{a d j}$	21.5	29.25	44.45	21.55	26.75	40.25	21.6	21	28
	$C D_{P}$	7.45	6.75	6.65	7.15	7.8	6.9	7.4	6.85	6.5
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	23.6	13.6	38.35	14.05	18.95	39.55	11.9	23.75	31.3
	$R L M_{P E}$	25.6	13.8	49.3	14.15	20.5	50.5	11.95	25.85	38
	$L M_{a d j}$	24.2	13.65	38.45	14.75	19.2	39.7	12.45	24.05	31.4
	$C D_{P}$	7.9	6	7.35	7.35	6	7.25	6.8	5.9	6.4
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	12.2	11.85	27.9	10.45	18.4	38.45	11.8	19.75	54.15
	$R L M_{P E}$	13.4	12.5	35.8	10.8	20.35	52.75	12.65	21.9	72.15
	$L M_{a d j}$	12.1	11.8	27.8	10.4	18.35	38.3	11.7	19.7	54.1
	$C D_{P}$	6.55	5.45	6.55	6.65	5.9	7.1	6.2	6	6.9
$k = 4$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	10.45	24.3	26.4	9.15	27.25	27	17.5	51.7	28.5
	$R L M_{P E}$	10	27.85	30.95	9.65	28.65	28.9	15.85	58.05	30.35
	$L M_{a d j}$	10.5	24.35	26.4	9.3	27.6	27.05	17.55	51.95	28.5
	$C D_{P}$	6.25	7.75	5.95	7.1	7.75	7	7.1	9.1	6.75
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	16.5	29.9	16.45	15	13.35	36.35	16.85	9.5	50.3
	$R L M_{P E}$	16.45	34	18.8	15.05	13.55	46.5	17.15	10.35	64.95
	$L M_{a d j}$	15.35	28.75	16.1	13.65	12.9	35.95	15.6	9.05	49.9
	$C D_{P}$	7	6.95	6.2	7.4	7	6.9	7.7	5.75	7
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	14.3	18.2	55.95	11.15	15.55	47.65	8.4	15.85	77.55
	$R L M_{P E}$	14.1	19.85	75.55	11.1	15.6	64.25	9.1	17.1	93.3
	$L M_{a d j}$	10.85	16	54.4	8.25	13.7	46.35	6.2	13.75	75.85
	$C D_{P}$	5.1	6.15	7.5	5.4	6.1	7.25	5.1	6.25	7.6

Table 5. Table 5: Empirical power of tests in DGP1 for less sparse case

$k = 2$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	99.3	100	100	98.95	99.9	100	99.15	97.85	100
	$R L M_{P E}$	99.9	100	100	99.6	100	100	99.75	99.4	100
	$L M_{a d j}$	99.55	100	100	99.1	99.9	100	99.2	97.9	100
	$C D_{P}$	54.1	71.6	96.9	55.7	66.5	96.8	54.45	46.55	95.15
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	99.9	100	100	91.45	100	100	75.6	100	100
	$R L M_{P E}$	100	100	100	96.7	100	100	84.95	100	100
	$L M_{a d j}$	99.9	100	100	91.6	100	100	76.05	100	100
	$C D_{P}$	59.8	68.3	97.85	43.1	76.85	97.6	33.9	82.45	98.2
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	99.8	100	100	97.95	100	100	97.45	100	100
	$R L M_{P E}$	100	100	100	99.65	100	100	99.45	100	100
	$L M_{a d j}$	99.75	100	100	97.85	100	100	97.45	100	100
	$C D_{P}$	60.85	79.15	96.9	51	80.2	94.75	49.9	78	98.6
$k = 4$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	99.4	99.95	100	86.9	99.95	100	96.05	100	100
	$R L M_{P E}$	99.7	100	100	92.65	100	100	98.35	100	100
	$L M_{a d j}$	99.4	99.95	100	87	99.95	100	96.1	100	100
	$C D_{P}$	56.55	64.8	95.15	38.7	62.5	95.75	46.15	87.2	93.05
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	87.8	100	100	78.65	100	100	94.85	99.75	100
	$R L M_{P E}$	94.45	100	100	87.75	100	100	98.4	100	100
	$L M_{a d j}$	86.45	100	100	77.65	100	100	94.15	99.75	100
	$C D_{P}$	39.7	75.2	96.85	34.5	71	93.6	45.25	58	96.35
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	97.4	100	100	95.25	100	100	91.15	100	100
	$R L M_{P E}$	99.45	100	100	98.6	100	100	96.65	100	100
	$L M_{a d j}$	96.55	100	100	93.45	100	100	88.4	100	100
	$C D_{P}$	46.15	74.4	98.4	44.15	74.9	93.3	40.75	80.35	97.75

Table 6. Table 6: Empirical size of tests in DGP2

Weakly exogenous
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	5.85	4.45	4.75	5.1	4.7	5.55	4.7	5.2	5.8
	$R L M_{P E}$	5.15	4.15	4.9	4.75	4.9	6.4	5.3	5.05	5.6
	$L M_{a d j}$	9.25	8.4	8.6	8.8	8.8	9.2	8.95	8.85	9.2
	$C D_{P}$	4.65	4.25	4	5	4.5	5.05	5.6	4.4	5.35
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	4.55	5.4	5.8	5.65	4.95	5.05	5.55	5	5.9
	$R L M_{P E}$	5.2	5.05	5.15	5.6	4.95	4.9	5.85	4.9	5.05
	$L M_{a d j}$	13.15	13.6	14	13.65	13	12.6	13.75	13.4	12.6
	$C D_{P}$	5.6	4.35	5.2	6.25	5.2	5.6	6.1	5.2	6.2
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	5.85	6.9	5.3	5.05	5.45	4.8	4.4	5.2	5.2
	$R L M_{P E}$	5.05	5.75	5.1	5.5	5.75	4.7	5.7	5.2	5.6
	$L M_{a d j}$	28.2	27.25	27.3	27.35	28.25	26.3	30.35	28.5	26.7
	$C D_{P}$	4.5	5.1	5.1	5.7	5.05	5.65	4.85	5.1	5.9

Table 7. Table 7: Empirical size of tests in DGP3

$k = 2$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	5.7	4.7	5.05	5.1	5.45	5.55	5	5.1	4.95
	$R L M_{P E}$	5.55	4.85	4.95	4.9	4.8	5.6	5.2	4.7	4.6
	$L M_{a d j}$	6.4	5.15	5.1	5.6	5.75	5.65	5.7	5.4	5.1
	$C D_{P}$	4.5	4.7	5.05	4.8	4.45	5.3	5.1	4.45	4.85
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	4.85	5.25	4.6	5	5	4.7	5.05	4.35	5.6
	$R L M_{P E}$	4.9	4.9	4.75	5.25	4.7	5.05	5.5	4	5.65
	$L M_{a d j}$	5.45	5.4	4.7	5.2	5.05	4.95	5.4	4.55	5.65
	$C D_{P}$	5.2	4.75	6.2	5.45	5.5	5.7	5.55	5.6	5.5
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	6.45	6.35	4.35	5.05	5.3	4.7	4.65	5	4.3
	$R L M_{P E}$	6.55	5.9	4.75	5.65	5.4	5.2	5	5.1	4.55
	$L M_{a d j}$	6.45	6.35	4.35	5.15	5.35	4.7	4.7	5	4.3
	$C D_{P}$	4.85	5.95	4.9	4.95	5.4	5.55	4.25	5.45	5
$k = 4$
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	5.1	6.05	4.7	4.7	4.65	5.35	5.05	4.45	5.45
	$R L M_{P E}$	4.95	5.05	5.05	4.45	4.25	5.05	4.2	4.85	5.25
	$L M_{a d j}$	5.4	6.4	4.75	5	4.7	5.35	5.35	4.5	5.45
	$C D_{P}$	4.95	6.1	4.6	4.65	5.2	5.25	4.35	5.05	5.05
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	5.75	5.15	4.75	4.8	5.25	4.75	5	5.8	5.3
	$R L M_{P E}$	5.7	5.05	4.85	4.7	5.7	4.75	5.2	5.45	5.2
	$L M_{a d j}$	5.4	4.8	4.55	4.2	5.05	4.6	4.5	5.55	5.25
	$C D_{P}$	5.05	4.9	5.65	4.95	4.85	5	5.3	5.35	4.9
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	5.35	6.15	5.1	6.05	5.4	4.75	4.9	5.3	4.9
	$R L M_{P E}$	5.65	5.8	4.95	5.55	5.1	5.05	5.55	5.1	5.1
	$L M_{a d j}$	4.15	5.25	4.85	4.55	4.3	4.4	3.95	4.9	4.4
	$C D_{P}$	5	4.9	3.75	5.15	5.15	5.15	5.3	5.4	4.7

Table 8. Table 8: Empirical size of tests in DGP4

Dynamic
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)	(50,25)	(100,50)	(200,100)
$\frac{n}{T} = \frac{1}{2}$	$R L M$	6.3	5.25	5.1	5.05	5.5	5.25	5.4	4.75	4.85
	$R L M_{P E}$	5.8	5.15	4.45	4.7	5.3	4.9	5.4	4.7	4.9
	$L M_{a d j}$	6.75	5.45	5.1	5.45	5.6	5.3	5.85	5.15	4.95
	$C D_{P}$	4.9	5.3	5.4	4.35	5	5.45	4.35	5.8	5.15
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)	(50,50)	(100,100)	(200,200)
$\frac{n}{T} = 1$	$R L M$	6.65	5.25	5.95	5.8	5.35	4.6	6.2	5.3	5
	$R L M_{P E}$	5.7	5.35	5.4	6.1	5.45	4.45	6.05	5.5	4.6
	$L M_{a d j}$	6.7	5.3	6	5.9	5.35	4.6	6.35	5.4	5
	$C D_{P}$	4.75	4.85	4.85	4.85	4.6	4.25	4.8	4.8	5.5
		Chi-squared			Normal			Student-t
	$(T, n)$	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)	(50,100)	(100,200)	(200,400)
$\frac{n}{T} = 2$	$R L M$	5.7	5.9	5.65	6.2	5.1	5.35	5.95	5.5	5.45
	$R L M_{P E}$	6.45	6.05	5.1	5.4	5.45	5.05	5.05	5.3	5.15
	$L M_{a d j}$	5.3	5.7	5.6	5.75	5	5.3	5.6	5.35	5.3
	$C D_{P}$	4.4	6	4.55	5.7	5.75	5.05	5	4.95	5.6

Equations285

SEQ-L: T \to \infty, followed by n \to \infty,

SEQ-L: T \to \infty, followed by n \to \infty,

SIM-L: (n, T) \to \infty such that \frac{n}{T} \to c \in (0, \infty),

SIM-L: (n, T) \to \infty such that \frac{n}{T} \to c \in (0, \infty),

y_{i t} = x_{i t}^{'} β_{i} + v_{i t}, for i = 1, \dots, n; t = 1, \dots, T,

y_{i t} = x_{i t}^{'} β_{i} + v_{i t}, for i = 1, \dots, n; t = 1, \dots, T,

H_{0} : v_{i t} is independent of v_{j t}, for all t and i \neq = j .

H_{0} : v_{i t} is independent of v_{j t}, for all t and i \neq = j .

\overset{ρ}{^}_{ij} = \overset{ρ}{^}_{j i} = \frac{\sum _{t = 1}^{T} v ^ _{i t} v ^ _{j t}}{( \sum _{t = 1}^{T} v ^ _{i t}^{2} ) ^{1/2} ( \sum _{t = 1}^{T} v ^ _{j t}^{2} ) ^{1/2}},

\overset{ρ}{^}_{ij} = \overset{ρ}{^}_{j i} = \frac{\sum _{t = 1}^{T} v ^ _{i t} v ^ _{j t}}{( \sum _{t = 1}^{T} v ^ _{i t}^{2} ) ^{1/2} ( \sum _{t = 1}^{T} v ^ _{j t}^{2} ) ^{1/2}},

\overset{v}{^}_{i t} = y_{i t} - x_{i t}^{'} \hat{β}_{i},

\overset{v}{^}_{i t} = y_{i t} - x_{i t}^{'} \hat{β}_{i},

L M = \frac{T}{2} 1 \leq i \neq = j \leq n \sum \overset{ρ}{^}_{ij}^{2} = \frac{T}{2} [t r (\hat{R}^{2}) - n],

L M = \frac{T}{2} 1 \leq i \neq = j \leq n \sum \overset{ρ}{^}_{ij}^{2} = \frac{T}{2} [t r (\hat{R}^{2}) - n],

C D_{L M} = \frac{1}{4 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum (T \overset{ρ}{^}_{ij}^{2} - 1) = \frac{T ^{2}}{4 n ( n - 1 )} [t r (\hat{R}^{2}) - n - \frac{n ( n - 1 )}{T}],

C D_{L M} = \frac{1}{4 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum (T \overset{ρ}{^}_{ij}^{2} - 1) = \frac{T ^{2}}{4 n ( n - 1 )} [t r (\hat{R}^{2}) - n - \frac{n ( n - 1 )}{T}],

C D_{P} = \frac{T}{2 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum \overset{ρ}{^}_{ij} .

C D_{P} = \frac{T}{2 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum \overset{ρ}{^}_{ij} .

L M_{b c} = C D_{L M} - \frac{n}{2 ( T - 1 )} = \frac{T ^{2}}{4 n ( n - 1 )} [t r (\hat{R}^{2}) - n - \frac{n ( n - 1 )}{T} - \frac{n n ( n - 1 )}{T ( T - 1 )}] .

L M_{b c} = C D_{L M} - \frac{n}{2 ( T - 1 )} = \frac{T ^{2}}{4 n ( n - 1 )} [t r (\hat{R}^{2}) - n - \frac{n ( n - 1 )}{T} - \frac{n n ( n - 1 )}{T ( T - 1 )}] .

L M_{a d j} = \frac{1}{2 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum \frac{( T - k ) ρ ^ _{ij}^{2} - μ _{T, i, j}}{σ _{T, i, j}},

L M_{a d j} = \frac{1}{2 n ( n - 1 )} 1 \leq i \neq = j \leq n \sum \frac{( T - k ) ρ ^ _{ij}^{2} - μ _{T, i, j}}{σ _{T, i, j}},

μ_{T, i, j} = \frac{1}{T - k} t r (M_{i} M_{j}), σ_{T, i, j}^{2} = [t r (M_{i} M_{j})]^{2} a_{1 T} + t r (M_{i} M_{j})^{2} a_{2 T},

μ_{T, i, j} = \frac{1}{T - k} t r (M_{i} M_{j}), σ_{T, i, j}^{2} = [t r (M_{i} M_{j})]^{2} a_{1 T} + t r (M_{i} M_{j})^{2} a_{2 T},

a_{1 T} = a_{2 T} - \frac{1}{( T - k ) ^{2}}, a_{2 T} = \frac{3}{( T - k + 2 ) ^{2}},

a_{1 T} = a_{2 T} - \frac{1}{( T - k ) ^{2}}, a_{2 T} = \frac{3}{( T - k + 2 ) ^{2}},

L M_{R M T} = σ_{R M T}^{- 1} [t r (\hat{R}^{2}) - n - \frac{n ^{2}}{T} - \frac{n ^{2}}{T ^{2}} + \frac{n}{T}],

L M_{R M T} = σ_{R M T}^{- 1} [t r (\hat{R}^{2}) - n - \frac{n ^{2}}{T} - \frac{n ^{2}}{T ^{2}} + \frac{n}{T}],

σ_{R M T} = \frac{4 n ( 2 n + T ) ( n + 2 T )}{T ^{3}} - \frac{4 ( κ - 1 ) n ( n + T ) ^{2}}{T ^{3}} - \frac{( κ - 3 ) n ( n - 4 T ) ^{2} ( n + T ) ^{2}}{T ^{5}}

σ_{R M T} = \frac{4 n ( 2 n + T ) ( n + 2 T )}{T ^{3}} - \frac{4 ( κ - 1 ) n ( n + T ) ^{2}}{T ^{3}} - \frac{( κ - 3 ) n ( n - 4 T ) ^{2} ( n + T ) ^{2}}{T ^{5}}

R L M = \frac{t r ( R ^ ^{2} ) - μ _{0}}{σ _{0}} ⟶ d N (0, 1),

R L M = \frac{t r ( R ^ ^{2} ) - μ _{0}}{σ _{0}} ⟶ d N (0, 1),

μ_{0} = n (1 + \frac{n}{T - 1}) - c_{T} = n (1 + \frac{n}{T} (1 + \frac{1}{T - 1})) - c_{T} = n + \frac{n ^{2}}{T} + \frac{n ^{2}}{T ^{2}} - \frac{n}{T} + o (1),

μ_{0} = n (1 + \frac{n}{T - 1}) - c_{T} = n (1 + \frac{n}{T} (1 + \frac{1}{T - 1})) - c_{T} = n + \frac{n ^{2}}{T} + \frac{n ^{2}}{T ^{2}} - \frac{n}{T} + o (1),

σ_{0}^{2} = σ_{R M T}^{2} + o (1)

σ_{0}^{2} = σ_{R M T}^{2} + o (1)

R L M = L M_{R M T} + o (1),

R L M = L M_{R M T} + o (1),

C D_{L M} = L M_{b c} + \frac{n}{2 ( T - 1 )},

C D_{L M} = L M_{b c} + \frac{n}{2 ( T - 1 )},

C D_{L M} = \frac{n}{n - 1} (R L M + \frac{n}{2 ( T - 1 )})

C D_{L M} = \frac{n}{n - 1} (R L M + \frac{n}{2 ( T - 1 )})

L M_{b c} = \frac{n}{n - 1} (R L M + \frac{n}{2 ( T - 1 ) ( n + n - 1 )}) .

L M_{b c} = \frac{n}{n - 1} (R L M + \frac{n}{2 ( T - 1 ) ( n + n - 1 )}) .

H_{1} : corr (v_{t}) = I_{n} + P_{n},

H_{1} : corr (v_{t}) = I_{n} + P_{n},

i \neq = j \sum ∣ ρ_{ij} ∣^{m} \sim card (E) \cdot (i < j max ∣ ρ_{ij} ∣)^{m}, as m \to \infty,

i \neq = j \sum ∣ ρ_{ij} ∣^{m} \sim card (E) \cdot (i < j max ∣ ρ_{ij} ∣)^{m}, as m \to \infty,

R L M_{P E} = \frac{t r ( R ^ ^{4} ) - μ _{P E}}{σ _{P E}} ⟶ d N (0, 1)

R L M_{P E} = \frac{t r ( R ^ ^{4} ) - μ _{P E}}{σ _{P E}} ⟶ d N (0, 1)

y_{i t} = α_{i} y_{i, t - 1} + x_{i t}^{'} β_{i} + v_{i t},

y_{i t} = α_{i} y_{i, t - 1} + x_{i t}^{'} β_{i} + v_{i t},

R L M ⟶ d N (0, 1) .

R L M ⟶ d N (0, 1) .

R L M_{P E} ⟶ d N (0, 1) .

R L M_{P E} ⟶ d N (0, 1) .

y_{i t} = x_{i t}^{'} β + μ_{i} + v_{i t},

y_{i t} = x_{i t}^{'} β + μ_{i} + v_{i t},

\hat{β} = (t = 1 \sum T i = 1 \sum n \tilde{x}_{i t} \tilde{x}_{i t}^{'})^{- 1} (t = 1 \sum T i = 1 \sum n \tilde{x}_{i t} \tilde{y}_{i t}),

\hat{β} = (t = 1 \sum T i = 1 \sum n \tilde{x}_{i t} \tilde{x}_{i t}^{'})^{- 1} (t = 1 \sum T i = 1 \sum n \tilde{x}_{i t} \tilde{y}_{i t}),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpatial and Panel Data Analysis · Global trade and economics · Economic Growth and Productivity

MethodsTest

Full text

Unified and robust Lagrange multiplier type tests for cross-sectional independence in large panel data models

Zhenhong Huang111 Department of Statistics and Actuarial Science, The University of Hong Kong. Email: [email protected], Zhaoyuan Li222School of Data Science, The Chinese University of Hong Kong, Shenzhen. Email: [email protected] and Jianfeng Yao333School of Data Science, The Chinese University of Hong Kong (Shenzhen). Email: [email protected]

Abstract

This paper revisits the Lagrange multiplier type test for the null hypothesis of no cross-sectional dependence in large panel data models. We propose a unified test procedure and its power enhancement version, which show robustness for a wide class of panel model contexts. Specifically, the two procedures are applicable to both heterogeneous and fixed effects panel data models with the presence of weakly exogenous as well as lagged dependent regressors, allowing for a general form of non-normal error distribution. With the tools from Random Matrix Theory, the asymptotic validity of the test procedures is established under the simultaneous limit scheme where the number of time periods ( $T$ ) and the number of cross-sectional units ( $n$ ) go to infinity proportionally. The derived theories are accompanied by detailed Monte Carlo experiments, which confirm the robustness of the two tests and also suggest the validity of the power enhancement technique.

Keywords: Cross-sectional dependence, Large panels, Coefficient heterogeneity, Weak exogenous, Random Matrix Theory

1 Introduction

In panel data analysis, the problem of error cross-sectional dependence has attracted substantial attention in recent years. This cross-sectional dependence can arise for various reasons, such as omitted common or spatial effects. Ignoring error cross-sectional dependence can have dramatic effects on conventional panel estimators, e.g., the least squares, fixed and random effect estimators, and yield invalid inferential procedures such as commonly used panel unit root tests, where these tests assume cross-sectional independence. Therefore, designing efficacious tests for cross-sectional dependence is essential in panel data analysis.

There has been much work on testing for cross-sectional dependence in the literature. Breusch and Pagan (1980) proposed a Lagrange multiplier ( $LM$ ) test based on the squared pair-wise Pearson correlation coefficients of the residuals. Under the null hypothesis of no cross-sectional dependence, the $LM$ test is asymptotically chi-squared distributed with $T\rightarrow\infty$ and $n$ fixed. However, it is not applicable for large $n$ , which renders its popularity considering recent researches have focused on the large panels where both $T$ and $n$ can be large. In such high dimensional setting, there are two mainstream schemes considered by statisticians and econometricians, being known as sequential limit scheme and simultaneous limit scheme defined as following:

[TABLE]

and

[TABLE]

respectively. Under the SIM-L scheme, Frees (1995) proposed a distribution free $LM$ type test allowing for large $n$ , $R^{2}_{AVE}$ , based on the squared pair-wise Spearman rank correlation coefficients, which is asymptotically distributed as Chi-squared. The test has however imposed limitations on the number of regressors and could be oversized for small $T$ . Pesaran (2004) suggested a scaled version of the $LM$ test, denoted by $CD_{LM}$ , and showed its asymptotic property. The author however pointed out that the $CD_{LM}$ test is not correctly centered at zero for small $T$ , and is likely to exhibit large size distortions as $n$ increases. Pesaran (2004) also proposed an alternative approach, the $CD_{P}$ test, which employs pair-wise Pearson correlation coefficients of the residuals as well, but without squaring them. This test has universally correct size under a broad course of panel data model designs, but it lacks power when correlation coefficients within panel units have variable signs leading to certain cancellation effect. Particularly, this happens when the errors are generated from a factor model where the loadings average to zero. Pesaran (2015) extended the $CD_{P}$ test to the scenario of weak cross-sectional dependence. Pesaran et al. (2008) put forward another approach, $LM_{adj}$ , by deriving the exact expected values and variances of the squared correlation coefficients under the assumption of normally distributed errors and strictly exogenous regressors. By applying the classical central limit theory, it is shown that the $LM_{adj}$ test converges to standard normal distribution under the SEQ-L scheme. Bailey et al. (2021) proposed a new $LM$ type test, $LM_{RMT}$ , and proved its asymptotic normality under the SIM-L scheme using Random Matrix Theory. However, they require the assumptions of normal regressors and normal errors. In a slope homogeneity setting, Baltagi et al. (2012) analyzed the performance of the $CD_{LM}$ test in the fixed effects panel data model, then presented a bias corrected test, $LM_{bc}$ , and established its asymptotic normality under the SIM-L scheme assuming normal errors and strictly exogenous regressors. Baltagi et al. (2012) also showed that the $LM_{bc}$ test can be applied to dynamic panel data model with fixed effects, using the within estimator proposed by Hahn and Kuersteiner (2002). However, the slope homogeneity restriction has often been rejected in empirical analyses, see a detailed survey by Baltagi et al. (2008). Meanwhile, there are limited tests designed for the dynamic panel data model. Examples include the GMM approach of Sarafidis et al. (2009) applied to panels with homogeneous coefficients under factor model representation, and a heteroskedasticity robust $LM$ test, $LM_{HOY}$ , of Halunga et al. (2017) with $n^{2}/T\rightarrow 0$ required.

We make two distinct contributions in this paper. First, we propose a $LM$ type test statistic that can be applied to a wide class of linear panel models. Specifically, we show that our test is robust to both static and dynamic heterogeneous panel data models with weakly exogenous regressors and non-normal errors. With tools from Random Matrix Theory, we treat sample correlation matrix of residuals directly as a random matrix to establish the asymptotic normality of the test statistics under the SIM-L scheme, which has been suggested to be a more reliable strategy when dealing with high-dimensional statistical problems, see Yao et al. (2015). We also show that the proposed test is mathematically equivalent to both the $LM_{RMT}$ and the $LM_{bc}$ test. This finding theoretically enriches the $LM_{RMT}$ and $LM_{bc}$ test by relaxing the restrictive assumptions they need. It is worth mentioning that the existing literature on testing for cross-sectional dependence has mostly focused on the case of strictly exogenous regressors, including the $LM_{adj}$ , $LM_{bc}$ and $LM_{HOY}$ tests. Though Pesaran (2015) showed that the $CD_{P}$ test is also applicable to autoregressive panel data models so long as the errors are symmetrically distributed, the properties of the $CD_{P}$ test for dynamic panels that include weakly exogenous regressors have not yet been investigated. The new proposed test fills this gap for the weakly exogenous regressors case, and to the best of our knowledge, there is no such a unified test so far.

Second, weak cross-sectional dependence is common in empirical applications, see, for example, Bailey et al. (2016), Ertur and Musolesi (2017). This leads to a sparse correlation structure with few nonzero off-diagonal entries. Therefore, it is important to test the existence of such weak cross-sectional dependence, which corresponds to sparse alternatives in the high-dimensional statistics literature. The mainstream of more powerful tests for sparse alternatives are based on the maxima absolute value of sample correlations, see Cai et al. (2011) and Hall et al. (2010). However, these tests require stringent conditions that are ‘unfeasible’ in econometric applications and often suffer from size distortions due to slow rates of convergence. In view of this, we propose a novel and easy-implemented test statistic for cross-sectional dependence based on the fourth moment of sample correlation to boost the power of detecting sparse alternatives. Again, this test can be used in any aforementioned panel setting under suitable conditions on higher moments of the errors.

The remainder of the paper is organized as follows. Section 2 discusses the existing tests for no cross-sectional dependence. Section 3 introduces the new test statistics and establishes the limiting distributions under the SIM-L scheme. Sections 4 and 5 demonstrate that the proposed tests can be extended to both the dynamic and fixed effects panel data models. Section 6 reports the results of Monte Carlo simulations. Section 7 provides concluding remarks and further discussions.

Notations. Throughout the paper, for a matrix $\mathbf{A}\in\mathcal{R}^{n\times n}$ , $tr(\mathbf{A})$ represents the trace $\mathbf{A}$ . We use $\lambda_{1}(\mathbf{A}),\dots,\lambda_{n}(\mathbf{A})$ to denote $n$ eigenvalues of $\mathbf{A}$ . Further, for vectors $\mathbf{a},\mathbf{b}\in\mathcal{R}^{n\times 1}$ , we write $\langle\mathbf{a},\mathbf{b}\rangle=\mathbf{a}^{T}\mathbf{b}$ for their scalar product. In addition. $\|\cdot\|$ represents the Euclidean norm for a vector and the induced operator norm for a matrix.

2 Existing tests for cross-sectional dependence based on sample correlation

Consider the heterogeneous panel data model:

[TABLE]

where $i$ indexes the cross-sectional units and $t$ the time series observations, $y_{it}$ is the response variable and $\mathbf{x}_{it}$ is a $k\times 1$ vector of regressors with unity on the first row with coefficients $\boldsymbol{\beta}_{i}$ allowed to vary cross the cross-sectional units. For each $i$ , the error term, $v_{it}$ , are assumed to be serially independent with zero mean and finite variance. The null hypothesis of interest in the literature is

[TABLE]

When $T$ is sufficiently large, a natural way to test $H_{0}$ is based on some reliable estimates ( $\hat{\rho}_{ij}$ ) for the pair-wise error sample correlations ( $\rho_{ij}$ ). Specifically,

[TABLE]

where $\hat{v}_{it}$ is the Ordinary Least Squares (OLS) estimate of $v_{it}$ in (1) defined by

[TABLE]

with $\hat{\boldsymbol{\beta}}_{i}$ being the OLS estimate of $\boldsymbol{\beta}_{i}$ by regressing the $T$ sample observations $y_{it}$ on $\mathbf{x}_{it}$ for each $i$ . In the seemingly unrelated regression equations (SURE) context with fixed $n$ and as $T\rightarrow\infty$ , Breusch and Pagan (1980) proposed a Lagrange multiplier ( $LM$ ) test for testing $H_{0}$ given by

[TABLE]

where $\hat{\mathbf{R}}=(\hat{\rho}_{ij})_{1\leq i,j\leq n}$ is the sample correlation matrix of residuals. It has been shown that $LM\stackrel{{\scriptstyle d}}{{\longrightarrow}}\chi^{2}_{d}$ , where $d=\frac{1}{2}n(n-1)$ under $H_{0}$ and normal errors assumption. However, it is well known that the $LM$ test is severely oversized when $n$ is relatively large compared to $T$ . To amend this size distortion, Pesaran (2004) put forward a scaled version of the $LM$ test given by

[TABLE]

which is asymptotically distributed as $N(0,1)$ under the SEQ-L scheme. There are two important cases where the $CD_{LM}$ test is not reliable. Firstly, as Baltagi et al. (2012) noted, it will exhibit substantial size distortions in the homogeneous panel data model. Secondly, Pesaran (2004) pointed out that, in finite $T$ case, the $CD_{LM}$ test tends to over-reject the null due to the fact that $E(T\hat{\rho}_{ij}^{2}-1)$ is not correctly centered at zero. This kind of bias even accumulates as $n$ becomes larger. In this case, Dufour and Khalaf (2002) suggested to apply bootstrap method to (5). Pesaran (2004) and Pesaran (2015) proposed an alternative adjustment based on the raw, non-squared, sample correlation coefficients given by

[TABLE]

The test is asymptotically distributed as standard normal under both SEQ-L and SIM-L schemes. However, it is widely reported that the $CD_{P}$ test suffers from a specific loss of power when the loadings have zero mean in the cross-sectional dimension under factor representation. Baltagi et al. (2012) proposed another modified version of the $CD_{LM}$ test for fixed effect panel data model, $LM_{bc}$ , given by

[TABLE]

The test is asymptotically standard normal with normal errors and strictly exogenous regressors under the SIM-L scheme. Pesaran et al. (2008) proposed an alternative finite sample adjustment to the $LM$ test by deriving the exact moments of the squared sample correlation coefficients under normal errors and strictly exogenous regressors assumptions. Their $LM_{adj}$ test statistic is given by

[TABLE]

where

[TABLE]

and

[TABLE]

$\mathbf{M}_{i}=\mathbf{I}_{T}-\mathbf{X}_{i}^{\prime}(\mathbf{X}_{i}\mathbf{X}_{i}^{\prime})^{-1}\mathbf{X}_{i}$ is the projection matrix, where $\mathbf{X}_{i}=(\mathbf{x}_{i1},\cdots,\mathbf{x}_{iT})$ contains $T$ samples on the $k$ regressors for the $i$ -th individual regression. Under $H_{0}$ and the SEQ-L scheme, $LM_{adj}$ was shown to be asymptotically distributed as $N(0,1).$ However, as pointed out by Pesaran et al. (2008), the $LM_{adj}$ test is not robust in panel data models with weakly exogenous regressors. Bailey et al. (2021) proposed another modified $LM$ test for heterogenous panel data models based on Random Matrix Theory, $LM_{RMT}$ , given by

[TABLE]

where

[TABLE]

with $\kappa=\frac{3T(T-k-2)}{(T+2)(T-k)}$ . Under the assumptions of normal regressors and normal errors, the authors showed that $LM_{RMT}$ is asymptotically distributed as $N(0,1)$ under the SIM-L scheme. The application scopes of the discussed tests are summarized in Table 1. (The table also contains the new tests proposed in this paper in the last two columns so-called $RLM$ and $RLM_{PE}$ , which are developed later.)

3 The RLM test and its power enhancement

3.1 The RLM test

Motivated by the existing well-known tests based on the sum of squared sample correlation coefficients in (4), (5), (7), (8) and (9), it is natural to consider the limiting behavior of $tr(\hat{\mathbf{R}}^{2})$ under the SIM-L scheme. Throughout the paper, we consider the following assumptions.

Assumption 1.

$T\rightarrow\infty,n=n(T)\rightarrow\infty$ such that $c_{T}=\frac{n}{T}\rightarrow c\in(0,\infty)$ .

Assumption 2.

For each $i$ , the errors, $\{v_{it}\}$ , are i.i.d distributed with mean [math] and variance $\sigma_{i}^{2}$ .

Assumption 3.

(i)

The errors have uniformly bounded sixth moment, i.e.

$\sup_{i,t}E|v_{it}|^{6+\epsilon}\leq C_{1}$ for some positive constant $C_{1}$ and $\epsilon>0$ . 2. (ii)

The errors have uniformly bounded eighth moment, i.e. $\sup_{i,t}E|v_{it}|^{8+\epsilon}\leq C_{2}$ for some positive constant $C_{2}$ and $\epsilon>0$ .

For a static heterogeneous panel data model, we further assume

Assumption 4.

For each $i$ , the regressors, $\mathbf{x}_{it}$ , satisfy

(i)

$E(v_{it}|\mathbf{x}_{it},\cdots,\mathbf{x}_{i1})=0$ for all $i$ and $t$ . 2. (ii)

let $\mathbf{X}_{i}=(\mathbf{x}_{i1},\dots,\mathbf{x}_{iT})$ , there exists a $k\times k$ nonrandom positive definite matrix $\mathbf{B}$ such that $\frac{1}{T}\mathbf{X}_{i}\mathbf{X}_{i}^{\prime}\stackrel{{\scriptstyle p}}{{\rightarrow}}\mathbf{B}$ . 3. (iii)

$\max_{1\leq t\leq T}\|\frac{1}{\sqrt{T}}\mathbf{x}_{it}\|\stackrel{{\scriptstyle p}}{{\rightarrow}}0$ .

Assumption 2 is standard allowing for heteroskedastic errors across units. Assumption 3 requires suitable moments of the errors for the two proposed test procedures, respectively. It helps relax the often-met normal error assumption by Random Matrix Theory. Assumption 4(i) only requires the regressors to be weakly exogenous. Assumption 4(ii) and (iii) impose mild conditions on the design matrix. We note that Assumption 4 does not impose the dependence structure between errors and regressors, which allows for the regressors to be weakly exogenous. Under these assumptions, $\sqrt{T}(\hat{\boldsymbol{\beta}}_{i}-\boldsymbol{\beta}_{i})$ is asymptotically normal according to Lai and Wei (1982).

For a dynamic heterogeneous panel data model with lagged dependent variable included in regressors, more assumptions are needed which will be discussed in Section 4.

Now we are in the position of introducing the $RLM$ test and establishing its asymptotic property in the following theorem.

Theorem 1.

Under Assumptions 1, 2, 3 $(i)$ and 4,

[TABLE]

where $\mu_{0}=n+\frac{n^{2}}{T-1}-c_{T}$ and $\sigma_{0}^{2}=4c_{T}^{2}.$

The proof of Theorem 1 is provided in the Appendix, and the method is in two stages. In the first stage, Lemma 1 establishes the Central Limit Theorem of $tr(\mathbf{R}^{2})$ with tools from Random Matrix Theory, where $\mathbf{R}=(\rho_{ij})_{1\leq i,j\leq n}$ . In the second stage, Lemma 2 shows that the asymptotic bias of $tr(\hat{\mathbf{R}}^{2})$ disappears under the SIM-L scheme with $c_{T}\rightarrow c\in(0,\infty)$ .

3.1.1 Relationship between the $RLM$ and $LM_{RMT}$ tests

By the respective definitions of the $LM_{RMT}$ and $RLM$ in (9) and Theorem 1, we have

[TABLE]

and

[TABLE]

as $\kappa=3+o(1)$ . It follows that

[TABLE]

which indicates that $LM_{RMT}$ is asymptotically equivalent to $RLM$ regardless of model specifications and assumptions. Note that the proof the asymptotic normality of $LM_{RMT}$ in Bailey et al. (2021) heavily relies on the assumptions of normal regressors and normal errors as it is used to ensure the residuals have desirable properties, and then transform the sample correlation matrix of residuals to the sample correlation matrix of a nomarlized population with unit covariance matrix (see details in Section 3.1 in Bailey et al. (2021)). From (11), we conclude that the $LM_{RMT}$ is also valid without the restrictive assumptions of normality. Besides, as we will show later, $RLM$ is also valid in both dynamic and fixed effects panel data models, which theoretically extends the application scope of $LM_{RMT}$ . This finding is consistent with the simulation findings that show such robustness of $LM_{RMT}$ in Bailey et al. (2021).

3.1.2 Relationship of the $RLM$ , $LM_{bc}$ and $CD_{LM}$ tests

By the respective definitions of the $CD_{LM}$ , $LM_{bc}$ and $RLM$ tests in (5), (7) and Theorem 1, we have the following identities

[TABLE]

and

[TABLE]

Note that the factor $\sqrt{\frac{n}{n-1}}\rightarrow 1$ and the remainder $\frac{\sqrt{n}}{2(T-1)(\sqrt{n}+\sqrt{n-1})}\rightarrow 0$ . It follows that the two tests, $RLM$ and $LM_{bc}$ , are always asymptotically equivalent, while the $CD_{LM}$ statistic has always a positive mean shift of value $\frac{n}{2(T-1)}$ .

In particular, Theorem 1 is also valid for the $LM_{bc}$ statistic. Moreover, anticipating Theorems 3 and 5 in Sections 4 and 5, for dynamic and fixed effects panel data model, respectively, these asymptotic normality are also valid for the $LM_{bc}$ statistic. In this sense, the results from the paper can also be considered as new extension of the $LM_{bc}$ test, originally developed for homogeneous fixed effects panel data model in Baltagi et al. (2012), to various large panel models with coefficient heterogeneity.

3.2 The $RLM_{PE}$ test

In the high dimensional setting, for testing the identity hypothesis $H_{0}:\mathrm{corr}(\mathbf{v}_{t})=\mathbf{I}_{n}$ , where $\mathbf{v}_{t}=(v_{1t},\dots,v_{nt})^{\prime}$ , there are mainly two types of test statistics. The majority of existing tests are based on the squared Frobenius norm $\|\mathbf{R}-\mathbf{I}_{n}\|_{F}^{2}=tr(\mathbf{R}^{2})-n=\sum_{i\neq j}\rho_{ij}^{2}.$ However, this quadratic statistic lacks power if $\mathrm{corr}(\mathbf{v}_{t})$ is a sparse matrix, see Fan et al. (2015). Considering this, tests based on the maxima of absolute values, $\max_{i<j}|\rho_{ij}|$ , which share a asymptotic type I extreme value distribution, are generally powerful under sparse alternatives. This approach has however a main drawback that such test can suffer from size distortions, which is common for statistics of the maximum type, see Liu et al. (2008). Besides, this way is not as appropriate as the Frobenius norm (sum) type in some cases. For example, consider the alternative

[TABLE]

where $\mathbf{P}_{n}$ is a perturbation matrix with diagonal entries being zero and $s$ non-zero off diagonal entries, where $s\in\{1,\dots,n^{2}-n\}$ . Intuitively, $\mathbf{P}_{n}$ can be designed as a dense matrix but with weak coefficients such that $\max_{i<j}|\rho_{ij}|<z_{\alpha}$ for any $s$ . Consequently, the extreme value type tests will fail to detect such a matrix. In such instances, the sum type tests are more suitable in the light of the fact that the eigenvalues of $\mathbf{R}$ could vary from $H_{0}$ to $H_{1}$ , which results in larger quadratic statistic value by $tr(\mathbf{R}^{2})=\sum_{i=1}^{n}\lambda_{i}^{2}({\mathbf{R}})$ . In order to realize an interpolation of the two types of statistics above, namely the maximum type and the sum type, we propose a new test statistic based on $tr(\mathbf{R}^{4})=\sum_{i=1}^{n}\lambda_{i}^{4}({\mathbf{R}})$ . The reason is that large empirical correlations, $\rho_{ij}^{4}$ , would be more emphasized in $tr(\mathbf{R}^{4})$ than in $tr(\mathbf{R}^{2})$ . To see this, consider increasingly large powers of the sample correlations, $\rho_{ij}^{m}$ , where $m$ is a positive integer. Let $E=\text{argmax}_{(i,j):\;i<j}|\rho_{ij}|$ , then

[TABLE]

where $\mathrm{card}(E)$ denotes the cardinality of the set $E$ . Therefore, the new statistic with $m=4$ can mimic some properties of the maximum type, while remaining a sum type smoothing statistic. The resulting power is expected to be higher than $tr(\mathbf{R}^{2})$ when very few sample correlations are significantly non-zero under sparse alternatives, and higher than maximum type statistics when there are many but relatively small correlations.

3.2.1 Test based on $\sum_{i\neq j}\hat{\rho}_{ij}^{4}$

On the ground of analyses above, we propose a new test statistic based on the fourth power of $\hat{\rho}_{ij}$ in the following theorem.

Theorem 2.

Under Assumptions 1, 2, 3 $(ii)$ and 4,

[TABLE]

where $\mu_{PE}=n+\frac{6n^{2}}{T-1}+\frac{6n^{3}}{(T-1)^{2}}+\frac{n^{4}}{(T-1)^{2}}-6c_{T}(1+c_{T})^{2}-2c_{T}^{2}$ and $\sigma_{PE}^{2}=8c_{T}^{2}+96c_{T}^{3}(1+c_{T})^{2}+16c_{T}^{2}(3c_{T}^{2}+8c_{T}+3)^{2}.$

Remark 1.

We choose $m=4$ to generate the $RLM_{PE}$ test for technical simplicity. In fact, one can increase $m$ to any large even integer to obtain new tests that may have larger power in the sparse correlation setting. This strategy is feasible with the proof techniques provided in Appendix. Further, by (15), it is expected that tests based on $\sum_{i\neq j}|\hat{\rho}_{ij}|^{m}$ would share similar power with maximum type statistics, which has been suggested as a powerful tests in sparse data, for example, see Cai et al. (2014). It can provide a series of potential statistics that can well control the size and may be more powerful under sparse alternatives at the same time.

Remark 2.

For the initial $RLM$ test (also the $LM_{bc}$ test), the remarkable screening technique in Fan et al. (2015) can provide an improved test that has the same asymptotic size with non-inferior asymptotic power against a broader range of alternatives. Compared to this approach, our power enhanced test avoids constructing such a “power enhancement component” by increasing the power of the sample correlation to four. However, studying the power properties of our technique with different choices of $m$ is not the main focus in this paper and remains an open problem.

The proof of the Theorem 2 is similar to that of Theorem 1, which requires the two lemmas given in Appendix.

4 Dynamic panel data model

In this section, we show that the $RLM$ and $RLM_{PE}$ tests are asymptotically valid in a dynamic panel data model, which is specified as following:

[TABLE]

for $i=1,\dots,n;t=1,\dots,T$ , where $y_{i,t-1}$ is the lagged dependent variable. Let $\mathbf{z}_{it}=(y_{i,t-1},\mathbf{x}_{it}^{\prime})$ , $\boldsymbol{\phi}_{i}=(\boldsymbol{\beta}_{i},\alpha_{i})^{\prime}$ , then (16) can be rewritten as $y_{it}=\mathbf{z}_{it}^{\prime}\boldsymbol{\phi}_{i}+v_{it}$ . We show that the proposed $RLM$ and $RLM_{PE}$ tests still have standard normal limiting distribution under the null hypothesis in the dynamic panel data model. To establish the asymptotic normality, we need additional assumptions as following,

Assumption 5.

(i)

$\{y_{it}\}_{1\leq i\leq n,1\leq t\leq T}$ is a stationary and ergodic process. 2. (ii)

Let $\boldsymbol{y}_{i}=(y_{i,0},\dots,y_{i,T-1})$ , $\frac{1}{T}\boldsymbol{y}_{i}\boldsymbol{y}_{i}^{\prime}=O_{p}(1)$ holds uniformly in $i$ .

We establish the limiting distributions of the proposed tests in the following theorems

Theorem 3.

Under Assumptions 1, 2, 3 $(i)$ , 4 and 5,

[TABLE]

Theorem 4.

Under Assumptions 1, 2, 3 $(ii)$ , 4 and 5,

[TABLE]

Under Assumption 5, the proofs of Theorems 3 and 4 follow along the same lines as that of static panel data model. See the Appendix.

5 Fixed effects panel data model

In this section, we establish the asymptotic normality of the $RLM$ and $RLM_{PE}$ tests in a fixed effects panel data model. We find that as long as the coefficient estimator is $\sqrt{T}$ -consistent, the proposed tests still have standard normal limiting distribution under the null. To allow for weakly exogenous regressors, various consistent estimators have been proposed in the literature including Chudik and Pesaran (2015), Chudik et al. (2018) etc. However, these estimators require stronger assumptions than the static panel data model. For simplicity of illustration, we focus on residuals obtained by the within estimator. The strictly exogenous assumption is then necessary for the consistency of the within estimator. One can relax this assumption to the weakly exogenous one by applying a $\sqrt{T}$ -consistent estimator.

Consider a fixed effects panel data model:

[TABLE]

for $i=1,\dots,n;t=1,\dots,T$ , where $\mu_{i}$ denotes the time-invariant individual effect. The within estimator in (17) is specified by

[TABLE]

where $\tilde{\mathbf{x}}_{it}=\mathbf{x}_{it}-\frac{1}{T}\sum_{t=1}^{T}\mathbf{x}_{it}$ and $\tilde{y}_{it}=y_{it}-\frac{1}{T}\sum_{t=1}^{T}y_{it}$ .

Assumption 6.

The regressors, $\mathbf{x}_{it}$ , satisfy

(i)

(strictly exogenous) $E(v_{it}|\mathbf{x}_{iT},\cdots,\mathbf{x}_{i1})=0$ and $E(v_{jt}|\mathbf{x}_{iT},\cdots,\mathbf{x}_{i1})=0$ for all $i,j$ and $t$ . 2. (ii)

For the demeaned regressors $\tilde{\mathbf{x}}_{it}$ , $\frac{1}{T}\sum_{t=1}^{T}\tilde{\mathbf{x}}_{it}$ and $\frac{1}{T}\sum_{t=1}^{T}\tilde{\mathbf{x}}_{it}\tilde{\mathbf{x}}_{jt}^{\prime}$ are stochastic bounded for all $i,j$ . Besides, $\lim_{(n,T)\rightarrow\infty}\frac{1}{nT}\sum_{i=1}^{n}\sum_{t=1}^{T}\tilde{\mathbf{x}}_{it}\tilde{\mathbf{x}}_{it}^{\prime}$ exists and is nonsingular.

Under the Assumptions 1, 2, 3 and 6, $\hat{\boldsymbol{\beta}}$ is $\sqrt{nT}-$ consistent. We establish the validity of our proposed tests in the following theorems.

Theorem 5.

Under Assumptions 1, 2, 3 $(i)$ and 6,

[TABLE]

Theorem 6.

Under Assumptions 1, 2, 3 $(i)$ and 6,

[TABLE]

The proofs of Theorems 5 and 6 are given in the Appendix.

6 Monte Carlo simulations

In this section, we conduct Monte Carlo simulations to examine the empirical sizes and powers of our $RLM$ and $RLM_{PE}$ tests, which are defined by (1) and (2), respectively, and compare their performances to that of the $CD_{p}$ test and the $LM_{adj}$ test defined by (6) and (8), respectively. We consider four data generating processes (DGPs): heterogeneous panel data model with either strictly or weakly exogenous regressors, fixed effects panel data model and pure dynamic panel data model.

Before looking at the simulation results, we consider the estimated rejection frequencies within range from 3.6% to 6.5% to provide evidence consistent with the robustness of the tests, following the arguments in Halunga et al. (2017). Besides, we don’t include the $LM_{RMT}$ and the $LM_{bc}$ tests since they are almost identical to the $RLM$ test by (11) and (14).

6.1 Monte Carlo design

6.1.1 DGP1: Heterogeneous panel data model with strictly exogenous regressors

We first consider the DGP used in Pesaran et al. (2008), which is specified by

[TABLE]

where $\alpha_{i}\sim IIDN(1,1)$ , $\beta_{li}\sim IIDN(1,0.04)$ . The regressors are generated as

[TABLE]

with $x_{li,-51}=0$ where $u_{lit}\sim IIDN(0,\tau_{li}^{2}/(1-0.6^{2}))$ , $\tau_{li}^{2}\sim IID\chi^{2}(6)/6$ . The first 50 observations are discarded to lessen the effects of initial values. Now we generate the disturbances under the null $H_{0}$ as $v_{it}=\sigma_{i}\epsilon_{it}$ , where $\sigma_{i}\sim\chi^{2}(2)/2$ and $\{\epsilon_{it}\}$ are generated from three different distributions: (i) normal, $N(0,1)$ , (ii) chi-squared, $(\chi^{2}(5)-5)/\sqrt{10}$ and (iii) student-t, $t_{10}/\sqrt{10/8}$ . The normalizations in (ii) and (iii) are such that errors have mean one and variance one. To investigate the effects of the number of regressors, $k=2,4$ are considered.

To examine the powers of the proposed tests, the disturbances are generated by a factor model as following:

[TABLE]

where $f_{t}(t=1,\dots,T)$ are the factors with $f_{t}\sim IIDN(0,1)$ and $\lambda_{i}(i=1,\dots,n)$ are the loadings. We consider the following three cases of loading construction:

(1)

Dense case. $\lambda_{i}\sim IIDU(-b,b)$ , for $i=1,\dots,n$ , where $b=\sqrt{3h/n}$ and $h=3.$ 2. (2)

Sparse case. $\lambda_{i}\sim IIDU(0.5,1.5)$ , for $i=1,\dots,[n^{0.3}]$ , and $\lambda_{i}=0$ , for $i=[n^{0.3}]+1,\dots,n$ , where $[n^{0.3}]$ is the integer part of $n^{0.3}$ . 3. (3)

Less-sparse case. $\lambda_{i}\sim IIDU(0.5,1.5)$ , for $i=1,\dots,[n^{0.5}]$ , and $\lambda_{i}=0$ , for $i=[n^{0.5}]+1,\dots,n$ .

In the dense case, $h$ measures the degree of cross-sectional dependence. The sparse case and the less-sparse case follow the design used in Bailey et al. (2016) to model the weak and strong cross-sectional dependence, respectively. The Monte Carlo experiments are conducted for $T=50,100,200$ , and three different choices of ratio $n/T=0.5,1,2$ basing on 2000 replications. To obtain the empirical size, the proposed $RLM$ test, $RLM_{PE}$ test and $LM_{adj}$ test are implemented at the one-sided 5% nominal significance level, while $CD_{P}$ test is conducted at the two-sided 5% nominal significance level.

6.1.2 DGP2: Heterogeneous panel data model with weakly exogenous regressors

To investigate the performances of the $RLM$ and $RLM_{PE}$ tests in panel data models with weakly exogenous regressors, we consider the following DGP:

[TABLE]

where $\alpha_{i}\sim IIDN(1,1)$ , $\beta_{li}\sim IIDN(1,0.04)$ , and

[TABLE]

with $y_{i,-51}=x_{1,i,-51}=0$ where $u_{1it},u_{2it}\sim IIDN(0,\tau_{li}^{2}/(1-0.6^{2}))$ , $\tau_{li}^{2}\sim IID\chi^{2}(6)/6$ . This set up allows for feedback from $y_{it-1}$ to the regressors, thus rendering weakly exogenous. The errors, $\{v_{it}\}$ , are generated in the same way as DGP1.

6.1.3 DGP3: Fixed effects panel model

The third DGP considered is a fixed effects panel data model with homogeneous coefficients, which is specified as

[TABLE]

where $\alpha$ and $\beta_{l}$ are set arbitrarily to 1 and $l$ , respectively, $\mu_{i}\sim IIDN(1,1)$ . The regressors and errors are generated in the same way as DGP1.

6.1.4 DGP4: Dynamic panel data model

To examine the properties of the $RLM$ and $RLM_{PE}$ tests in a dynamic panel data model, we follow the design of Pesaran et al. (2008):

[TABLE]

with $y_{i,-51}=0$ , where $\beta_{i}\sim IIDN(1,0.04)$ , and the fixed effects, $\xi_{i}$ , are drawn as $v_{i0}+\eta_{i}$ , with $\eta_{i}\sim IIDN(1,2)$ . The errors, $\{v_{it}\}$ , are generated in the same way as DGP1.

6.2 Simulation results

Table 2 reports the empirical size of these tests for the DGP1. The proposed $RLM$ and $RLM_{PE}$ tests successfully control the size under almost all settings, irrespective of number of regressors included in the panel data model444However, for small sample size with more regressors, i.e. $T=50$ and $k=4$ , the $RLM$ and $RLM_{PE}$ tests would be slightly oversized. For example, the empirical sizes of $RLM$ and $RLM_{PE}$ are 7.65 and 7.4 under normal errors, respectively. . For a fixed ratio $n/T$ , the empirical size of the $RLM$ and $RLM_{PE}$ tests converge to the nominal size of $0.05$ as $T\rightarrow\infty$ , that authenticates the asymptotic normality of the tests under the SIM-L scheme. Besides, the performance of $RLM$ are almost identical to $LM_{adj}$ . The $CD_{P}$ test has correct size in all cases.

Table 3 demonstrates the empirical power of these tests under the alternative with dense factors. The $RLM$ test has comparable power to $LM_{adj}$ regardless of $(n,T)$ combinations and error distributions. In contrast, the $CD_{P}$ test suffers from little power by construction, where mean of factor loading is close to zero as mentioned by Pesaran et al. (2008). The power enhancement version of $RLM$ , the $RLM_{PE}$ test, outperforms others across the board, especially when $n/T=2$ . For example, the power of the $RLM_{PE}$ test is 78.25% for $T=100,n=200,k=4,h=3$ and student-t errors, whereas the power of the $RLM$ and $LM_{adj}$ tests are 60.1% and 56.75%, respectively. It improves the power by up to around 30%. Besides, the power of the $RLM_{PE}$ test is 69.2% for $T=100,n=200,k=2,h=3$ and chi-square errors, and the power of the $RLM$ and $LM_{adj}$ tests are 51.2% and 51.1%, respectively. These results indicate that the $RLM_{PE}$ test successfully boost the power.

The empirical power of these tests under the alternative with sparse and less sparse factors are summarized in Tables 4 and 5, respectively. The $RLM$ test again has similar performance to $LM_{adj}$ . The empirical power of $RLM_{PE}$ show that it performs the best among those tests. The power of the $CD_{P}$ test floats around 5% throughout as in Pesaran (2015).

For the heterogeneous panel data model with weakly exogenous regressors, Table 6 shows that $LM_{adj}$ becomes considerably oversized, especially for the case $n/T=2$ , where it has size around 28%. This shows that the $LM_{adj}$ test is not robust to weakly exogenous regressors, which is also observed in Bailey et al. (2021). However, our proposed $RLM$ test and $RLM_{PE}$ tests control the size well, which are not sensitive to the strictly exogenous assumptions on regressors. Therefore, though it is widely reported that the $LM_{adj}$ test has generally satisfying empirical performances regardless of restrictive strictly exogenous regressors and normal errors assumptions, the present DGP2 is indeed a rare situation where the $LM_{adj}$ test is outperformed by others.

Table 7 reports the size of the tests for the fixed effects panel data model. It shows that the proposed $RLM$ and $RLM_{PE}$ tests have the correct size, close to the 5% nominal significance level, for example, $RLM$ has 5.1% and 5% size results, respectively for $T=50,n=25$ under normal errors and for $T=100,n=200$ under chi-squared errors. Similar results for $RLM_{PE}$ can be also observed in this table. Pesaran’s $LM_{adj}$ and $CD_{P}$ tests have correct size in this setting as in Pesaran (2004) and Pesaran et al. (2008).

Finally, Table 8 gives the empirical size of these tests for dynamic panel data model. It shows that the proposed $RLM$ and $RLM_{PE}$ tests have the correct size, e.g. 5.15% for $n=100,T=100$ with chi-squared error, which is comparable to the $LM_{adj}$ test. The $CD_{P}$ always has correct size as in Pesaran (2004). The results of empirical power for DGP2, DGP3 and DGP4 are similar to those of DGP1, so we omit it here.

Based on these findings, the $RLM_{PE}$ test is strongly recommended for practitioners if it is not clear whether weakly exogenous regressors are present or not, given its universally correct size and better power performances. Instead, when regressors are believed to be strictly exogenous, then the $RLM_{PE}$ test, a easily implemented and computationally cheap procedure, is preferred for large panels ( $T\geq 50$ ). For $T\leq 50$ , $RLM_{PE}$ is still applicable though it might be slightly oversized, or the $LM_{adj}$ test is a suggested method at the risk of intensive computation.

7 Conclusion

This paper has developed a Lagrange multiplier type test for the null hypothesis of no cross-sectional dependence in large panel models. The procedure can be applied to a wide class of linear panel data models and shows robustness to quite general forms of non-normality in the disturbance distribution. We further proposed a power enhancement version of the $LM$ type test based on the fourth moment of the sample correlations obtained from residuals to boost power under sparse alternatives, which only requires existence of higher moment but still shares such robustness. The simulations illustrate that this test has satisfactory power under the sparse alternatives of weak cross-sectional dependence, and both of the tests successfully control the size in different data generating processes.

For future work, it is interesting to explore theoretically the power properties of $RLM_{PE}$ and the optimal $m$ that would maximise power. Also, it would be of interest to investigate the performance of the $RLM$ and $RLM_{PE}$ tests in the weakly cross-sectional dependence framework. In addition, testing the null hypothesis with no cross-sectional dependence when errors are serial dependent will also be studied.

Appendix

This appendix includes the proofs of the following lemmas:

Lemma 1.

Under Assumptions 1, 2 and 3 $(i)$ ,

[TABLE]

Lemma 2.

Under Assumptions 1, 2, 3 $(i)$ and 4,

[TABLE]

Lemma 3.

Under Assumptions 1, 2 and 3 $(ii)$ ,

[TABLE]

Lemma 4.

Under Assumptions 1, 2, 3 $(ii)$ and 4,

[TABLE]

In the static heterogeneous panel data model, $\hat{\boldsymbol{\beta}}_{i}=\left(\boldsymbol{X}_{i}\boldsymbol{X}_{i}^{\prime}\right)^{-1}\boldsymbol{X}_{i}\boldsymbol{y}_{i}$ is the OLS estimator and the residuals are given by $\hat{v}_{it}=v_{it}-\mathbf{x}_{it}^{\prime}\left(\hat{\boldsymbol{\beta}}_{i}-\boldsymbol{\beta}_{i}\right)$ . Let $\mathbf{v}_{i}=(v_{i1},\dots,v_{iT})^{\prime}$ , $\hat{\mathbf{v}}_{i}=(\hat{v}_{i1},\dots,\hat{v}_{iT})^{\prime}$ for $i=1,\dots,n$ , consequently, $\hat{\mathbf{v}}_{i}=\mathbf{v}_{i}-\boldsymbol{X}_{i}^{\prime}\left(\hat{\boldsymbol{\beta}}_{i}-\boldsymbol{\beta}_{i}\right)$ . Define $\mathbf{\hat{V}}=\begin{pmatrix}\hat{\mathbf{v}}_{1},\cdots,\hat{\mathbf{v}}_{n}\end{pmatrix}$ , $\mathbf{w}_{i}=\boldsymbol{X}_{i}^{\prime}\left(\hat{\boldsymbol{\beta}}_{i}-\boldsymbol{\beta}_{i}\right)$ and $\mathbf{W}=\begin{pmatrix}\mathbf{w}_{1},\cdots,\mathbf{w}_{n}\end{pmatrix}$ . Using this notation, $\hat{\mathbf{v}}_{i}=\mathbf{v}_{i}-\mathbf{w}_{i}$ , $\mathbf{\hat{V}}=\mathbf{V}-\mathbf{W}$ and the sample covariance matrices can be written as $\mathbf{S}_{T}=\frac{1}{T}\mathbf{\hat{V}}^{\prime}\mathbf{\hat{V}}$ , $\hat{\mathbf{S}}_{T}=\frac{1}{T}\mathbf{V}^{\prime}\mathbf{V}$ with elements $S_{T,i,j}=\frac{1}{T}\sum\limits_{t=1}^{T}v_{it}v_{jt}$ , $\hat{S}_{T,i,j}=\frac{1}{T}\sum\limits_{t=1}^{T}\hat{v}_{it}\hat{v}_{jt},$ respectively.

To accomplish the proof of results above, several lemmas are introduced as following.

Lemma 5.

(Theorem 13 of Chapter 13, Petrov (1975)) Let $Y_{1},\cdots,Y_{n}$ be independent and identically distributed random variables, such that $E(Y_{1})=0$ , $E(Y_{1})^{2}=1$ and $E|Y_{1}|^{r}<\infty$ for some $r\geq 3$ . Then

[TABLE]

for all $y$ , where $\Phi(\cdot)$ is the cumulative distribution function of standard normal random variable and $C(r)$ is a positive constant depending only on $r$ .

Lemma 6.

Let $\{Y_{ij}\}_{i\geq 1,j\geq 1}$ be an array of independent and identically distributed random variables such that $E(Y_{11})=0$ , $E(Y_{11})^{2}=1$ and $E|Y_{11}|^{r}<\infty$ for some $r\geq 3$ . Let $X_{in}=\frac{1}{\sqrt{n}}\sum_{j=1}^{n}Y_{ij}$ , then for any $\epsilon>0$ , we have

[TABLE]

Proof.

For some $\alpha>0,$ let $c=\frac{1}{2r}+\epsilon$ , we have

[TABLE]

where the first inequality follows by Lemma 5, and the first approximation follows by the fact that $\Phi(x)\sim 1-\frac{1}{\sqrt{2\pi}x}e^{-\frac{x^{2}}{2}}$ for large $x$ . Therefore, $\max_{1\leq i\leq n}|X_{in}|=O_{p}(n^{c})$ holds. ∎

Remark 3.

For the panel data model, if the errors $\{v_{it}\}_{1\leq i\leq n,1\leq t\leq T}$ satisfy the conditions in lemma 5 and $X_{iT}=\frac{1}{\sqrt{T}}\sum_{t=1}^{T}v_{it}$ , then for any any $\epsilon>0$ , the estimate $\max_{1\leq i\leq n}|X_{iT}|=O_{p}(n^{\frac{1}{2r}+\epsilon})$ still holds once we further assume that $K_{1}\leq\frac{n}{T}\leq K_{2}$ for some positive constants $K_{1}$ and $K_{2}$ . It holds naturally since $\frac{n}{T}\rightarrow c>0$ in the SIM-L scheme.

Lemma 7.

(Li et al. (2012)) Suppose $E|v_{it}|^{6}<\infty$ , then

[TABLE]

when $\frac{n}{T}\rightarrow c>0$ .

Lemma 8.

Under Assumptions 1, 2, 3 and 4, for any $\epsilon>0$ and some integer $r_{1}\geq 3$ , i.e. $E|v_{it}|^{r_{1}}<\infty$

(a)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\mathbf{v}_{i},\mathbf{w}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ . 2. (b)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\mathbf{w}_{i},\mathbf{w}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ . 3. (c)

$\displaystyle\max_{1\leq i\leq n}|S_{T,i,i}-\sigma^{2}|=O_{p}(n^{\frac{1}{r_{1}}+\epsilon_{2}-\frac{1}{2}})$ . 4. (d)

$\displaystyle\max_{1\leq i\neq j\leq n}|S_{T,i,j}|=O_{p}(n^{\frac{1}{r_{1}}+\epsilon_{2}-\frac{1}{2}})$ . 5. (e)

$\displaystyle\max_{1\leq i\leq n}|\hat{S}_{T,i,i}-S_{T,i,i}|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}-1}).$ ** 6. (f)

$\displaystyle\max_{1\leq i\leq n}|\hat{S}_{T,i,i}-\sigma^{2}|=O_{p}(n^{\frac{1}{r_{1}}+\epsilon_{2}-\frac{1}{2}})$ .

Proof.

(a). Firstly, we consider the case $i=j$ . By Assumption 2, we obtain $\boldsymbol{\xi}_{i}=\left(\boldsymbol{X}_{i}\boldsymbol{X}_{i}^{\prime}\right)^{-\frac{1}{2}}\sum\limits_{t}\mathbf{x}_{it}v_{it}\stackrel{{\scriptstyle d}}{{\longrightarrow}}N_{k}\left(0,\sigma_{i}^{2}\mathbf{I}_{k}\right).$ Therefore, by Lemma 6 for some $r_{1}\geq 3$ and for any $\epsilon_{1}>0$ ,

[TABLE]

Consequently,

[TABLE]

The calculations for $i\neq j$ case is similar so we omit it here.

(b). We have

[TABLE]

(c). By CLT, $Z_{i}\stackrel{{\scriptstyle\Delta}}{{=}}\sqrt{T}(S_{T,i,i}-\sigma_{i}^{2})\stackrel{{\scriptstyle d}}{{\longrightarrow}}N(0,\tau_{i}^{2})$ where $\sigma^{2}=E(v_{it}^{2})$ and $\tau_{i}^{2}=\text{var}(v_{it}^{2})$ , then $\max\limits_{1\leq i\leq n}|Z_{i}|=O_{p}(n^{\frac{1}{2r_{2}}+\epsilon_{2}})$ for some $r_{2}\geq 3$ and for any $\epsilon_{2}>0$ by Lemma 6. It can be easily found that $r_{1}=2r_{2}$ , therefore,

[TABLE]

(d). Note that $E(v_{it}v_{jt})=0$ for $i\neq j$ , it follows along the same lines as that of (c).

(e). By (a) and (b), we have

[TABLE]

(f). The conclusion holds from (c) and (e).

∎

Proof of Lemma 1

Proof.

By the Theorem 3.1 of Yin et al. (2021), there exist constants $\mu_{center}$ , $\mu_{limit}$ and $\sigma_{0}>0$ such that:

[TABLE]

Applying the results in Example 3.2 of Yin et al. (2021) with $g_{l}=x^{2}$ , we obtain $\mu_{center}=n(1+\frac{n}{T-1})$ . For the case $g_{l}=x^{2}$ and $\mathbf{R}=\mathbf{I}_{n}$ , the results in Example 3.3 of Yin et al. (2021) shows that $\mu_{limit}=-c$ and $\sigma_{0}^{\prime}=2c$ . Finally, substituting $c$ with $c_{T}$ by Slutsky’s theorem completes the proof. ∎

Proof of Lemma 2

Proof.

By direct calculation we have

[TABLE]

where constant $0<\alpha_{1}<1$ . Therefore, we aim to show that:

(i)

$A_{1}\stackrel{{\scriptstyle\Delta}}{{=}}\left|\frac{1}{T^{\alpha_{1}}}\sum\limits_{1\leq i\neq j\leq n}\rho_{ij}^{4}\right|=o_{p}(1)$ , 2. (ii)

$A_{2}\stackrel{{\scriptstyle\Delta}}{{=}}\left|T^{\alpha_{1}}\sum\limits_{1\leq i\neq j\leq n}\left(\frac{S_{T,i,j}^{2}\hat{S}_{T,i,i}\hat{S}_{T,j,j}-\hat{S}_{T,i,j}^{2}S_{T,i,i}S_{T,j,j}}{S_{T,i,j}^{2}\hat{S}_{T,i,i}\hat{S}_{T,j,j}}\right)^{2}\right|=o_{p}(1)$ .

(i) By lemma 7, we have

[TABLE]

Therefore, $A_{1}=o_{p}(1)$ holds.

(ii) By direct calculation we have

[TABLE]

Define $RHS\stackrel{{\scriptstyle\Delta}}{{=}}T^{{\alpha_{1}-8}}\sum\limits_{1\leq i\neq j\leq n}\frac{1}{S_{T,i,j}^{4}\hat{S}_{T,i,i}^{2}\hat{S}_{T,j,j}^{2}}|\sum\limits_{m=1}^{17}A_{2,m}|^{2},$ then

[TABLE]

where $\tau_{m_{1},m_{2}}$ is a constant only depending on $m_{1}$ and $m_{2}$ .

Therefore, if $T^{\alpha_{1}-8}\sum\limits_{1\leq i\neq j\leq n}\frac{|A_{2,m_{1}}A_{2,m_{2}}|}{S_{T,i,j}^{4}\hat{S}_{T,i,i}^{2}\hat{S}_{T,j,j}^{2}}=o_{p}(1)$ for any $1\leq m_{1},m_{2}\leq 17$ holds, then we can conclude that $A_{2}=o_{p}(1)$ . Further, we only need to consider the case when $m_{1}=m_{2}$ by equality $2\cdot|A_{2,m_{1}}A_{2,m_{2}}|\leq|A_{2,m_{1}}|^{2}+|A_{2,m_{2}}|^{2}$ , i.e. if we can show $T^{\alpha_{1}-8}\sum\limits_{1\leq i\neq j\leq n}\frac{|A_{2,m}|^{2}}{S_{T,i,j}^{4}\hat{S}_{T,i,i}^{2}\hat{S}_{T,j,j}^{2}}=o_{p}(1)$ for any $1\leq m\leq 17$ , then $A_{2}=o_{p}(1)$ immediately holds. By Lemma 8, we show that

[TABLE]

By similar calculations, we conclude that

[TABLE]

Therefore, we can conclude that $A_{2}=o_{p}(1)$ . ∎

Proof of Lemma 3

Proof.

By the Theorem 3.2 of Yin et al. (2021), there exist constants $\mu_{center,4}$ , $\mu_{limit,4}$ and $\sigma_{PE}>0$ such that:

[TABLE]

Applying the results in Example 3.2 of Yin et al. (2021) with $g(x)=x^{4}$ , we obtain $\mu_{center,4}=n+\frac{6n^{2}}{T-1}+\frac{6n^{3}}{(T-1)^{2}}+\frac{n^{4}}{(T-1)^{2}}$ . For the case $g(x)=x^{4}$ and $\mathbf{R}=\mathbf{I}_{n}$ , the results in Example 3.3 of Yin et al. (2021) shows that $\mu_{limit}=-6c(1+c)^{2}-2c^{2}$ and $\sigma_{PE,0}^{2}=8c^{2}+96c^{3}(1+c)^{2}+16c^{2}(3c^{2}+8c+3)^{2}$ . Finally, substituting $c$ with $c_{T}$ by Slutsky’s theorem completes the proof. ∎

Proof of Lemma 4

Proof.

It is easy to verify that

[TABLE]

where $1\leq i,j,l,s\leq n$ . We only aim to show that

(i)

$B\stackrel{{\scriptstyle\Delta}}{{=}}\left|\sum\limits_{i\neq j\neq l}\left(\hat{\rho}_{ij}^{2}\hat{\rho}_{jl}^{2}-\rho_{ij}^{2}\rho_{jl}^{2}\right)\right|=o_{p}(1)$ , 2. (ii)

$C\stackrel{{\scriptstyle\Delta}}{{=}}\left|\sum\limits_{i\neq j}\left(\hat{\rho}_{ij}^{4}-\rho_{ij}^{4}\right)\right|=o_{p}(1)$ , 3. (iii)

$D\stackrel{{\scriptstyle\Delta}}{{=}}\left|\sum\limits_{i\neq j\neq l}\left(\hat{\rho}_{ij}\hat{\rho}_{jl}\hat{\rho}_{il}-\rho_{ij}\rho_{jl}\rho_{il}\right)\right|=o_{p}(1)$ , 4. (iv)

$E\stackrel{{\scriptstyle\Delta}}{{=}}\left|\sum\limits_{i\neq j\neq l\neq s}\left(\hat{\rho}_{ij}\hat{\rho}_{jl}\hat{\rho}_{ls}\hat{\rho}_{si}-\rho_{ij}\rho_{jl}\rho_{ls}\rho_{si}\right)\right|=o_{p}(1)$

since we have $\left|\sum\limits_{i\neq j}\left(\hat{\rho}_{ij}^{2}-\rho_{ij}^{2}\right)\right|=o_{p}(1)$ by Lemma 2.

(i) By direct calculation we have

[TABLE]

where constant $0<\alpha_{2}<1$ . We show that:

(i.1)

[TABLE] 2. (i.2)

[TABLE]

(i.1) By lemma 7, we have

[TABLE]

Therefore, $B_{1}=o_{p}(1)$ holds.

(i.2) By direct calculation, we have

[TABLE]

where

[TABLE]

Consequently,

[TABLE]

where $\eta_{m_{1},m_{2}}$ is a constant only depending on $m_{1}$ and $m_{2}$ . By the same arguments in the proof of Lemma 2, we only need to show that

[TABLE]

for any $1\leq m\leq 153$ . By Lemma 8, for $1\leq m\leq 53$ , one can easily show that the stochastic order dominating terms are

[TABLE]

which have the same order $O_{p}\Big{(}n^{\frac{10}{r_{1}}+4\epsilon_{1}+8\epsilon_{2}+\alpha_{2}-3}\Big{)}=o_{p}(1)$ . For $54\leq m\leq 153$ , stochastic order dominating terms have the same order of

[TABLE]

whose order are $O_{p}(n^{\frac{8}{r_{1}}+8\epsilon_{1}+6\epsilon_{2}+\alpha_{2}-1})=o_{p}(1)$ . Therefore, we can conclude that $B_{2}=o_{p}(1)$ .

(ii) By direct calculation we have

[TABLE]

where constant $0<\alpha_{3}<1$ . We show that:

(ii.1)

[TABLE] 2. (ii.2)

[TABLE]

(ii.1) By lemma 7, we have

[TABLE]

Therefore, $C_{1}=o_{p}(1)$ holds.

(ii.2) By direct calculation, we have

[TABLE]

where

[TABLE]

Consequently,

[TABLE]

where $\delta_{m_{1},m_{2}}$ is a constant only depending on $m_{1}$ and $m_{2}$ . By the same arguments in the proof of Lemma 2, we only need to show that

[TABLE]

for any $1\leq m\leq 68$ . By Lemma 8, for $1\leq m\leq 32$ , one can show that the stochastic order dominating terms are

[TABLE]

and

[TABLE]

which have the same order $O_{p}(n^{\frac{10}{r_{1}}+4\epsilon_{1}+8\epsilon_{2}+\alpha_{3}-4})=o_{p}(1)$ . For $33\leq m\leq 68$ , stochastic order dominating terms have the same order of

[TABLE]

whose order is $O_{p}(n^{\frac{8}{r_{1}}+8\epsilon_{1}+6\epsilon_{2}+\alpha_{2}-1})=o_{p}(1)$ . Therefore, we can conclude that $C_{2}=o_{p}(1)$ .

**(iii)**By direct calculation, we have

[TABLE]

For constant $0<\alpha_{4}<1$ , we show that:

(iii.1)

[TABLE] 2. (iii.2)

[TABLE]

(iii.1) By lemma 7, we have

[TABLE]

Therefore, $F_{1}=o_{p}(1)$ holds.

(iii.2) By direct calculation, we have

[TABLE]

where

[TABLE]

Thus

[TABLE]

where $\xi_{m_{1},m_{2}}$ is a constant only depending on $m_{1}$ and $m_{2}$ . By the same arguments in the proof of Lemma 2, we only need to show that

[TABLE]

for any $1\leq m\leq 80$ . By Lemma 8, for $1\leq m\leq 63$ , one can show that the stochastic order dominating terms have the same order of

[TABLE]

whose orders are $O_{p}(n^{\frac{6}{r_{1}}+4\epsilon_{1}+4\epsilon_{2}+\alpha_{4}-1})=o_{p}(1)$ . For $64\leq m\leq 80$ , the stochastic order dominating terms have the same order of

[TABLE]

whose orders are $O_{p}(n^{\frac{8}{r_{1}}+4\epsilon_{1}+6\epsilon_{2}+\alpha_{4}-2})=o_{p}(1)$ . Therefore, we can conclude that $D_{2}=o_{p}(1).$

(iv) By direct calculation, we have

[TABLE]

For constant $0<\alpha_{5}<1$ , we show that:

(iv.1)

[TABLE] 2. (iv.2)

[TABLE]

(iv.1) By lemma 7, we have

[TABLE]

Therefore, $E_{1}=o_{p}(1)$ holds.

(ii.2) By direct calculation, we have

[TABLE]

where

[TABLE]

Thus

[TABLE]

where $\lambda_{m_{1},m_{2}}$ is a constant only depending on $m_{1}$ and $m_{2}$ . By the same arguments in the proof of Lemma 2, we only need to show that

[TABLE]

for any $1\leq m\leq 336$ . By Lemma 8, for $1\leq m\leq 255$ , stochastic order dominating terms have the same order of

[TABLE]

whose orders are $O_{p}(n^{\frac{8}{r_{1}}+4\epsilon_{1}+6\epsilon_{2}+\alpha_{5}-1})=o_{p}(1)$ . For $256\leq m\leq 336$ , one can show that the stochastic order dominating terms have the same order of

[TABLE]

whose orders are $O_{p}(n^{\frac{10}{r_{1}}+4\epsilon_{1}+8\epsilon_{2}+\alpha_{5}-2})=o_{p}(1)$ . Therefore, we can conclude that $E_{2}=o_{p}(1)$ . Finally, proof of proposition 2 is completed. ∎

Proof of Theorem 3 and 4

For the dynamic panel data model, let $\boldsymbol{Z}_{i}=(\mathbf{z}_{i1},\dots,\mathbf{z}_{iT})$ , then $\hat{\boldsymbol{\phi}}_{i}=\left(\boldsymbol{Z}_{i}\boldsymbol{Z}_{i}^{\prime}\right)^{-1}\boldsymbol{Z}_{i}\boldsymbol{y}_{i}$ is the OLS estimator and the residuals are given by $\hat{\hat{v}}_{it}=v_{it}-\mathbf{z}_{it}^{\prime}\left(\hat{\boldsymbol{\phi}}_{i}-\boldsymbol{\phi}_{i}\right)$ . In vector form, $\hat{\mathbf{v}}_{i}=\mathbf{v}_{i}-\boldsymbol{Z}_{i}^{\prime}\left(\hat{\boldsymbol{\phi}}_{i}-\boldsymbol{\phi}_{i}\right)$ . Define $\hat{\hat{\mathbf{V}}}=\begin{pmatrix}\hat{\mathbf{v}}_{1},\cdots,\hat{\mathbf{v}}_{n}\end{pmatrix}$ , $\hat{\hat{\mathbf{w}}}_{i}=\boldsymbol{Z}_{i}^{\prime}\left(\hat{\boldsymbol{\phi}}_{i}-\boldsymbol{\phi}_{i}\right)$ and $\hat{\hat{\mathbf{W}}}=\begin{pmatrix}\hat{\hat{\mathbf{w}}}_{1},\cdots,\hat{\hat{\mathbf{w}}}_{n}\end{pmatrix}$ . Using this notation, $\hat{\mathbf{v}}_{i}=\mathbf{v}_{i}-\hat{\hat{\mathbf{w}}}_{i}$ . Replacing $\hat{\boldsymbol{\beta}}_{i}$ and $\boldsymbol{X}_{i}$ with $\hat{\boldsymbol{\phi}}_{i}$ and $\boldsymbol{Z}_{i}$ , respectively, the proofs of Theorem 3 and 4 follow along the same arguments above, that is, we only need to verify that (a) and (b) in Lemma 8 still hold for the dynamic panel data model.

Lemma 9.

Under Assumptions 1, 2, 3, 4 and 5, for any $\epsilon>0$ and some integer $r_{1}\geq 3$ , i.e. $E|v_{it}|^{r_{1}}<\infty$

(a)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\mathbf{v}_{i},\hat{\hat{\mathbf{w}}}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ . 2. (b)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\hat{\hat{\mathbf{w}}}_{i},\hat{\hat{\mathbf{w}}}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ .

Proof.

(a). Firstly, for the case $i=j$

[TABLE]

We have $\max_{1\leq i\leq n}\|\hat{\boldsymbol{\phi}}_{i}-\boldsymbol{\phi}_{i}\|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-\frac{1}{2}})$ for some integer $r_{3}>3$ and $\epsilon_{3}>0$ by Lemma 8, then

[TABLE]

For $\max_{1\leq i\leq n}\|\mathbf{v}_{i}^{\prime}\boldsymbol{X}_{i}^{\prime}\|$ :

[TABLE]

For $\max_{1\leq i\leq n}\|\mathbf{v}_{i}^{\prime}\boldsymbol{y}_{i}^{\prime}\|$ : Applying martingale theory, we show that $\mathbf{v}_{i}^{\prime}\boldsymbol{y}_{i}^{\prime}=\sum_{t=1}^{T}y_{it-1}v_{it}$ converged to a centered normal distribution. Let $\mathcal{S}_{T}=\sum_{t=1}^{T}y_{it-1}v_{it}$ , we aim to verify conditions A1 and A2 imposed in Corollary 2.1.10 of Duflo (2013). Firstly, $\langle M\rangle_{T}\stackrel{{\scriptstyle\Delta}}{{=}}\sum_{t=1}^{T}E\left((\mathcal{S}_{t}-\mathcal{S}_{t-1})^{2}|\mathcal{F}_{t-1}\right)=\sum_{t=1}^{T}E\left(y_{it-1}^{2}v_{it}^{2}|\mathcal{F}_{t-1}\right)=\sigma^{2}\sum_{t=1}^{T}y_{it-1}^{2},$ where $\mathcal{F}_{t-1}$ is the corresponding filtration. Therefore, $\langle M\rangle_{T}/T\stackrel{{\scriptstyle p}}{{\longrightarrow}}\sigma^{2}E(y_{i,0}^{2})$ under Assumption 5(i), so that A1 holds. Note that the Lyapunov condition

[TABLE]

holds under Assumption 5(i), which indicates that A2 holds as well. The assertion follows from Corollary 2.1.10 of Duflo (2013), so that $\max_{1\leq i\leq n}\|\mathbf{v}_{i}^{\prime}\boldsymbol{y}_{i}^{\prime}\|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}})$ by Lemma 8. Consequently,

[TABLE]

The case $i\neq j$ is similar.

(b)

[TABLE]

Under Assumption 4(ii) and 5(ii), $\max_{i}\|\boldsymbol{X}_{i}\|\leq\max_{i}\|\boldsymbol{X}_{i}\boldsymbol{X}_{i}^{\prime}\|^{\frac{1}{2}}=O_{p}(\sqrt{T})$ and $\max_{i}\|\boldsymbol{y}_{i}\|\leq\max_{i}\|\boldsymbol{y}_{i}\boldsymbol{y}_{i}^{\prime}\|^{\frac{1}{2}}=O_{p}(\sqrt{T})$ , so that $\max_{1\leq i,j\leq n}|\langle\hat{\hat{\mathbf{w}}}_{i},\hat{\hat{\mathbf{w}}}_{j}\rangle|\leq\left(O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-\frac{1}{2}})\cdot O_{p}(\sqrt{T})\right)^{2}=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}}).$ ∎

Proofs of Theorem 5 and 6

For the fixed effects panel data model, $\hat{\boldsymbol{\beta}}$ is the within estimator and the within residuals are given by $\hat{v}_{it}^{fixed}=\tilde{y}_{it}-\tilde{\mathbf{x}}_{it}^{\prime}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})$ . Let $\bar{\mathbf{x}}_{i\cdot}=\frac{1}{T}\sum_{t=1}^{T}\mathbf{x}_{it}$ , $\bar{y}_{i\cdot}=\frac{1}{T}\sum_{t=1}^{T}y_{it}$ , $\bar{v}_{i\cdot}=\frac{1}{T}\sum_{t=1}^{T}v_{it}$ and $\tilde{v}_{it}=v_{it}-\bar{v}_{i\cdot}$ . Define $\tilde{\mathbf{v}}_{i}=(\tilde{v}_{i1},\dots,\tilde{v}_{iT})^{\prime}$ , $\bar{\mathbf{v}}_{i}=(\bar{v}_{i\cdot},\dots,\bar{v}_{i\cdot})^{\prime}$ , $\tilde{\boldsymbol{X}}_{i}=(\tilde{\mathbf{x}}_{i1},\dots,\tilde{\mathbf{x}}_{iT})^{\prime}$ and $\bar{\boldsymbol{X}}_{i}=(\bar{\mathbf{x}}_{i\cdot},\dots,\bar{\mathbf{x}}_{i\cdot})^{\prime}$ . Let $\tilde{\mathbf{w}}_{i}=\tilde{\boldsymbol{X}}_{i}^{\prime}\left(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta}\right)$ and $\tilde{\mathbf{W}}=\begin{pmatrix}\tilde{\mathbf{w}}_{1},\cdots,\tilde{\mathbf{w}}_{n}\end{pmatrix}$ . Again, it suffices to verify that Lemma 8 (a) and (b) still hold for the fixed effect panel data model.

Lemma 10.

Under Assumptions 1, 2,3 and 4, for any $\epsilon>0$ and some integer $r_{1}\geq 3$ , i.e. $E|v_{it}|^{r_{1}}<\infty$

(a)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\tilde{\mathbf{v}}_{i},\tilde{\mathbf{w}}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ . 2. (b)

$\displaystyle\max_{1\leq i,j\leq n}|\langle\tilde{\mathbf{w}}_{i},\tilde{\mathbf{w}}_{j}\rangle|=O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ .

Proof.

(a) When $i=j$ , by $\tilde{\mathbf{v}}_{i}=\mathbf{v}_{i}-\bar{\mathbf{v}}_{i}$ and $\tilde{\boldsymbol{X}}_{i}=\boldsymbol{X}_{i}-\bar{\boldsymbol{X}}_{i}$

[TABLE]

For $\max_{1\leq i\leq n}|\mathbf{v}_{i}^{\prime}\boldsymbol{X}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|:$

[TABLE]

For $\max_{1\leq i\leq n}|\bar{\mathbf{v}}_{i}^{\prime}\boldsymbol{X}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|:$

[TABLE]

uniformly in $i$ since $\frac{1}{T}\sum_{t=1}^{T}x_{it}=O_{p}(1)$ holds uniformly by Assumption 3 and by Assumption 4 and Lemma 8, we have $\max_{1\leq i\leq n}|\frac{1}{T}\sum_{t=1}^{T}v_{it}|=O_{p}(n^{{2r_{1}+\epsilon_{1}-\frac{1}{2}}})$ . Therefore $\max_{1\leq i\leq n}|\bar{\mathbf{v}}_{i}^{\prime}\boldsymbol{X}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-1/2})$ . Using same techniques, $\max_{1\leq i\leq n}|\mathbf{v}_{i}^{\prime}\bar{\boldsymbol{X}}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-1/2})$ and $\max_{1\leq i\leq n}|\bar{\mathbf{v}}_{i}^{\prime}\bar{\boldsymbol{X}}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-1/2})$ , so that $\max_{1\leq i\leq n}|\mathbf{v}_{i}^{\prime}\boldsymbol{X}_{i}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|=O_{p}(n^{\frac{1}{2r_{1}}+\epsilon_{1}-1/2})\leq O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}})$ .

The calculations for $i\neq j$ case is similar.

(b)

[TABLE]

For $\max_{1\leq i\leq n}|(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\boldsymbol{X}_{i}^{\prime}\boldsymbol{X}_{j}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|:$

[TABLE]

Lastly, $|(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\bar{\boldsymbol{X}}_{i}^{\prime}\boldsymbol{X}_{j}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|$ , $|(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\boldsymbol{X}_{i}^{\prime}\bar{\boldsymbol{X}}_{j}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|$ and $|(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\bar{\boldsymbol{X}}_{i}^{\prime}\bar{\boldsymbol{X}}_{j}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})|$ are all $O_{p}(n^{-1})$ by Assumption 4. Therefore, $\max_{1\leq i,j\leq n}|\langle\tilde{W}_{i},\tilde{W}_{j}\rangle|=O_{p}(n^{-1})\leq O_{p}(n^{\frac{1}{r_{1}}+2\epsilon_{1}}).$

∎

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Bailey et al. (2021) Bailey, N., J. Dandan, and J. Yao (2021). A lagrange-multiplier test for large heterogeneous panel data models. Available at SSRN 3804164 .
2Bailey et al. (2016) Bailey, N., G. Kapetanios, and M. H. Pesaran (2016). Exponent of cross-sectional dependence: Estimation and inference. Journal of Applied Econometrics 31 (6), 929–960.
3Baltagi et al. (2008) Baltagi, B. H., G. Bresson, and A. Pirotte (2008). To pool or not to pool? In The Econometrics of Panel Data , pp. 517–546. Springer.
4Baltagi et al. (2012) Baltagi, B. H., Q. Feng, and C. Kao (2012). A lagrange multiplier test for cross-sectional dependence in a fixed effects panel data model. Journal of Econometrics 170 (1), 164–177.
5Breusch and Pagan (1980) Breusch, T. S. and A. R. Pagan (1980). The lagrange multiplier test and its applications to model specification in econometrics. The Review of Economic Studies 47 (1), 239–253.
6Cai et al. (2011) Cai, T. T., T. Jiang, et al. (2011). Limiting laws of coherence of random matrices with applications to testing covariance structure and construction of compressed sensing matrices. The Annals of Statistics 39 (3), 1496–1525.
7Cai et al. (2014) Cai, T. T., W. Liu, and Y. Xia (2014). Two-sample test of high dimensional means under dependence. Journal of the Royal Statistical Society: Series B: Statistical Methodology , 349–372.
8Chudik and Pesaran (2015) Chudik, A. and M. H. Pesaran (2015). Common correlated effects estimation of heterogeneous dynamic panel data models with weakly exogenous regressors. Journal of econometrics 188 (2), 393–420.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Unified and robust Lagrange multiplier type tests for cross-sectional independence in large panel data models

Abstract

1 Introduction

2 Existing tests for cross-sectional dependence based on sample correlation

3 The RLM test and its power enhancement

3.1 The RLM test

Assumption 1**.**

Assumption 2**.**

Assumption 3**.**

Assumption 4**.**

Theorem 1**.**

3.1.1 Relationship between the RLMRLMRLM and LMRMTLM_{RMT}LMRMT​ tests

3.1.2 Relationship of the RLMRLMRLM, LMbcLM_{bc}LMbc​ and CDLMCD_{LM}CDLM​ tests

3.2 The RLMPERLM_{PE}RLMPE​ test

3.2.1 Test based on ∑i≠jρ^ij4\sum_{i\neq j}\hat{\rho}_{ij}^{4}∑i=j​ρ^​ij4​

Theorem 2**.**

Remark 1**.**

Remark 2**.**

4 Dynamic panel data model

Assumption 5**.**

Theorem 3**.**

Theorem 4**.**

5 Fixed effects panel data model

Assumption 6**.**

Theorem 5**.**

Theorem 6**.**

6 Monte Carlo simulations

6.1 Monte Carlo design

6.1.1 DGP1: Heterogeneous panel data model with strictly exogenous regressors

6.1.2 DGP2: Heterogeneous panel data model with weakly exogenous regressors

6.1.3 DGP3: Fixed effects panel model

6.1.4 DGP4: Dynamic panel data model

6.2 Simulation results

7 Conclusion

Appendix

Lemma 1**.**

Lemma 2**.**

Lemma 3**.**

Lemma 4**.**

Lemma 5**.**

Lemma 6**.**

Proof.

Remark 3**.**

Lemma 7**.**

Lemma 8**.**

Proof.

Proof.

Proof.

Proof.

Proof.

Lemma 9**.**

Proof.

Lemma 10**.**

Proof.

Assumption 1.

Assumption 2.

Assumption 3.

Assumption 4.

Theorem 1.

3.1.1 Relationship between the $RLM$ and $LM_{RMT}$ tests

3.1.2 Relationship of the $RLM$ , $LM_{bc}$ and $CD_{LM}$ tests

3.2 The $RLM_{PE}$ test

3.2.1 Test based on $\sum_{i\neq j}\hat{\rho}_{ij}^{4}$

Theorem 2.

Remark 1.

Remark 2.

Assumption 5.

Theorem 3.

Theorem 4.

Assumption 6.

Theorem 5.

Theorem 6.

Lemma 1.

Lemma 2.

Lemma 3.

Lemma 4.

Lemma 5.

Lemma 6.

Remark 3.

Lemma 7.

Lemma 8.

Lemma 9.

Lemma 10.