Robust Productivity Analysis: An application to German FADN data

Mathias Kloss; Thomas Kirschstein; Steffen Liebscher; Martin; Petrick

arXiv:1902.00678·econ.GN·February 14, 2019

Robust Productivity Analysis: An application to German FADN data

Mathias Kloss, Thomas Kirschstein, Steffen Liebscher, Martin, Petrick

PDF

Open Access

TL;DR

This paper introduces a two-step method combining outlier detection and robust parameter estimation to improve productivity analysis accuracy using German FADN data, addressing biases from outliers and multicollinearity.

Contribution

It presents a novel approach that integrates multivariate outlier detection with consistent production function estimation for more reliable empirical results.

Findings

01

Outlier detection effectively identifies multivariate outliers.

02

Decontamination improves parameter estimation precision.

03

Method reduces bias from multicollinearity in productivity analysis.

Abstract

Sources of bias in empirical studies can be separated in those coming from the modelling domain (e.g. multicollinearity) and those coming from outliers. We propose a two-step approach to counter both issues. First, by decontaminating data with a multivariate outlier detection procedure and second, by consistently estimating parameters of the production function. We apply this approach to a panel of German field crop data. Results show that the decontamination procedure detects multivariate outliers. In general, multivariate outlier control delivers more reasonable results with a higher precision in the estimation of some parameters and seems to mitigate the effects of multicollinearity.

Tables8

Table 1. Table 1: Outlier treatment in empirical economics

	FADN	LSMS	GSOEP
Sample composition	FACEPA (2009-2011); Factor Markets (2011-2013)	World Bank LSM (1980-2002) & Policy Research Working Paper (2010-2015)	SOEPapers (2011-2015)
Number of studies surveyed	36	129	396
Number of studies that deal with outliers	11 ( $\approx$ 30.6%)	23 ( $\approx$ 17.8%)	34 ( $\approx$ 8.6%)
Dominant decontamination approach	Univariate	Univariate	Univariate
Frequently used methods and examples	Trimming (Guastella and Moro,, 2013; Petrick and Kloss,, 2013), No method stated (Bakucs et al.,, 2010; Ciaian et al.,, 2011)	Visual observation (Deaton,, 1981, 1988), Trimming (Oseni et al.,, 2014; Backiny-Yetna and McGee,, 2015), Censoring of outliers (Liverpool-Tasie et al.,, 2015)	Visual observation (Crosetto and Filippin,, 2012; Obschonka et al.,, 2013), Trimming (Pfeiffer and Schulz,, 2011; Murphy and Oesch,, 2015), No method stated (Lang,, 2012; Arnold et al.,, 2014)

Table 2. Table 2: Results of simulated production function example

decont.	raw sample			sample I			sample II
scheme	est. $l$	est. $k$	$N$	est. $l$	est. $k$	$N$	est. $l$	est. $k$	$N$
without	0.41^∗∗∗	0.61^∗∗∗	700	0.28^∗∗∗	0.73^∗∗∗	840	0.22^∗∗∗	0.80^∗∗∗	840
univariate	0.39"^∗∗∗	0.62^∗∗∗	526	0.24^∗∗∗	0.74^∗∗∗	677	0.20^∗∗∗	0.78^∗∗∗	711
multivariate	0.40^∗∗∗	0.61^∗∗∗	498	0.42^∗∗∗	0.62^∗∗∗	539	0.41^∗∗∗	0.62^∗∗∗	534

Table 3. Table 3: Selection of variables

FADN code	Variable description
Outputs
SE131	Total output (€)
Inputs
SE011	Labour input (hours)
SE025	Total utilised agricultural area (ha)
\pbox20cmF72 + SE300 +
SE305 + SE336	\pbox20cmCosts for seed and seedlings + crop protection + other crop specific
costs + overheads (EUR) = materials (€)
SE360	Depreciation (€) = fixed capital
Source: Authors.

Table 4. Table 4: Multivariate outliers divided into large and small companies

	small companies	large companies	neither nor	$\sum$
East	79	6	237	322
West	229	236	656	1121
Source: Authors.

Table 5. Table 5: Results of production function estimation (East Germany, WLP estimator)

	no-out		uni-out		full-out		small-large
	Coeff	SE	Coeff	SE	Coeff	SE	Coeff	SE
labour	$0.000$	$0.050$	$0.002$	$0.049$	$- 0.116$ ^∗∗∗	$0.040$	$- 0.004$	$0.056$
land	$- 0.044$	$0.042$	$- 0.044$	$0.034$	$0.125$ ^∗∗∗	$0.042$	$- 0.019$	$0.062$
materials	$1.015$ ^∗∗∗	$0.136$	$1.034$ ^∗∗∗	$0.158$	$0.884$ ^∗∗∗	$0.147$	$1.001$ ^∗∗∗	$0.145$
capital	$0.125$ ^∗∗∗	$0.047$	$0.123$ ^∗∗	$0.050$	$0.146$ ^∗∗∗	$0.039$	$0.110$ ^∗∗	$0.049$
$N$	1340		1305		1262		1327
Elast of Scale	$1.096$		$1.115$		$1.039$		$1.089$
p-value CRS	$0.395$		$0.381$		$0.757$		$0.463$
p-value Model	$< 0.001$		$< 0.001$		$< 0.001$		$< 0.001$
RSS	$0.065$		$0.064$		$0.051$		$0.063$

Table 6. Table 6: Results of production function estimation (West Germany, WLP estimator)

	no-out		uni-out		full-out		small-large
	Coeff	SE	Coeff	SE	Coeff	SE	Coeff	SE
labour	$0.186$ ^∗∗∗	$0.019$	$0.184$ ^∗∗∗	$0.019$	$0.167$ ^∗∗∗	$0.023$	$0.178$ ^∗∗∗	$0.021$
land	$- 0.010$	$0.016$	$- 0.011$	$0.016$	$0.019$	$0.020$	$- 0.000$	$0.018$
materials	$0.802$ ^∗∗∗	$0.086$	$0.773$ ^∗∗∗	$0.083$	$0.803$ ^∗∗∗	$0.091$	$0.786$ ^∗∗∗	$0.089$
capital	$0.159$ ^∗∗∗	$0.023$	$0.169$ ^∗∗	$0.023$	$0.129$ ^∗∗∗	$0.022$	$0.156$ ^∗∗∗	$0.023$
$N$	3382		3355		3053		3245
Elast of Scale	$1.137$		$1.115$		$1.118$		$1.120$
p-value CRS	$0.061$		$0.105$		$0.135$		$0.112$
p-value Model	$< 0.001$		$< 0.001$		$< 0.001$		$< 0.001$
RSS	$0.065$		$0.065$		$0.060$		$0.064$

Table 7. Table 7: Results of production function estimation ("adjusted" small-large sample, East Germany)

	adjusted small-large
	Coeff	SE
labour	$- 0.074$ ^∗	$0.039$
land	$0.059$	$0.041$
materials	$0.969$ ^∗∗∗	$0.143$
capital	$0.105$ ^∗∗	$0.049$
$N$	1322
Elast of Scale	$1.059$
p-value CRS	$0.619$
p-value Model	$< 0.001$
RSS	$0.060$

Table 8. Table 8: Results of regressions leaving labour and land out (full-out sample, East Germany)

	full-out		full-out
	Coeff	SE	Coeff	SE
labour	–	–	$- 0.086$ ^∗∗	$0.037$
land	$0.097$ ^∗∗	$0.041$	–	–
materials	$0.813$ ^∗∗∗	$0.146$	$0.969$ ^∗∗∗	$0.143$
capital	$0.140$ ^∗∗∗	$0.038$	$0.150$ ^∗∗∗	$0.040$
$N$	1262		1262
p-value Model	$< 0.001$		$< 0.001$
RSS	$0.052$		$0.054$

Equations17

y_{i t} = α^{A} a_{i t} + α^{L} l_{i t} + α^{K} k_{i t} + α^{M} m_{i t} + ω_{i t} + ε_{i t},

y_{i t} = α^{A} a_{i t} + α^{L} l_{i t} + α^{K} k_{i t} + α^{M} m_{i t} + ω_{i t} + ε_{i t},

ω_{i t} = h_{t} (m_{i t}, k_{i t}),

ω_{i t} = h_{t} (m_{i t}, k_{i t}),

ω_{i t} = E [ω_{i t} ∣ ω_{i t - 1}] + ξ_{i t},

ω_{i t} = E [ω_{i t} ∣ ω_{i t - 1}] + ξ_{i t},

E [ω_{i t} ∣ k_{i t}, a_{i t - 1}, l_{i t - 1}, k_{i t - 1}, m_{i t - 1}, \dots, a_{i 1}, l_{i 1}, k_{i 1}, m_{i 1}]

E [ω_{i t} ∣ k_{i t}, a_{i t - 1}, l_{i t - 1}, k_{i t - 1}, m_{i t - 1}, \dots, a_{i 1}, l_{i 1}, k_{i 1}, m_{i 1}]

= E [ω_{i t} ∣ ω_{i t - 1}] = g (ω_{i t - 1}) \equiv g [h (m_{i t - 1}, k_{i t - 1})],

y_{i t} = α^{A} a_{i t} + α^{L} l_{i t} + α^{K} k_{i t} + α^{M} m_{i t} + g [h (m_{i t - 1}, k_{i t - 1})] + e_{i t},

y_{i t} = α^{A} a_{i t} + α^{L} l_{i t} + α^{K} k_{i t} + α^{M} m_{i t} + g [h (m_{i t - 1}, k_{i t - 1})] + e_{i t},

E [e_{i t} ∣ k_{i t}, a_{i t - 1}, l_{i t - 1}, k_{i t - 1}, m_{i t - 1}, \dots, a_{i 1}, l_{i 1}, k_{i 1}, m_{i 1}] = 0.

E [e_{i t} ∣ k_{i t}, a_{i t - 1}, l_{i t - 1}, k_{i t - 1}, m_{i t - 1}, \dots, a_{i 1}, l_{i 1}, k_{i 1}, m_{i 1}] = 0.

y_{i t} = 0.4 \cdot l_{i t} + 0.6 \cdot k_{i t} + ω_{i} + ε_{i t},

y_{i t} = 0.4 \cdot l_{i t} + 0.6 \cdot k_{i t} + ω_{i} + ε_{i t},

y_{i t} = 0.99 \cdot l_{i t} + 0.01 \cdot k_{i t} + ω_{i} + ε_{i t} .

y_{i t} = 0.99 \cdot l_{i t} + 0.01 \cdot k_{i t} + ω_{i} + ε_{i t} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Agricultural risk and resilience

Full text

Robust Productivity Analysis: An application to German FADN data

Mathias Kloss Leibniz Institute of Agricultural Development in Transition Economies (IAMO), Theodor-Lieser-Straße 2, D-06120, Halle (Saale), Germany

Thomas Kirschstein Martin-Luther-University Halle-Wittenberg, Große Steinstraße 73, D-06099 Halle (Saale), Germany

Steffen Liebscher22footnotemark: 2

Martin Petrick11footnotemark: 1

Abstract

Sources of bias in empirical studies can be separated in those coming from the modelling domain (e.g. multicollinearity) and those coming from outliers. We propose a two-step approach to counter both issues. First, by decontaminating data with a multivariate outlier detection procedure and second, by consistently estimating parameters of the production function. We apply this approach to a panel of German field crop data. Results show that the decontamination procedure detects multivariate outliers. In general, multivariate outlier control delivers more reasonable results with a higher precision in the estimation of some parameters and seems to mitigate the effects of multicollinearity.

Keywords: Productivity analysis, Outlier detection, FADN, Germany

1 Introduction and Background

In assessing factor productivity, biases can mainly emerge from two different sources. The first source is with regard to the classical identification problem of production function parameters. Inputs are usually subject to the farmers decision making process. Hence, there is an endogeneity problem due to the endogenous nature of input choice (Griliches and Mairesse,, 1998). In addition, due to lack of variation in factor utilisation across firms, standard estimation procedures like OLS and fixed effects are not able to extract the information necessary for the separate identification of the different output elasticities. This leads to a collinearity problem (Ackerberg et al.,, 2007). Recently, there has been new insight into these two issues and promising approaches have been developed to mitigate them (cf. Olley and Pakes,, 1996; Blundell and Bond,, 2000; Levinsohn and Petrin,, 2003; Wooldridge,, 2009; Gandhi et al.,, 2011). However, even in case that the statistical identification is secured, there might be sources of bias emerging from the data itself. This usually happens if there are outliers in the data set, which is an ubiquitous feature of many real world data sets, and hence an issue in many empirical applications. If outliers are not dealt with, estimators can be obscured arbitrarily by the presence of as little as one single outlier in the data set. Moreover, in terms of estimating factor productivity accounting for outliers ensures that the assumption of a homogenous production technology is maintained. In this paper, we focus on this latter source of bias while treating the production function estimator which controls for the identification problems mentioned as given.

Outliers occur e.g. due to measurement errors, variations in the data generating process, or misreporting. In general, other than by being very pragmatic and not performing any outlier decontamination, there are two concurrent views on how to define and, consequently, identify outliers.111Not performing any outlier control is a common practice (see section 2). On the one hand, there are methods which assume a rigorous statistical model e.g. in a sense that outliers are thought of as data coming from a different distribution (from another than the one the researcher is actually interested in). The whole data can then be understood as a mixture of two (or more) distributions, where the target distribution should comprise the majority of the probability mass (if, in contrast, the outliers would constitute the majority, it would not make much sense to speak of outliers at all. In such cases the outliers should rather be considered as the distribution of interest). Methods falling into that category commonly aim at estimating some features of the target distribution while outlier detection is usually not their primary objective (it is more of an inherent by-product). If outliers are identified, they are identified with that precise model in mind, meaning that the methods in that category cannot be reasonably applied when the underlying (distributional) assumptions are not fulfilled (at least no useful results can be expected in that case). In this category, a bunch of robust estimators for various models can be found, see e.g. Rousseeuw and Leroy, (1987); Barnett and Lewis, (2000); Hampel et al., (2005); Maronna et al., (2006); Huber and Ronchetti, (2009) for an overview of such methods.

On the other hand, there are methods which follow a more sample-oriented view on how to define an outlier. Universally, all of these methods interpret outliers as observations that differ from the target observations. Obviously, there exist numerous proposals on how that difference can actually be quantified. A large amount of these methods is based on some type of distance, either between any two observations or with respect to some reference point, but there are also other approaches e.g. based on depth or on the empirical density. Representatives can mainly be found in the computer science and data mining literature, see e.g. Chandola et al., (2009) and references therein as a starting point. With these methods, the primary goal is indeed the identification of outliers. An additional analysis might still be carried out on the identified non-outliers, though. The biggest advantage of methods falling into that category is that they can generally be applied without being limited to situations where certain distributional assumptions are fulfilled.

Generally, as we discovered by evaluating a sample of empirical economics papers, if a decontamination is conducted almost always univariate methods focusing on one variable of a more complex multivariate model are employed prior to the follow-up analysis. However, such approaches neglect the models multivariate nature.

In this paper we propose a robust two-stage approach for estimating the production function for field crop farms in East and West Germany based on the "Farm Accountancy Data Network" (FADN) data set. First, by performing the outlier decontamination, and second by estimating the parameters of the production function. While the literature on outlier detection methods is vast (Maronna et al.,, 2006) and other methods might be feasible as well (e.g. cluster analysis), we resort to a multivariate decontamination procedure due to Kirschstein et al., (2013) for identifying the outliers. This method has already proven its effectiveness in an application to determine unsuccessful warship designs (Liebscher and Kirschstein,, 2012). Moreover, it is a non-parametric approach (not requiring special distributional assumptions) and it offers computational advantages (especially for large data sets as the one analyzed here) which makes it our preferred choice. After decontamination, we estimate the parameters of the production function by the Wooldridge, (2009) instrumental variable estimator to account for endogeneity as well as collinearity issues. While we consider the estimation of production functions, the insights should also be of relevance to other model driven empirical applications in which the data is contaminated by outliers as well as those in which instrumental variable estimation is performed.

The paper proceeds as follows. In section 2 we discuss the going practice in outlier treatment in empirical economics. Next, we present the methodological background. Firstly, by discussing our approach to outlier identification. Secondly, by presenting the methods employed for production function estimation. At this stage, we also illustrate by example the consequences to the production function estimation in presence of outliers in the data. Section 4 discusses the FADN data. We move on with a presentation of our results. Section 6 concludes the paper.

2 Going practice in empirical economics

In order to get a general idea how outliers have been treated within the field of economics, we surveyed studies from two additional data sources that have approximately the same importance as the "Farm Accountancy Data Network" (FADN), our data source, in agricultural economics. For the FADN data, we reviewed studies from two recent research projects funded by the European Commission under FP7 which heavily employed this data set: "Farm Accountancy Cost Estimation and Policy Analysis of European Agriculture" (FACEPA) and "Factor Markets". For the field of development economics we reviewed Working Papers from the World Bank’s "Living Standard Measurement Survey" (LSMS) studies while for the field of general economics we analyzed studies employing data from the "German Socio-Economic Panel" (GSOEP). The GSOEP’s "SOEPapers" series is a collection of such work. In Table 1 we summarize the going practice of outlier treatment for a sample of empirical work from these three data sources.

Generally, the three data sources leave the assessment as well as handling of outliers to the researcher (Grosh and Munoz,, 1996; Haisken-DeNew and Frick,, 2005; European Commission,, 2010). Grosh and Munoz, (1996, p. 125) for the LSMS are explicitly open about this view by explaining that "further treatment of these problems should be left to analysts, since there is no universally acceptable solution to these problems". This argument opens up many possibilities for researchers to deal with outliers.

In empirical economic literature emerging from these sources, a large group of authors assess outliers by visual observation - which is usually only feasible for smaller data sets and two dimensions - and contextual reasoning. Based on these modes of operation they drop implausible cases (e.g. Deaton,, 1981, 1988; Crosetto and Filippin,, 2012; Obschonka et al.,, 2013; Oltmanns et al.,, 2014; Auer and Danzer,, 2014). Other approaches involve the transformation of variables (e.g. logarithmization), the application of influence measures or censoring of extreme values (cf. Schneck,, 2011; Olper et al.,, 2014; Liverpool-Tasie et al.,, 2015). Some authors remove outliers without stating their method of detection (e.g. Bauernschuster et al.,, 2011; Ciaian et al.,, 2011; Lang,, 2012; Kemptner,, 2013; Arnold et al.,, 2014).

More structured approaches apply a two-step procedure by first identifying and removing extreme observations for a target variable (univariate outliers) prior to the desired analysis. The approach most commonly used among all empirical data sources is ’trimming’, either by removing some percentage of the data (usually 1% or 5%) at the top and/or bottom of the distribution of a single univariate measure central to the analysis or by applying a quantile based rule, e.g. upper/lower quartile $\pm s\cdot IQR$ with $s$ being some scaling factor and $IQR$ the interquartile range (e.g. Pfeiffer and Schulz,, 2011; Zibrowius,, 2012; Schmitt,, 2013; Sorgner and Fritsch,, 2013; Guastella and Moro,, 2013; Oseni et al.,, 2014; Murphy and Oesch,, 2015; Avdic and Bünnings,, 2015; Backiny-Yetna and McGee,, 2015). However, univariate outlier identification approaches neglect the multivariate data’s nature.

Most strikingly, the group of papers that does not mention any outlier control is by far the largest (e.g. Pitt,, 1995; Alderman,, 1998; Bauer et al.,, 2011; Carletto et al.,, 2011; Headey et al.,, 2012; Dustmann and Görlach,, 2015; Schurer,, 2015). For instance, in only 8.6% of the sample comprising 396 studies we reviewed from the SOEPapers data base (period 2011 - 2015) an outlier control was mentioned.

Another way is to incorporate the outlier problem directly into the estimator (i.e. by developing or applying a robust estimator). Hübler, (2012) applies a simple quantile regression estimator. This way outliers are accounted for by using an appropriate estimation method. However, with the estimation of instrumental variable problems in mind, this method is not feasible since it does not account for endogeneity and collinearity issues. To develop a robust estimator capable to deal with endogeneity and collinearity issues would require extensive work which is beyond the scope of this paper.

Another two-step approach, and so far the only one to our knowledge, which accounts for outliers in a multivariate manner using FADN data has been used within the FACEPA project (Bahta et al.,, 2011). Their detection procedure employs the minimum covariance determinant (MCD) algorithm due to Rousseeuw, (1985). However, this approach imposes strict distributional assumptions on the data. Furthermore, their estimation procedure does not account for farm specific heterogeneity.

In summary, in the majority of studies no outlier control is mentioned. In those studies that do mention a decontamination procedure univariate methods prevail.

3 Methodology

In order to perform an unbiased estimation, we proceed in two steps. First, we decontaminate the sample from outliers by pruning the minimum spanning tree (Kirschstein et al.,, 2013). After eliminating the outliers from the data, we proceed by estimating the production function using the Wooldridge, (2009) production function estimator. In the upcoming subsection we describe both methods in detail. We conclude this section by demonstrating the effects of outliers on production function estimation with simulated data.

3.1 Decontamination by pruning the minimum spanning tree

Robust statistical methods are designed to deliver unbiased estimates of measures of interest in presence of contaminated data sets. A general two-step-approach is, therefore, to identify an outlier-free subsample first. This step can be called decontamination. In turn, observations not belonging to the uncontaminated subsample are suspected to be outliers. In contrast to outlier detection approaches, this approach tries to find an outlier-free subsample instead of outliers. This way, we accept the risk to falsely discard non-outlying observations in favor of a (most likely) outlier-free subsample.

A robust estimator relying on a non-parametric decontamination procedure is the pMST estimator which is a robust estimator of multivariate location and scatter (Kirschstein et al.,, 2013). The decontamination results from pruning of the so-called Minimum Spanning Tree (MST) of a data set. The idea is that each observation of a data set represents a point in Euclidean space whereby its coordinates are the observation’s values in each dimension. Note that in a panel data context, an observation corresponds to an entry in the data base, e.g. a farm’s record of labour, land, materials and capital use in a certain year.222In the formal presentation we resign from using a time index to prevent a too cluttered notation. For qualifying observations to be outlying, the pMST procedure implies that outliers are isolated with respect to similar observations. Similarity is here defined as the Euclidean distance between two observations. Similar observations can be interpreted as neighbours. In order to identify the observations’ neighbourhoods, the spanning tree concept is used (see Figure 1 for a graphic illustration in two dimensions). The aim is to select a minimal set of connections between observations such that all observations are connected with each other. Among all spanning trees, the minimum spanning tree has smallest weight, i.e. the sum of the lengths of all connections is minimal. For decontamination, the pMST procedure then iteratively deletes the longest connections in the MST until a certain threshold is reached (see below). The largest (still connected) fraction of the original MST is finally retained as the non-outlying part of the original data set. A formal description of the algorithm follows.

Formally, given a data set $\mathbf{X}$ of $n$ points in dimension $p$ , i.e. $\mathbf{X}=\{\mathbf{x}_{1},\ldots,\mathbf{x}_{n}\}\subseteq{\mathbb{R}}^{p}$ , and all pairwise links (edges) $\mathbf{E}$ (i.e. $\mathbf{E}\ni e_{ij}=\{\mathbf{x}_{i},\mathbf{x}_{j}\}$ with $i,j=1,...,n$ and $i\neq j$ ), the MST is defined as the graph $\mathbf{G}=(\mathbf{X},\mathbf{E}^{*})$ connecting all points of $\mathbf{X}$ such that its total length is minimized, i.e. $\operatorname*{arg\,min}\limits_{\mathbf{E}^{*}\subset\mathbf{E}}\left(\sum\limits_{e_{ij}\in\mathbf{E}^{*}}w(e_{ij})\right)$ . Typically, the weight of an edge $e_{ij}=\{\mathbf{x}_{i},\mathbf{x}_{j}\}$ is the Euclidean distance between $\mathbf{x}_{i}$ and $\mathbf{x}_{j}$ that is $w(e_{ij})=\sqrt{(\mathbf{x}_{i}-\mathbf{x}_{j})\cdot(\mathbf{x}_{i}-\mathbf{x}_{j})^{\prime}}$ . From this fact follows that Euclidean distances are meaningful for the data set, i.e. all variables are estimated on the same (or at least a similar) scale. The term ”connected” refers to the property that there must be a path (i.e. a sequence of edges) in $\mathbf{G}$ between any two points of $\mathbf{X}$ . It can be proven that the number of edges in $\mathbf{G}$ is always $|\mathbf{E^{*}}|=n-1$ (Jungnickel,, 2008). Moreover, it can be shown that the MST is unique if all edges in $\mathbf{E}$ have unique weights. The MST can be efficiently computed even for large data sets. See e.g. Jungnickel, (2008) for a review of efficient algorithms.

Given the MST, pruning is realized by successively deleting edges in $\mathbf{G}$ according to their length. I.e. in the first iteration the longest edge is removed, in the second iteration the second longest edge, and so on. This way $\mathbf{G}$ is split into several subgraphs which are not connected amongst each other. During the pruning process, the subgraphs’ cardinality (i.e. the corresponding number of observations) declines. The pruning process is stopped if, by deleting the next edge, the cardinality of the largest subgraph would fall below $\lfloor(n+p+1)/2\rfloor$ . Stopping at this bound assures that subsequently applied estimators achieve a maximum breakdown point, i.e. they are resistant against a maximum level of contamination. The largest subgraph at this point contains more than or exactly $\lfloor(n+p+1)/2\rfloor$ observations and is denoted by $\mathbf{G^{\prime}}=\left(\mathbf{X}^{\prime},\mathbf{E}^{\prime}\right)$ with $\mathbf{X}^{\prime}\subset\mathbf{X}$ and $\mathbf{E}^{\prime}\subset\mathbf{E}^{*}$ . This approach was first proposed by Bennett and Willemain, (2001). A discussion on the robustness properties of the associated estimators can be found in Kirschstein et al., (2013). Problematically, in most real world data sets much less than $n-\lfloor(n+p+1)/2\rfloor$ outliers occur. To avoid low efficiency of robust estimators, reweighting procedures were proposed to enlarge the uncontaminated subsample.

For MST-based outlier decontamination, reweighting implies that a critical edge length $w_{\alpha}^{crit}$ has to be determined up to which (with a certain probability $\alpha$ ) the MST consists of uncontaminated observations only. To estimate $w_{\alpha}^{crit}$ , a non-parametric approach relying on a finite sample version of Chebychev’s inequality is described in Liebscher and Kirschstein, (2014). The main idea is to estimate $w_{\alpha}^{crit}$ based on the mean edge length $\mu^{w}$ and the edge lengths’ standard deviation $\sigma^{w}$ . The parameters $\mu^{w}$ and $\sigma^{w}$ are estimated based on the edge set $\mathbf{E}^{\prime}$ of the initial robust subset of $\mathbf{G}^{\prime}$ . For $w_{\alpha}^{crit}$ follows $w_{\alpha}^{crit}=\hat{\mu}^{w}+\sqrt{\frac{(m^{2}-1)}{m^{2}\cdot(1-\alpha)-m}}\cdot\hat{\sigma}^{w}$ where $m$ denotes the cardinality of $\mathbf{E}^{{}^{\prime}}$ . Once $w_{\alpha}^{crit}$ is determined, $\mathbf{G^{\prime}}$ is ”rebuilt” by attaching all edges of $\mathbf{G}$ with edge weights smaller or equal to $w_{\alpha}^{crit}$ . This way, a still robust but larger subgraph (say $\mathbf{G}^{\prime\prime}$ ) is determined whose associated observations are considered as the outlier-free subsample used in further analyses.

3.2 Production function estimation

Suppose production can be described by the Cobb-Douglas function:

[TABLE]

where $y$ is the natural logarithm of output $Y$ , $A$ is land use, $L$ is labour, $K$ fixed capital, $M$ materials (working capital) and $i$ and $t$ are farm and time indices. Lower case letters denote the natural logarithm of the inputs. The $\alpha^{X}$ are parameters to be estimated, and $X\in\left\{A,L,K,M\right\}$ refers to the production factors. $\omega_{it}$ are farm- and time-specific factors known by the farmer but unobserved by the analyst. $\varepsilon_{it}$ are the remaining independent and identically distributed (iid) errors.

First, previous studies on productivity analysis using FADN data focused on this functional form. Flexible functional forms such as the Translog specification are desirable. However, recent studies employing this data set have shown at best mixed results with the Cobb-Douglas specification (Zhengfei et al.,, 2006; Petrick and Kloss,, 2013; Latruffe and Nauges,, 2013). Furthermore, with increasing degrees of power in polynomials of inputs virtually any data structure can be modelled. This could possibly also mitigate the outlier problematic but it implies adding more and more regressors to the estimating equation. As a result, problems such as multicollinearity are amplified. Therefore, we resort to the specification stated in (1).

Next, the $\omega_{it}$ will likely be correlated with the other inputs in (1) because factor use across farms is usually under control of the farmer. Therefore, the production factors in (1) are subject to an endogeneity problem. As a result, the OLS estimator which is commonly used as an empirical baseline will produce biased estimates of output elasticities since it neglects the presence of $\omega_{it}$ . In empirical practice a typical outcome are upward biased elasticities for variable inputs (e.g. materials). To tackle the endogeneity problem, several strategies have been proposed (cf. Griliches and Mairesse,, 1998). One route is to assume that $\omega_{it}$ can be clearly separated into individual and time specific fixed effects (Mundlak,, 1961). However, this approach, so far, has been applied only with mixed success. In order to control for $\omega_{it}$ , we apply in this paper a semi-parametric control function approach due to Wooldridge, (2009). This estimator belongs to a class of so-called ’proxy’ approaches introduced by Olley and Pakes, (1996) and further developed by Levinsohn and Petrin, (2003). Control function estimators have already been successfully applied in an agricultural economics context (e.g. Petrick and Kloss,, 2013; Kloss and Petrick,, 2014).These estimators assume that adjustment costs are the main driver of unobserved heterogeneity, $\omega_{it}$ . The Wooldridge extension further allows to tackle the collinearity issue (Bond and Söderbom,, 2005). This problem prevents other estimators (including OLS and traditional fixed effects approaches) from theoretically identifying the impact of fully variable production factors on productivity.

Assuming the existence of a suitable proxy (e.g. materials) for $\omega_{it}$ , we can write:

[TABLE]

where $h_{t}$ is a potentially observable control function and $k_{it}$ is the pre-determined level of capital use at time $t$ . It is assumed to evolve according to $k_{it+1}=(1-\delta)k_{it}+inv_{it}$ , with $\delta$ as the depreciation rate and $inv$ as investment.

Furthermore, suppose that unobserved productivity follows a first-order Markov process:

[TABLE]

where $\xi_{it}$ is an innovation uncorrelated with $k_{it}$ , but possibly correlated with the other factors in the production function. Following Wooldridge, (2009), we further assume that $\omega_{it}$ has conditional expectation such that:

[TABLE]

where $g$ is an unknown productivity function.

Now, the problem can be formulated by plugging the identity, $\omega_{it}=g[h(m_{it-1},k_{it-1})]$ , in (1) as follows:

[TABLE]

where $e_{it}=\xi_{it}+\varepsilon_{it}$ . The moment conditions that hold for (5) are:333Note, that the orthogonality conditions in (5) are weakened in a way that only current realisations and one lag of the inputs are assumed to be uncorrelated with the $\varepsilon_{it}$ / $e_{it}$ .

[TABLE]

Hence, in (5) current and past values of $k$ , past values of $a$ , $l$ and $m$ and also functions of these may be used as instruments. Given this setup, we can identify the production function parameters by estimating (5) using instrumental variable estimation with instruments for $a$ , $l$ and $m$ (Wooldridge,, 2009, p. 113). The function $h$ is approximated by low-order polynomials of first-order lags of $m$ and $k$ while $g$ might be a random walk with drift (Wooldridge,, 2009, p. 114).

3.3 Artificial example

In order to demonstrate the effects of outliers on non-robust estimations, we discuss a simplified example. Therefore, we simulate an example with 100 farms over 7 periods. The data generating process of these observations is as follows:

[TABLE]

where $y$ , $l$ and $k$ are the natural logarithm out output, labour and capital, $\omega_{i}$ represents unobserved heterogeneity with $\omega_{i}\sim N(0,25)$ and $\varepsilon_{it}$ is the remaining disturbance following $N(0,1)$ . Labour and capital input are random variables with $N(0,4)$ .

As outliers we generate two data sets with 20 small farms over the same periods with $\omega_{i}\sim N(-5,4)$ and $l_{it},k_{it}\sim N(-5,9)$ . This additional data is added to the ’raw’ sample. Finally, we arrive at two different outlier contaminated data samples. In the first outlier data set, we set $cor(l_{it},k_{it})=0$ (sample I) and in the second $cor(l_{it}k_{it})\approx-1$ (sample II), so that labour and capital are almost (perfect) substitutes. This assumption has been chosen for demonstrative purposes. Please note, we further differentiate in this way to illustrate the effects of multicollinearity on outlier infested data and how outlier decontamination is able to mitigate such effects. In both samples the production function for the outlier sets is:

[TABLE]

I.e. in both sets $7\cdot 20=140$ observations are generated by (8) which are regarded as outlying from the process assumed in (7).

We present fixed effects regression results to control for unobserved farm specific effects ( $\omega_{i}$ ) for all three data sets applying a) no decontamination, b) univariate outlier decontamination, and c) multivariate decontamination using the pMST method. The estimated production coefficients for labour and capital input as well as final sample size are summarized in Table 2.

Given an appropriate estimator that controls for unobserved heterogeneity, we are able to recover the ’true’ production function parameters, as given in (7), rather precisely for both contaminated samples with the multivariate decontamination method. Univariate decontamination yields no improvement in estimation accuracy. In fact, results are close to the raw sample. Hence, this simple procedure fails to detect the ’meaningful’ outliers in a sense that it cannot detect outliers in dimensions other than the considered one. Therefore, without applying multivariate decontamination, already a relatively small number of outliers (about 16.7 per cent) biases the estimates severely.

Multicollinearity (sample II) increases the problem which results in even more deteriorated estimates. However, multivariate decontamination is able to identify the outliers correctly in this case, too. In particular, this example demonstrates that the presence of multicollinearity does not interfere with the multivariate detection procedure’s ability to identify outliers correctly - even in this extreme case of almost perfect correlation between the inputs. Moreover, to put it differently, the pMST procedure might also mitigate multicollinearity. This, as illustrated in this example, works especially well if the outliers are the (major) source of multicollinearity in the data.

In summary, only after controlling for both - unobserved heterogeneity and the effects of outliers - we are capable of obtaining reliable output elasticity estimates. Hence, these two issues have to be treated individually. Given the results of this simple example, an effective outlier decontamination can only be conducted if all model dimensions are considered, i.e. multivariate outlier detection is conducted.

4 Data

We use data for field crop farms extracted from the German Farm Accountancy Data Network (FADN) covering the years from 2001 up to 2008. The FADN provides a farm level data set that holds accountancy data for 25 of the 28 EU member states. To represent the heterogeneity of farms and ensure representativeness, a stratified sample is obtained. The stratification criteria are region, economic size and type of farming. Each year about 80,000 farms are sampled. They represent a population of about 5,000,000 farms in the member states. In each member state a liaison agency is responsible for the data collection and transmission, which consists of about 1,000 variables including structural, economic and financial data.

The farm universe consists of all farms with more than one hectare or those with less than one hectare that provide the market with a specified amount of output. From this universe all non-commercial farms are excluded. To be classified as a commercial farm, a farm must exceed a certain economic size. It is measured in economic size units (ESU). One ESU represents a certain amount of standard gross margin in € that is periodically adjusted for inflation. In addition, farms are classified by type of farming (TF). To justify the assumption of a homogenous state of technology across farms, we only study field crop farms (TF1) and treat East and West Germany separately in the following analysis because they are structurally distinct. East German agriculture can be characterised by large-scale corporate farms while West Germany is dominated by small- to medium-scale family farms. In addition, after the removal of outliers such a treatment allows carving out similarities and differences more precisely since both German regions are under the same jurisdiction while having historically different forms of agricultural organization. East Germany consists of the five states Mecklenburg-West Pomerania, Brandenburg, Saxony-Anhalt, Thuringia and Saxony. West Germany contains all other states except Berlin and Bremen, which are not represented in the FADN data. From the raw data provided by FADN we constructed panel data sets covering the observed years.

We measure output as the total farm output in €, labour by the total on-farm hired and family labour working time and land as the utilised agricultural area in ha - including owned and rented land as well as land in sharecropping. A persistent issue in estimating production functions has been the specification of the capital variable. Typically, some simple measures of input quantities (such as fertilisers or pesticides) and machinery use (such as fuel expenses or tractor hours) are used in the cross-sectional studies. More sophisticated approaches use inventory methods to estimate real capital service flows by making assumptions about depreciation and capital rental rates (Andersen et al.,, 2011). In this study, the material or working capital input is proxied by total intermediate consumption in €. It consists of the total specific costs and overheads arising from production in the accounting year. Among others, it includes cost for fuel, lubricants, water, electricity and seed. We do not include the costs for fertilizer in the materials input as land and fertiliser are highly correlated. This observation implies that these inputs are applied in more or less fixed ratios on the majority of farms. In return, this might induce a multicollinearity problem in the estimations. Nevertheless, due to this correlation, the effect of fertilizer inputs is still captured by the land input even if the former is not included. Fixed capital inputs are approximated by depreciation of capital assets estimated at replacement value in €. Such a treatment is consistent with recent literature on production function estimation using firm level data (Olley and Pakes,, 1996; Blundell and Bond,, 2000; Levinsohn and Petrin,, 2003). This variable includes depreciation for plantations of permanent crops, buildings and equipment, land improvements, machinery, and forest plantations. Table 3 summarizes the variable definitions and gives the actual FADN codes.

All monetary values are deflated to real values in 2005 prices using respective price indices. The information was extracted from the Eurostat online database and merged with the country panels. Output was deflated by the agricultural output price index. Fixed capital was deflated by the agricultural input price index for goods and services contributing to agricultural investment, and working capital by the agricultural input price index for goods and services currently consumed in agriculture.

5 Results

5.1 Outlier identification

The data sets which were finally used for the outlier identification contain a total of 3,610 (East Germany) and 8,490 observations (West Germany), respectively. Data has been logarithmized to compensate for different scales and heavy tails. Figures 2a and 2b show the corresponding scatterplot matrices. Neither the plot for East nor for West Germany gives rise to serious concern. In most cases variables show a high pairwise correlation (especially for East Germany). The major part of the data forms a big cluster with some observations scattered around. An interesting effect can be observed for the variable ”Labour” where the observations seem to be compressed for values somewhere between 0 and 1. This effect results from a substantial amount of ”one-man-companies” in the data sets which all unanimously stated 2,210 hours (i.e., 52 weeks $\cdot$ 5 days $\cdot$ 8.5 hours) as their working input.

When applying the pMST procedure with $\alpha=0.95$ (as described in Section 3.1) on the data, 322 (East) and 1,121 (West) potential outliers are identified. Figures 2a and 2b show the scatterplot matrices with the identified outliers colored in red.

The following results are unfolding:

•

All scattered observations (lying around the main bulk of the data) are identified as outliers (as expected).

•

It seems that observations lying within the main bulk are also identified as outliers, which - at a first glance - might be understood as an undesired result. However, this view leaves the multivariate nature of the data out of consideration, i.e., an observation which is inlying in one scatterplot might be very well an outlier in another bivariate plot or higher dimensional displays.

•

By tendency, outliers seem to be small farms (with low inputs and low output) as they mainly occur in the lower left corner of the scatterplots (this is visible especially for East Germany in Figure 2a).

The latter point can also be confirmed by having a closer look at the outlier characteristics by means of parallel boxplots; see Figures 3a and 3b.

In Figure 3a it can be seen that the outliers are primarily made up of small companies (presumably of the one-man-companies mentioned earlier) since the boxes and the medians are situated clearly below those of the non-outliers. Additionally, the outliers also cover those observations with very large values in each variable as the (upper) whiskers reach out beyond those of the non-outliers. For the West German data (Figure 3b) the situation is not that clear. While the outliers are still made up of the largest and smallest observations in each variable (as expected), the identified outliers seem to be more evenly distributed.

In order to get a better understanding how the outliers divide into ”small” and ”large” companies, the approach described in Liebscher and Kirschstein, (2012) can be applied: Based on the identified non-outliers and their values in the 5 variables a frontier/boundary (in concept similar to the Free Disposal Hull, see e.g. Cooper et al., 2007) can be constructed. Those observations among the non-outliers which dominate the remaining non-outliers (in a sense that they possess a higher value in at least one variable while being at least equal in the remaining variables) constitute the upper boundary (see Figure 4 in the Appendix for an example when considering only the variables Capital and Output of the West German data set). Likewise a lower boundary is constructed (just by switching the sign of the variables). The identified outliers can now be assigned to one of the two groups (large/small) by examining their position in relation to these boundaries. Outliers lying beyond the upper boundary are considered as ”large” companies while observations lying below the lower boundary are considered as ”small” companies. There might also be outliers which lie within the region encapsulated by both boundaries. Table 4 shows the result of this analysis by giving the number of outliers falling in each region.

Note that for these results all five variables have been jointly considered. Therefore the regions are actually hypercuboidal and not planar (as Figure 4 might suggest). The results generally support the conclusions already drawn from the parallel boxplots. Interestingly, for both East and West Germany, there is a substantial amount of outliers lying within the encapsulated region. These observations, while being identified as outliers, must be located close to the bulk of non-outlying observations. Therefore, if one is interested in a more ”conservative” approach in outlier identification, one may consider to include these observations in the set of non-outliers again for any follow-on production function analysis.

5.2 Production function estimation

We estimate the parameters of the production function by the Wooldridge, (2009) (WLP) estimator. It is applied to various samples: a) the full sample without any outlier identification (no-out), b) the "cleaned" subsample resulting after univariate outlier identification (uni-out), c) the "cleaned" subsample resulting after multivariate outlier identification (full-out), and d) the "cleaned" subsample resulting after removing only small and large multivariate outliers (small-large). For case b) observations were dropped if the fixed capital productivity per farm was beyond the upper/lower quartile $\pm$ 1.5 times the interquartile range (IQR). We resort to such a trimming rule because it is prominent in the literature outlined in the introductory section. According to this rule 298 outliers have been detected for East Germany and 1,429 for West Germany which differs from the numbers observed in the multivariate case (see table 4). This result is not too surprising as both ways to detect outliers are inherently different. The final estimation samples include farms that have a minimum panel representation of four consecutive years in order to justify the assumption that factor adjustment drives unobserved heterogeneity. This also ensures that the panels do not become "holey" after applying the outlier removal procedure as the detection is done on observations. In addition, it copes well with the assumption that costly factor adjustment drives unobserved heterogeneity (Petrick and Kloss,, 2013). In general, our approach ensures that it preserves as much valuable information as possible. For instance, if a firm constantly overreports, it will not be included in the estimation sample.

5.2.1 Results for East Germany

Comparing the results in Table 5, it turns out that for the majority of samples the labour coefficient is insignificant and close to zero. Generally, materials is the most important input as it displays output elasticities fluctuating at around $1.0$ . Furthermore, the hypothesis of constant returns to scale cannot be rejected for any sample.

As expected, results based on the "small-large" subsample and the "uni-out" subsample are quite similar to the "no-out" model. This means estimates and goodness of fit (in terms of residuals’ sum of squares, normalized by number of observations and number of parameters to be estimated) are almost equal showing significantly positive effects for materials and capital. Coefficients for labour and land are insignificant and partially negative. A suspicious artefact is an estimated material elasticity greater than 1. Although scale elasticity sums up to values slightly larger than 1, the hypothesis of constant returns to scale cannot be rejected for any model.

Removing the full set of multivariate outliers (full-out subsample) leads to different results. Compared to the other three models, it was possible to identify the land output elasticity and the materials coefficient slightly decreased. Hence, the multivariate outlier decontamination procedure mitigates the multicollinearity problem with regards to these two inputs - a result also observed for West Germany (see below). This observation further implies that a large amount of the multicollinearity due to land and materials is originating from the multivariate outliers. In addition, farms relying, on average, on a more (fixed) capital intensive production are left in the sample as indicated by the highest capital coefficient among all subsamples. Furthermore, in this particular case results display a higher precision as the standard errors of this parameter is smallest among all subsamples. Likewise, the remaining farms after multivariate decontamination are less working capital intensive. Moreover, the elasticity of scale is closest to one and the corresponding RSS is the lowest among all models. However, the coefficient of labour is significantly negative for this model which is inconsistent with production theory. Nevertheless, estimation results for the other subsamples, including the more conservatively multivariately cleaned subsample d), propose a consensus suggesting that labour is an abundant input factor. We observe this issue in the following.

Taking a closer look at the outliers removed additionally compared to the "small-large" subsample reveals that to a large extent the differences between the "small-large" and "full-out" results originate from the observations of a single farm. A trivariate scatterplot of labour, land, and output shows that the observations of this particular entity are far apart from the remaining data (see the green-coloured points in Figure 5 in the Appendix). Removing the observations of the outlying firm from the "small-large" subsample leads to a significantly negative elasticity estimate for labour (see Table 7 in the Appendix). This highlights the influential effect of even a single outlying farm on regression estimates.

In general, removing all (multivariate) outliers leads to a more homogeneous sample. For East Germany this results in an increase in partial correlation between labour and land from 0.81 for the complete data set to 0.90 for "full-out" subsample. The resulting extreme collinearity between both regressors cannot be compensated by the WLP estimator. Therefore, it appears that land and labour may be regarded as a closely intertwined input "package" where one input measures the effect of the other. Omitting either of these two regressors leads to similar estimates as for the fully specified model (compare Table 8 in the Appendix).

To conclude, we find that materials and capital are restrictive inputs for agricultural production in East Germany. The largest influence on production is consistently estimated for the materials input. However, capital shows a significant effect on production in any model. For land and labour the story is more complex due to outliers and a high level of collinearity of both regressors. However, further specifications suggest that labour is an abundant input factor and that the effect of labour and land may be captured by only including the latter.

5.2.2 Results for West Germany

The production function estimates for West Germany are summarized in Table 6. In general, the structure of farms in East and West Germany is quite different (as also Figures 3a and 3b suggest). While in East Germany a lot of very large farms exist (including some farms identified as outlying), in West Germany primarily small and medium scale family farms are prevalent. Hence, one can presume that production technologies in both regions are different to some extent.

Furthermore, while for all subsamples the assumption of constant returns to scale cannot be rejected, this result is much clearer for estimation samples based on multivariate outlier detection methods. The "no-out" subsample even rejects this assumption at the 10 percent significance level. Orders of magnitude of estimated labour and capital coefficients are smaller in the multivariate compared to the "no-out" and univariate cases - hinting at upward biased coefficients in the latter two. This drives the results more into the direction of returns to scale approaching unity. Additionally, a consistent difference between East and West Germany is that the labour elasticity is significantly positive throughout, indicating that this input is to some extent a scarce factor in West Germany.

Generally, for the case of West Germany, the latest methodological advancements in production function estimation together with multivariate outlier decontamination are able to fully embrace their benefits. This is signalled by - as outlined above - plausible estimates which exhibit positive output elasticities throughout all inputs and, in the case of capital inputs, the highest parameter precision. Finally, RSS is lowest for the full multivariate decontamination scheme.

For West Germany we find similar results for material and capital elasticity as for East Germany. Comparing the "full-out" estimates, coefficients of both variables are quite close for both countries. A similar conclusion can be made for the estimates for the land input. The major difference is the estimate of labour elasticity which is significantly positive in West Germany which may indicate a shortage of labour force compared to East Germany. Additionally, the correlation between labour and land is considerably smaller in West Germany compared to East Germany (about 0.11 to 0.81). This indicates a high level of heterogeneity in the West German data set obviously containing firms with different levels of labour intensity. The opposite is the case for East Germany.

5.3 Evaluation of Results

In summary, the two-step approach presented allows for robust and consistent estimation of production functions. By applying a multivariate assessment of outliers, we are able to treat all considered dimensions of agricultural production - resulting in a global assessment of outliers. Perhaps unsurprisingly, the univariate decontamination procedure is not capable in detecting the ’meaningful’ outliers. This procedure can only detect conventional outliers beyond some threshold in a single dimension, while the multivariate algorithm at hand may, in addition, also detect such outliers located within the production technology. This is a major advantage compared to the common univariate approach employed. By just dropping observations beyond some threshold the univariate procedure tends to "mis-detect" outliers and potentially drops (keeps) valuable (valueless) information. Results indicate that the pMST procedure indeed detected all scattered observations located around the main bulk of observations as well as the ones within the production technology.

The advantage of the multivariate detection also carries over to the estimation, at least to some extent. It results in improved estimates compared to the results for the other samples, e.g. it allows for a higher precision in the estimation of some parameters. In addition, multivariate detection seems to mitigate the effects of multicollinearity further, however, with better results for West Germany. In both German regions, we observe the lowest RSS for estimation samples based on multivariate decontamination with a clear advantage towards the "full-out" samples. This indicates that the multivariately decontaminated samples also lead to preferable results in terms of goodness of fit. Finally, discarding only extreme multivariate outliers leads to more conservative results closer to the full sample estimates.

6 Conclusion

In this paper, we propose a two-step approach for estimating agricultural production functions robustly. In the first step, a non-parametric multivariate decontamination procedure based on pruning the minimum spanning tree of a data set is used to determine an outlier-free subsample. In contrast to univariate extreme value detection which is the prevalent approach in empirical economics practice, this method assesses outliers considering all observed dimensions of agricultural production. In the second step, production functions are estimated using the recently proposed panel data estimator by Wooldridge, (2009) to mitigate endogeneity as well as collinearity issues. The WLP estimator promises consistent estimates by addressing endogeneity and collinearity problems. In our simulated example, we show that given an appropriate estimator only multivariate decontamination leads to estimates close to the true parameter values.

In the empirical section, we apply the two-step procedure to East and West German agricultural field crop data from the FADN data base. The analysis reveals that many outlying observations are made up of relatively small farms. In addition, outliers were also detected within the main bulk of observations, which cannot be detected with a univariate approach. Hence, future empirical analyses utilising FADN data should apply a multivariate outlier decontamination procedure. The estimated production functions show that the WLP estimator delivers convincing results with respect to input elasticities and returns to scale when it is applied to outlier-free subsamples derived by the multivariate pMST approach. The estimates for East and West Germany show similarly positive elasticities for capital (working and fixed) as well as scale elasticities close to one. By moving from the univariate to the multivariate decontaminated sample a picture unfolds in which, on average, more fixed capital and less working capital farms remain in the East German sample; for the West German one it is the other way around. A key advantage of the multivariate decontamination is that it potentially helps to further mitigate multicollinearity problems (see simulated example). Generally, estimations based on this sample provide parameter estimates of previously unidentified coefficients (land in East Germany) or a higher precision in estimation. In case of East Germany, this does come at some cost as we observe a negative and significant labour coefficient. Estimates of labour and land elasticities have to be interpreted carefully due to a high correlation of land and labour. However, results for the other specifications and subsamples, including the more conservative multivariate solution, suggest that labour is no scarce factor in East Germany while land to some extent is.

In essence, the proposed two-step approach reveals that production technology in East and West Germany is not that different as it seems at the first glance. This is even more important as West Germany shows a more heterogeneous set of firms compared to East Germany. Nonetheless, in both regions farms rely on material and capital inputs in a very similar way.

The most remarkable remaining difference between East and West Germany is the labour elasticity, which is significantly positive for West Germany. This can be interpreted as an indication of a shortage of labour force in West Germany whereas East Germany does not suffer from such a restriction. Similarly, for both regions fixed capital constitutes an equally restrictive input factor whereby the materials input is the far most important input. Therefore, policy reforms should aim to ease the access of agricultural companies to capital and labour force, particularly in West Germany.

Statistical outlier analysis in general has also its limitations. As it is only a statistical tool it will not absolve the researcher from thinking about the scope of research. With regard to production function estimation, statistical outlier analysis is a mean to homogenize production technologies. Therefore, the statements made in this paper are with respect to the prevalent production technology in both German regions. However, many of the outliers detected are small farms. If such farms are of interest to the research question at hand, the outlier analysis, uni- or multivariate, needs to be reconsidered.

Appendix

Bibliography63

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ackerberg et al., (2007) Ackerberg, D., Lanier Benkard, C., Berry, S., and Pakes, A. (2007). Econometric Tools for Analyzing Market Outcomes. In Heckman, J. J. and Leamer, E., editors, Handbook of Econometrics. Volume 6A , pages 4171–4276. Elsevier, Amsterdam: North Holland.
2Alderman, (1998) Alderman, H. (1998). Social assistance in Albania : decentralization and targeted transfers. Living Sandards Measurement Survey Working Paper No. 134.
3Andersen et al., (2011) Andersen, M. A., Alston, J. M., and Pardey, P. G. (2011). Capital Services in U.S. Agriculture: Concepts, Comparisons, and the Treatment of Interest Rates. American Journal of Agricultural Economics , 93(3):718–738.
4Arnold et al., (2014) Arnold, D., Brändle, T., and Goerke, L. (2014). Sickness Absence and Works Councils: Evidence from German Individual and Linked Employer-Employee Data. SOE Ppapers on Multidisciplinary Panel Data Research at DIW Berlin No. 691.
5Auer and Danzer, (2014) Auer, W. and Danzer, N. (2014). Fixed-Term Employment and Fertility: Evidence from German Micro Data. SOE Ppapers on Multidisciplinary Panel Data Research at DIW Berlin No. 719.
6Avdic and Bünnings, (2015) Avdic, D. and Bünnings, C. (2015). Does the Burglar Also Disturb the Neighbor?: Crime Spillovers on Individual Well-being. SOE Ppapers on Multidisciplinary Panel Data Research at DIW Berlin No. 737.
7Backiny-Yetna and Mc Gee, (2015) Backiny-Yetna, P. and Mc Gee, K. (2015). Gender differentials and agricultural productivity in Niger. World Bank Policy Research Working Paper 7199.
8Bahta et al., (2011) Bahta, S. T., Kleinhanss, W., and Berner, A. (2011). Results for Germany. In Kleinhanss, W., editor, Implementation, validation and results of the cost of production model using the EU FADN, FACEPA Deliverable No. 3.2 . Braunschweig.