On the Structure of Ordered Latent Trait Models
Gerhard Tutz

TL;DR
This paper explores the structure of ordered latent trait models, clarifies the role of binary models within them, and discusses threshold ordering, enhancing understanding of model construction and interpretation.
Contribution
It provides a detailed analysis of how binary models underpin various ordinal models and clarifies the interpretation of parameters, especially in the partial credit model.
Findings
Binary models are integral to ordinal models.
Threshold ordering is not essential for the partial credit model.
All models exhibit a Guttman space structure.
Abstract
Ordered item response models that are in common use can be divided into three groups, cumulative, sequential and adjacent categories model. The derivation and motivation of the models is typically based on the assumed presence of latent traits or underlying process models. In the construction frequently binary models play an important role. The objective of this paper is to give motivations for the models and to clarify the role of the binary models for the various types of ordinal models. It is investigated which binary models are included in an ordinal model but also how the models can be constructed from a sequence of binary models. In all the models one finds a Guttman space structure, which has previously been investigated in particular for the partial credit model. The consideration of the binary models adds to the interpretation of model parameters, which is helpful, in…
| 1 | 0 | |||
| 1 | ||||
| 0 | ||||
| 0 | 1 | 2 | |
|---|---|---|---|
| 1 | 0 | |||
| 1 | ||||
| 0 | ||||
| 1 | ||||
| Fail(F) | Pass (P) | Distinction (D) |
| Inadequate | Discrete | Integrated and manipulated |
| 0 | (0,0) |
|---|---|
| 1 | (1,0) |
| 2 | (1,1) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Structure of Ordered Latent Trait Models
Gerhard Tutz
Ludwig-Maximilians-Universität München
Akademiestraße 1, 80799 München
Abstract
Ordered item response models that are in common use can be divided into three groups, cumulative, sequential and adjacent categories model. The derivation and motivation of the models is typically based on the assumed presence of latent traits or underlying process models. In the construction frequently binary models play an important role. The objective of this paper is to give motivations for the models and to clarify the role of the binary models for the various types of ordinal models. It is investigated which binary models are included in an ordinal model but also how the models can be constructed from a sequence of binary models. In all the models one finds a Guttman space structure, which has previously been investigated in particular for the partial credit model. The consideration of the binary models adds to the interpretation of model parameters, which is helpful, in particular, in the case of the partial credit model, for which interpretation is less straightforward than for the other models. A specific topic that is addressed is the ordering of thresholds in the partial credit model because for some researchers reversed ordering is an anomaly, others disagree. It is argued that the ordering of thresholds is not a constitutive element of the partial credit model.
Keywords: Ordered responses, latent trait models, item response theory, partial credit model, sequential model, Rasch model
1 Introduction
Various latent trait models for ordered response data have been proposed in the literature, for an overview see, for example, Van der Linden (2016). One can distinguish between three basic types of models, cumulative models, sequential models and adjacent categories models. While for the first two models the interpretation of the latent trait is straightforward, there is still ongoing discussion on the interpretation of the parameters in the adjacent categories approach, see, for example, Adams et al. (2012) or Andrich (2015). In the latter article the problems with the step metaphor in polytomous models, which is a source of misinterpretation, are extensively discussed. There has also been some discussion of the ordering of thresholds in adjacent categories models. Andrich (2013) strongly criticized the findings of Adams et al. (2012) and argued that reverse order of thresholds is an anomaly. One motivation for this paper was to clarify some of the derivations of Andrich (2013). In particular it is argued that the ordering of thresholds is not what makes the adjacent categories model an ordered latent trait model. Moreover, a simpler form of the Guttman structure in the partial credit model is derived.
Ordered latent trait models are typically motivated by latent random variables because the latent process is useful to clarify the meaning of parameters. By characterizing the process that generates the response in one of the ordered categories one obtains an understanding of the meaning of the parameters. A specific property of all the ordered models considered here is that they are linked to binary models, typically binary Rasch models. One of the objectives of the present paper is to investigate the nature of this link. In the investigation it is often useful to distinguish two different views on the link to binary models. One can start from an ordered latent trait model and identify which binary models are implied. Alternatively, one can start from binary models and derive from them an ordered response model. Both views can contribute to the interpretation of the model parameters.
As a preview on the structure of models and the involved binary variables Table 1 shows the three types of dichotomizations that determine the models. It is assumed that the ordinal response takes values from . The variables denote the binary variables that are contained in the model. Brackets denote which response categories are used when dichotomizing and the ”” determines the (conditional) split. The table shows which binary models are contained in ordered latent trait models but can also be used to construct models from binary variables.
We start with cumulative models and then consider sequential models and adjacent categories models. The latter is investigated more closely because the binary variables involved are much harder to investigate than for the other models.
2 Cumulative or Graded Response Models
We first consider cumulative models, for which interpretation of parameters is most easily obtained by deriving the model from latent variables. In the following let , , , denote the ordinal response of person on item .
2.1 Model Derivation from Latent Variables
The class of cumulative models can be motivated by a latent variable , where is a person parameter and is a noise variable with symmetric continuous distribution function . The category boundaries approach assumes that the observed response is a coarser version of the latent variable . More concrete, if person meets item one assumes
[TABLE]
where are ordered item-specific thresholds. Thus, one observes a response in category if the latent variable is between thresholds and . For an illustration see Figure 1, which shows the density of the latent variable if . It is easy to derive the cumulative model
[TABLE]
For the logistic distribution function and one obtains the simple binary Rasch model . The general model with is equivalent to Samemija’s graded response model (Samejima, 1997, 2016).
It is implicitly assumed that the person parameters are on the same latent scale as the item parameters . The item parameters have a distinct interpretation as thresholds on the latent scale and the person parameters represent the location of the persons on the latent scale. In proficiency tests the person parameter can be interpreted as ability and the item parameters as difficulties. Person has location on the latent ability scale, but ability is blurred by the noise variable . Nevertheless, primarily the location determines the observed probabilities. If is large the probability will be large and the probability will be small. For small the order is reversed. For fixed the difference determines if the probability tends to be large or small.
2.2 Binary Responses in the Cumulative Models
A special feature of the model is that the binary Rasch model (for logistic) holds if categories are collapsed into two groups. More specific, if one considers the grouped response if and , otherwise, one obtains for the binary Rasch model with predictor ,
[TABLE]
For the link between the binary variables and the response in categories one obtains
[TABLE]
where is the last of the sequence of binary variables with an value 1. All the sequences that result have the property that a sequence of ones is followed by a sequence of zeros, which reminds of a Guttman scale. Therefore, random variables with and will be called Guttman variables (they have a Guttman structure), the corresponding Guttman space is defined by . The binary variable contains the information if a response is larger than or equal to category or not. If one knows that the response is in category or in a higher one.
2.3 Binary Models as Building Blocks
The cumulative model can also be constructed from binary variables. Let the binary variables be defined as above by if and , otherwise. If one assumes that the binary response models hold for one obtains immediately the cumulative model . Thus, the binary variables can be considered the building blocks of the cumulative model.
It should be noted that the binary variables used here are defined with reference to the ordinal response. Therefore, one implicitly assumes that an ordinal response is present. As shown in the following section one can also construct the model by starting with a specific set of linked binary models and then define an ordinal response.
Deriving the Model Without Using the Ordinal Response
Cumulative models can also be constructed from binary response models without reference to an already given ordinal response. Consider a set of binary responses . Let us assume for the binary variables:
- (a) Binary models hold for .
- (b) The binary variables are Guttman variables, that is, holds for .
As will be shown one can construct an ordinal response model from these binary variables. This is possible since the assumption that the binary variables are Guttman variables implies an order.
Proposition 2.1
Let assumptions (a) and (b) hold. For the ordinal response variable defined by
[TABLE]
one obtains the cumulative model
[TABLE]
iff
[TABLE]
To illustrate why this holds let us consider the case of three categories . Then, one has two binary variables . Let , , denote the probabilities of obtaining 1 in the binary variables. In the contingency table given in Table 2 all the probabilities are determined by the marginal probabilities (which are fixed) and the assumption that one entry has probability zero, which follows from assumption (b).
This is a valid contingency table only if , which corresponds to the condition (1). For one obtains immediately the probabilities as
and therefore , , and . Since by assumption one obtains the cumulative model.
Proposition 2.1 shows that a cumulative model can be constructed from a sequence of binary variables. For the marginal distributions of the binary variables binary Rasch models (for logistic) are assumed to hold with all of the models containing the parameter . It means that the cumulative model does not only contain binary Rasch models but can also be built from them. The condition (b), that is, for , means that the sequences of binary variables have the form . The binary variables are not independent, and are, in particular, restricted by for . The latter condition is equivalent to the ordering of the thresholds, for . The binary variables also show that the parameters can be interpreted as the parameters of binary Rasch models, in particular the parameters represent difficulty parameters if one has a proficiency test.
The derivation of the cumulative model from binary variables that are not defined by an ordinal response but yield an ordinal response might seem not to be crucial for the interpretation of models. However, it should be seen in connection with the motivation of the partial credit model, to be treated later. Andrich (2013) tried to motivate the partial credit model by considering binary variables in a similar but nevertheless distinctly different way (see Section 4.3).
Let us finally comment on the role of the Rasch model when deriving an ordinal response model from binary models. In the derivation it was assumed that the binary variables follow a Rasch model. However, one can drop this assumption and only postulate that holds for (assumption (b)). Then one obtains for the ordinal response defined by the link provided holds for . Thus the construction of a cumulative model does not have to assume a specific model for the binary variables. If, however, one specifies the marginals by a binary model, that is, one obtains the corresponding cumulative latent trait model.
In summary, it is seen that the existence of binary variables, for which and hold is a necessary and sufficient condition for the cumulative model.
3 Sequential Models
We start with a traditional derivation of the model and then consider the binary variables that are linked to the model.
3.1 Derivation of the Model from Consecutive Steps
The sequential model is typically used in proficiency tests where a person tries to solve an item in several steps with the scores representing the levels of the solution. A classical example considered by Masters (1982) is the item . Three levels of performance may be distinguished: No sub-problem solved (level 0), 9.0/0.3 = 30 solved (level 1), solved (level 2), (level 3). The main feature of the sequential or step model is that the stepwise manner of solving an item is exploited.
In each of the consecutive steps a binary response model applies. Let denote the binary response in the -th step. In the first step the respondent tries to master the transition from level 0 to level 1. The corresponding model is
[TABLE]
If the person does not master the first step, the process stops and one observes . Otherwise, and the person tries to solve the next step. In general, the -th step, which is only tried if the previous steps were successful, is determined by
[TABLE]
If the person does not master the step, the process stops and one observes . Otherwise, and the person tries to solve the next step. It is straightforward to derive the sequential model
[TABLE]
The process model is illustrated in Figure 2 for response categories .
The important feature of the model is that the transition from one level to the next is modelled conditionally (given the previous level has been reached). For each conditional step the binary model could also be motivated by latent random variables , where is a noise variable with symmetric continuous distribution function . Given the -th step is reached one has , or equivalently , if , and () otherwise. Thus, in proficiency tests it is straightforward to interpret the person parameter as the ability and as the difficulty of the -th step. The parameter can be seen as a threshold that has to be mastered in the -th step. No ordering of the difficulties/thresholds is assumed. For example, the third step can be easier than the second step (). However, the person has to solve all the previous steps before being able to try to master the third step. The ordering of the measurement is due to the sequential nature of the model.
For the probabilities one derives
[TABLE]
where . The probabilities also reflect the sequential nature of successful steps with probabilities followed by one unsuccessful step. In the th step the probability of failing is .
The cumulative model has been motivated by a latent variable and then binary models that can be used to construct the model have been considered. The motivation for the sequential model given here is different. It starts with the binary variables, which represent the conditional transitions. Thus, the model itself is constructed by using binary models.
It is straightforward to show that the binary variables, which determine the transitions, can also be derived from the sequential model (2). One assumes that the sequential model holds and uses the dichotomization into groups and given to define
[TABLE]
These are exactly the conditional variables given in Table 1.
The first version of this model was considered by Molenaar (1983). Tutz (1990) considered a parametric model under the name sequential model, Verhelst et al. (1997) called the corresponding model step model. The model is strongly related to the class of continuation ratio models, see, for example, Agresti (2013), Tutz (2012). Measurement properties of the model were investigated by Hemker et al. (2001).
3.2 Derivation from Binary Variables
In the previous section the resulting ordinal response was explained by consecutive steps that model the transitions between categories. However, as for the cumulative model one can also start with Guttman variables and then construct an ordinal response without using that an ordinal experiment has been executed.
Let us again consider a set of binary variables , , and assume
- (a) A conditional binary model holds for .
- (b) The binary variables are Guttman variables, that is, holds for .
An ordinal response model can be constructed in the following way.
Proposition 3.1
Let assumptions (a) and (b) hold and the ordinal response variable be defined by
[TABLE]
where, for completeness, is defined by . Then, for the sequential model holds, that is,
[TABLE]
It is not hard to see that Proposition 3.1 holds. It follows from assumption (a) that all vectors show a Guttman structure, that is, they have the form . The definition of is equivalent to
[TABLE]
That means if is the last in the sequence of Bernoullis , that has response 1. Since the binary variables have a Guttman structure, one can also use the sum of the binary variables, to define the ordinal response. One uses the general result
[TABLE]
the property , and , which follows from assumption (b), to obtain the probabilities
[TABLE]
which are the response probabilities of the sequential model.
The sequence of observations is at the heart of the sequential model. It directly reflects the successive transitions of successful steps followed by an unsuccessful step. The sequence can also be seen a Markov chain that stops if the absorbing state 0 is reached.
It is interesting to compare the construction of the cumulative and the sequential models from binary variables. In both derivations the main assumption is the Guttman structure. No further assumption is needed when defining an ordinal response that yields the sequential model, whereas one needs the probabilistic monotonicity assumption to construct a cumulative model. Of course, the role of the Rasch model differs. In the cumulative model the marginal distribution of the Guttman vectors follows a Rasch model, while the Rasch model is assumed for the conditional transitions in the sequential model. It should be noted that for both models the ordinal variables as well as the involved binary variables are observable.
4 Adjacent Categories: The Partial Credit Model
In contrast to the cumulative and sequential model, for which interpretation of the parameters is straightforward, the interpretation in the adjacent categories model is much more challenging. In the following, we start with a brief description of the partial credit model and consider the motivation of the model later.
4.1 The Partial Credit Model
The most prominent member of the family of adjacent categories models is the partial credit model (PCM), which was proposed by Masters (1982), see also Masters and Wright (1984) and extensions given by Glas and Verhelst (1989), Von Davier and Yamamoto (2004). The basic PCM assumes for the probabilities
[TABLE]
where is the person parameter and are the item parameters of item . For notational convenience the definition of the model implicitly uses . With this convention an alternative form is given by
[TABLE]
The defining property of the partial credit model is seen if one considers adjacent categories. For the comparison of two adjacent categories one obtains immediately
[TABLE]
However, it is often more useful to consider the representation
[TABLE]
where is the logistic function. It shows that the model is locally (given response categories , ) a binary Rasch model with person parameter and item difficulty . The general form of the model is
[TABLE]
where is a continuous cumulative distribution function. If is the logistic function one obtains the partial credit model. For simplicity we assume in the following that is the logistic function, although the derivations hold for more general distribution functions.
Various properties of the PCM have been used to obtain an interpretation of the parameters. For example, it is easily seen that for the probabilities of adjacent categories are equal, that is, . Moreover, response curve modes are always ordered. However, properties of this type seem not sufficient to obtain a satisfying interpretation of parameters.
While the cumulative model is derived as a coarser version of an underlying latent trait and the sequential model as a model for consecutive steps an underlying process model for the PCM seems harder to obtain. Possible models will be considered later. First we consider the binary variables contained in the model.
4.2 Binary Responses in the Partial Credit Model
Consider the conditional binary variables defined by
[TABLE]
It is easily seen that if the PCM holds one has for the binary variables
[TABLE]
Therefore, the variables defined in (5) are the binary variables contained in the PCM, which are also given in Table 1. It is straightforward to show that one can also start from the variables defined in (5) and derive from them that the PCM holds. In this sense the binary variables can be considered the building blocks of the PCM and can be used to interpret the parameters.
In contrast to the cumulative model the sequential and the adjacent categories model contain conditional binary models, in the case of the sequential model the condition is , and in the adjacent categories model. For the sequential model the conditioning comes from the modelling of consecutive steps which yields a straightforward interpretation of the model and its parameters. The conditioning in the adjacent categories model is much harder to handle, it is less distinctly motivated and gave rise to severe misunderstandings of the model.
In the early papers on the PCM it has been misunderstood as a model for consecutive steps. Masters (1982) explicitly considered the example of solving the item mentioned previously and explained the consecutive steps. It was said, for example:
{addmargin}
[0.5 cm]-0.0 cm The third step in item , for example, is from level 2 to level 3. The difficulty of this third step governs how likely it is that a person who has already reached level 2 will complete the third step to level 3. (Masters (1982), p. 157)
It seems that nowadays it is widely accepted that the partial credit models is not a model for consecutive steps. The model for consecutive steps is the sequential models, which is a distinctly different model, for further discussion see, for example, Tutz (1990), Verhelst et al. (1997), Verhelst and Verstralen (2008), Andrich (2015). In later presentations the conditional structure of the PCM has been acknowledged. Masters (2016) states correctly that the local comparison is the heart of the PCM, one conditions on a pair of adjacent categories and so eliminates all other response probabilities from consideration.
One consequence of the conditional definition of the binary variables is that they are not observable. If, for example, one observes , the outcome of will not be observed since it postulates the condition . Nevertheless, one can try to investigate the structure of the potential outcomes of the sequence . If takes value , one knows which values the binary variables and take. The former distinguishes between and the latter between . If the binary variables have to be and . That means the sequence has to be . Thus, adjacent binary variables yield outcomes of the form .
For the interpretation of parameters the conditional representations (3) and (6) are crucial. Since locally (given categories ) a Rasch model holds one might use the interpretation of parameters that is familiar from the Rasch model. In an achievement test represents the ability, and is the difficulty of observing rather than given the alternatives . What makes the interpretation less satisfying than for cumulative and sequential models is that one cannot drop the conditioning. Therefore, one has also to be cautious when trying to interpret the binary models as models for the transition from category to category determined by the threshold . They might be seen as transition models, but again only locally. The representations (3) and (6) show only local properties although changing one of the thresholds changes all of the probabilities. The response in any category is a function of all the thresholds. Alternative motivations and derivations of the PCM that have been given are considered in the following.
4.3 Alternative Derivations of the Partial Credit model
The local nature of the PCM makes it hard to find a model for the process by which a person reaches category. Therefore, in their study of invariance properties of various ordinal models Peyhardi et al. (2015) even conclude that ”‘adjacent models cannot be interpreted using latent variables”’. However, this conclusion lacks substance. It is not hard to find a process that determines the total response probabilities. It is tempting to imagine an underlying response process as a two-step model. In the first step a person chooses a pair of adjacent categories, in the second step he/she chooses one of the two categories. Since there are pairs of adjacent categories in the first step one needs a probability function that chooses among the pairs. In the second step, let the model for choosing rather than , given were chosen in the initial step, be the local Rasch model (3). Since one considers two steps one might suspect that the distribution in the first step could be chosen freely. However, the problem is that the distribution on the adjacent categories in the first step is already fixed if one assumes the Rasch model in the second step. Therefore, person and item parameters already determine the probabilities in the first step and the process offers no additional insights.
In the following alternative and more potent derivations of the PCM are considered and properties of the involved binary variables are investigated.
Deriving the Partial Credit Model from Independent and Guttman Variables
Andrich (2013) gave an expanded derivation of the threshold structure of the polytomous Rasch model, which was already considered in Andrich (1978). In his article he strongly criticizes the findings of Adams et al. (2012) as misunderstandings centered around the so-called ”‘threshold disorder controversy”’. In doing so he considers various response spaces, among them the Guttman space, which is of special interest here because it provides an alternative derivation of the PCM. In the following we first sketch the basic results.
Let us consider independent Bernoulli variables, , , , which follow a Rasch model
[TABLE]
However, instead of observing one performs the following Guttman space experiment:
- (1) Observe the realizations of the independent Bernoulli variables .
- (2) If for any one observes and the sequence of observations is considered as not valid.
The effect is that one does not observe the possible vectors , , but a reduced set of observations in the Guttman space, which is given by . It contains all vectors that have the form . Thus if one variable takes value zero the following variables also have to take the value zero. It is essential to distinguish between the independent Bernoulli variables and the variables that yield observations in the Guttman space. Let denote the binary variables of the Guttman space, which are distinctly different from the independent random variables . The latter are the sequence of independent variables used to construct the response space. The former are the random variables representing the outcome in the Guttman space (after deleting the non valid outcomes of the original independent variables). The Guttman variables are obtained by conditioning on for all .
It is important that the definition of the Guttman space used here postulates only for all (and ). It is not assumed that the are a sequence of successes and failures, for which one assumes an ordering of the thresholds of the Rasch models that determine single responses. One of the problems with Andrich’s derivations is that when he uses the term Guttman space he implies an ordering of thresholds, which he calls required ordering. This “intended ordering of thresholds” is not needed for the derivations in the following and might be a source of confusion. Ordering of thresholds might be a consequence when assuming a Guttman space, as has been shown in the derivation of the cumulative model, but is not needed for the definition of the space. A Guttman space as used here is a space of responses that has to be distinguished from a Guttman scale obtained for single items. We use the term Guttman scale only if an ordering of thresholds is postulated.
One obtains the following proposition, which has also be given by Andrich (2013), although in slightly different form. A proof is given in the Appendix.
Proposition 4.1
If one defines the random variable , which is the sum of the Bernoullis in the Guttman space, one obtains for the PCM.
The result is remarkable in several ways. First, the PCM is derived from the assumption of binary variables. The binary variables are defined without referring to a random variable with outcomes. Instead the variable is defined by the (sum of) binary variables. Second, one obtains the PCM from underlying independent binary variables, which seems surprising. However, one should not forget that the PCM is obtained by postulating the restriction to the Guttman space with representing the numbers of outcomes ’1’s that have been observed as a sequence of ’1’s not disturbed by zeros. Without this restriction one would not obtain the PCM.
If , that is, if one has the sequence , with the last 1 occurring in the -th component it is tempting to interpret it as a sequence of successful steps (the ones) followed by a sequence of unsuccessful steps (the zeros). However, one should be aware that one considers the sequence in the Guttman space, not in the space used for the construction, which is . It is slightly irritating that Andrich (2013), when defining the sum of Bernoullis, does not clearly distinguish between these different random variables (see equ (5) and (8) in Andrich (2013)). Although the sum defined in Proposition 4.1 takes the same values as the random variable , namely , the distributions of and differ, and consequently . Confusing and is dangerous because is built from dependent Bernoullis. If one does not distingiush between these sums one might try to interpret the response from the properties of the underlying experiment with independent observations.
It is instructive to investigate the distributions of the random variables implicitly defined by the Guttman experiment, which are far from being independent. Let us first consider the case of three categories with two independent binary variables . The left panel in Table 3 shows the probabilities in a -contingency table. The marginal distributions of are specified by a binary Rasch model,
[TABLE]
or, equivalently,
[TABLE]
According to the assumption of independence the probabilities in the table are given by , .
The conditioning on yields a distribution of that is quite different from the distribution of . In the right panel of Table 3 the probabilities are shown. One cell is empty, the probabilities of the other cells are given by
[TABLE]
where . It is immediately seen that and are not independent.
One obtains for the marginals , , which for simplicity are represented by the odds,
[TABLE]
[TABLE]
Therefore, the marginal distribution of depends on and the marginal distribution of depends on . The odds for are the odds of the Rasch model for modified by . Accordingly the odds for are the odds of the Rasch model for modified by .
However, it is interesting to consider the conditional distributions of the variables . One can derive the following proposition (a generalization is given later).
Proposition 4.2
For one obtains the Rasch-type conditional distributions
[TABLE]
That means that specific conditional distributions follow a Rasch model while the marginals of do not. It is easily derived how the binary variables are linked to the ordinal response in a PCM. Let us consider the first equation. The sum takes values from . Under the condition the variable is restricted to values , and is equivalent to . One obtains {1,2}. Since a similar transformation holds for the second equation, it is seen that the variables are just the conditional variables obtained as the binary variables contained in the PCM as derived in Section 4.2 (equ. (5)). They represent the Rasch models with local interpretation.
Proposition 4.2 is useful to clarify the difference in the Guttman space that is used here and the Guttman spcace that was used in the derivation of the cumulative model from binary variables. For the derivation of the cumulative model in Section 2.3 it was assumed that the marginals follow binary Rasch models. Here, the binary Rasch models are found as conditional models, not for the marginals, although they were assumed for the marginals of the underlying independent variables. Nevertheless, in both derivations, for the cumulative model and the PCM, the Guttman structure of the binary responses is postulated.
For illustration we consider a concrete example with only three ordered categories for the total response. As Andrich (2013) we use a specific example from Harris (1991) in the assessment of essay writing with respect to the criterion of setting. Table 4 shows the categories Fail, Pass, and Distinction in the assessment of proficiency, which are obviously ordered categories.
One might try to interpret the components of the random vector as steps or transitions from one level to the next. They seem more suitable for an interpretation as successive transitions than the variables because for them holds. In the example considered here one has two transitions. The first transition, , is from Fail to Pass (level 0 to 1), the second transition, , is from Pass to Distinction (level 1 to 2). If one accepts the interpretation of transitions, the conditional probabilities in the proposition could be interpreted as follows. If a person succeeds in the first transition from the lowest to the medium level () the conditional probability of succeeding in the second transition follows a Rasch model. If a person fails in the second transition from the medium to the highest level () the conditional probability of succeeding in the first transition follows a Rasch model.
It has to be emphasized that the transitions are definitely not steps that are tried in a consecutive order, there is no ordering in the sense that is performed before . It is crucial to understand that the variables are strongly associated but cannot be considered as consecutive variables as the Markov process in the sequential model. They have to be considered as strongly linked ”simultaneous” transitions. In a Markov process one typically considers the conditional distribution of random variables given the past. Here also the conditional probability , which is a conditioning on the second transition, is relevant. It should also be noted that the Rasch model that holds for does not mean that the second transition is determined by a Rasch model given the person is unable to perform the second transition. The condition is given , that is, that the second transition is not successful.
The General Case
The properties of the PCM given so far are also found (in a slightly different form) in Andrich (2013). This is not the case for the results derived in the following. Let us start with the generalization of Proposition 4.2. Let the marginals of the independent variables given in (7) be denoted by
[TABLE]
One obtains the following result (for a proof, see Appendix).
Proposition 4.3
For the binary variables of the corresponding Guttman space one obtains the Rasch-type conditional distributions
[TABLE]
The Rasch model is found for conditional probabilities of the event given the “previous” response takes value 1 and the “later” response takes value 0. It shows that the Rasch model holds locally conditional on the responses in adjacent categories. As special cases one obtains for the extreme categories
[TABLE]
Thus, for the extreme Guttman variables and , the conditioning is on just one event, a “previous” or “later” one. For these are the conditional probabilities given in Proposition 4.2.
Let us briefly summarize the given derivations. We started with assuming the existence of independent Bernoullis that marginally follow a Rasch model. Then a Guttman space experiment is performed that turns the original variables into Guttman variables . An ordinal response variable is defined by and it was shown that for this response variable the PCM holds.
Andrich (2013) tries to run the argument in the reverse direction. He starts with an ordinal response variable, for which the PCM holds, and tries to infer the existence of a latent Guttman space, which is linked to independent Bernoullis. Before considering his derivations in more detail we investigate the Guttman space (or, equivalently, the Guttman variables) in the PCM. In particular, it will be shown that the Guttman variables that are linked to the PCM are much easier to obtain than in Andrich’s derivation.
The Guttman Space in the Partial Credit Model
Let us first assume that for the PCM holds. Then one can define Guttman variables by
[TABLE]
It should be noted that the definition works since only outcomes of the form are allowed. Only then the number of possible outcomes in the Guttman space is equal to the number of response categories of and the probability for all vectors from the Guttman space is determined by the probabilities , and therefore the variables are well defined. It is noteworthy that in the definition of the Guttman vectors no conditioning is needed. In Table 5 the correspondence between and the binary variables is given for the case of three response categories.
From the definition of the binary variables one obtains ()
[TABLE]
Since the PCM is assumed to hold one has
[TABLE]
which are the equ. (8) from Proposition 4.2. As shown in the Appendix one obtains in the general case:
Proposition 4.4
Let the PCM hold and be the Guttman variables defined by (11). Then, one has
[TABLE]
The condition (12) is the same as obtained when assuming independent Bernoullis and then conditioning on the Guttman space (equ. (9)). Therefore, when assuming that the PCM holds one can construct a Guttman space, or equivalently, define Guttman variables, for which condition (12) holds.
It is crucial that also the reverse holds. One can start from Guttman variables, for which condition (12) holds and derive that the PCM holds. This is seen from the following proposition, which can be easily derived.
Proposition 4.5
Let be Guttman variables, for which (12) holds. Then, for the sum the PCM holds.
In summary, the propositions 4.4 and 4.5 yield the structure
{addmargin}
[0.5 cm]-0.0 cm
[TABLE]
Thus, it has been established that the existence of an observable Guttman space, for which (12) holds, is a sufficient and necessary condition for the PCM. In particular, when deriving the PCM the independent variables are not needed. It is sufficient to start from Guttman variables for which the condition (12) holds.
Since the condition is central to the Guttman space that is linked to the PCM let us consider its meaning more closely. The conditioning on responses of the adjacent variables in (12) reflects the local nature of the PCM. It shows that a binary Rasch model holds for the preference of category over given is preferred over () and is not preferred over (). Thus preferences are modeled locally given specific preferences in adjacent categories are present or not. Condition (12) can be seen as a motivation for an underlying process model. When a person compares all pairs of adjacent categories and decides that he/she prefers over but definitely does not prefer over the tendency to choose among the pair is determined by the binary Rasch model. However, in the local modeling actually four categories are involved, .
Alternative Derivations of the Guttman Space from the Partial Credit Model
In the previous sections it has in particularly been shown how the PCM can be derived from the assumption of independent Bernoullis and conditioning to obtain Guttman variables. Andrich (2013) also investigates the reverse derivation, he aims at “establishing the Guttman response space from the structure of the PCM” and tries to show that “the Guttman structure with a hypothesized threshold order is a necessary and sufficient condition for the PCM” (p.99).
Let us consider first the aim to establish the Guttman response space from the structure of the PCM. Andrich (2013) uses the following proposition that seems useful (see also Luo (2005)):
Proposition 4.6
Let . The probabilities of the PCM can be represented as
[TABLE]
where .
The proposition indeed suggests that there is a Guttman space behind the PCM. However, as has been shown in the previous section the Guttman variables can be defined directly. The indirect way to infer the Guttman structure, which is briefly considered in the following, is much more difficult to understand.
In his derivation of a Guttman space, Andrich (2013) starts with a random variable and lets denote the probability of the response in the space . The space is deliberately left undefined. It cannot be meant to be the space of the realizations of because this space is trivially the set . However, one can assume that it is the underlying probability space from which the random variable is obtained as a mapping . In the next step he ”defines” latent, dichotomous variables by
[TABLE]
A space is introduced but not defined. Only the subscript signals that it must be linked to the values . Thus the conditioning can only mean conditioning on or, equivalently . Andrich (2013) lets the in Proposition 4.6 be defined by and infers that holds. Although this is not needed to obtain the result in Proposition 4.6, it is used to infer the space . It turns out that is the Guttman space and is the Guttman space conditioning on responses in .
The derivations seem unnecessarily complicated. The use of response spaces is not always clear and tends to confuse most readers unfamiliar with the concept of spaces. As has been shown above one can derive the Guttman variables directly, they can be defined from the other random variables without reference to spaces.
Andrich (2013) goes one step further, he wants to infer the existence of independent binary variables that yield the variables as conditional responses, in his words to “infer the existence of a hypothetical, complete space of which is a subspace”. However, given the PCM one may define the variables in the Guttman space, which has been done above, but it seems hard to define independent response vectors , that yield the variables as conditional responses. Consequently, they were not explicitly defined in Andrich (2013). What can be done is to start from independent variables with the right probabilities and then derive conditional variables. This is exactly the derivation given in Proposition 4.1. It means that by assuming the right probabilities for the independent variables one obtains the Guttman variables as the conditional responses. It amounts to an embedding of the Guttman variables into a space of independent variables. But the starting point are the independent variables, their existence is not inferred by construction, it is just shown that one can postulate variables that are compatible with the PCM. From this embedding one can not infer that the “PCM with a hypothesized threshold order is a necessary and sufficient condition for the PCM”.
Andrich (2013) concludes that the Guttman space is manifest (observable) but the conditional space is latent. Since spaces are used it is not obvious which random variable are actually considered. Therefore, let us again consider the condition , which is the central condition in the Guttman space derived in Proposition 4.5. It has been shown that the condition can also be given in the form
[TABLE]
Therefore the condition is equivalent to postulating , where
[TABLE]
The variables are well defined but in contrast to they are not observable. If one observes one knows that , which locally compares and , took value 1 and , which locally compares and , took value 0. Therefore, one obtains . Since the condition varies across variables nothing is known about the value of other pairs of adjacent variables. It should be noted that the variables are equivalent to the conditional variables defined in (5). They seem to be the variables that determine the the conditional space considered by Andrich. However, observable or not, they are not Guttman variables, because they always have the form .
Let us make some further remarks. First, Guttman spaces are by no means an exclusive feature of the PCM. As has been shown in previous sections Guttman spaces occur also in derivations and representations of the cumulative and sequential models. Starting from these models one can always define Guttman type variables. Second, in none of the derivations concerning the PCM it was assumed that thresholds are ordered, which is important since Andrich concludes that thresholds in the PCM have to be ordered.
4.4 The PCM as a Model for Ordered Responses and the Ordering of Thresholds
It is obvious that the cumulative and the sequential models by construction are ordered response models. Also the PCM explicitly exploits that categories are ordered. This can be seen from the binary models contained in the model. If one considers the conditional response given categories one obtains the binary Rasch model . Thus, if the person parameter increases the probability of observing the higher category increases. Since the property holds for all pairs of adjacent categories definitely an ordering of categories is assumed and exploited by the model.
Moreover, the PCM is stable under the reverse permutation but not under other permutations, which makes it an ordinal model. It should be noted that ordering of categories is used but not in the sense of upward or downward ordering. More concrete, it can be shown that the model also holds for the reversed order obtained from the reverse permutation , . This property holds not only for the PCM but for all adjacent categories models with symmetric function , see Peyhardi et al. (2015) for a careful analysis of invariance properties of ordered response models.
Alternative and useful concepts of ordering were considered by Adams et al. (2012) and dismissed by Andrich (2013). Andrich (2013) embeds his definition of order ”in the intended relative difficulties of the thresholds in the resolved design”. The resolved design is considered a ”thought experiment”, it ”imagines judges making independent decisions”. It corresponds to the independent variables used to derive the PCM.
By postulating ”that the empirical ordering of categories is equivalent to the empirical ordering of successive thresholds that define adjacent categories on a continuum” (Andrich (2013), p 88), he links the order to the thresholds. This makes sense only because he argues that thresholds have to be ordered, which is just a postulate but nothing in the structure of the PCM demands ordering.
The main problem with the derivations of Andrich (2013) is the link between the so-called resolved design and the Guttman space. With reference to the three categories case he states {addmargin}[0.5 cm]-0.0 cm Consider a design in which a single response in one of the three ordered categories is resolved into two independent dichotomous decisions. The response might be a self-report or made by a judge. For efficiency of exposition, we use the latter case. Specifically, instead of one judge assigning each essay into one of three categories, two different judges work independently, one judge decides if each essay meets the standard at Pass, the other decides if the same essays meet the standard at Distinction. (Andrich (2013), p. 81)
.
In this ”‘resolved design”’ the two independent decisions are linked to thresholds which are assumed to be ordered, which is referred to as ’intended category order’. In Andrich’ words: {addmargin}[0.5 cm]-0.0 cm As a direct consequence of the intended category order, the a priori specification is that the threshold at Distinction is more difficult than the threshold at Pass, … It is stressed that in this design, it is the continuum that is dichotomized at two places (thresholds) with independent decisions made at each threshold.(Andrich (2013), p. 82)
.
When constructing response spaces one can indeed start with independent binary variables, and, when interpreting them as independent dichotomous decisions of judges, one might postulate ordered thresholds. After obtaining Guttman variables by conditioning one obtains an ordered response variable by considering the sum of Guttman variables. For the dependent binary variables in the Guttman space one obtains ordered thresholds, since they have been postulated from the start. The ’required order of thresholds’ in the PCM is inherited from the order in the original independent binary variables.
If one starts from the PCM, that means, without restricting thresholds to be ordered, one can define Guttman variables as in Section 4, but the independent variables cannot be explicitly defined. Although one might find independent variables that generate the Guttman variables by conditioning this brings us back to starting from the independent variables. Thus, the claim ”that inference can be made as if responses at the thresholds were experimentally independent” (p. 80) is merely supported by the fact that the PCM can be derived from independent variables.
In the derivation of the PCM a crucial step is the transition from the independent variables to the Guttman variables. There is no obvious reason why independent Bernoullis should generate only observations in the Guttman space, although it is an useful, constraint to obtain the PCM. Actually, the order the order of categories is introduced by restricting responses to the Guttman space. By assuming only responses of the form the order of categories is established, not by the ordering of thresholds as claimed by Andrich. Independent binary variables have no ordering, they can simply be permuted.
The use of the Guttman structure and Guttman space is not always clear in Andrich’s paper. In our use Guttman space means restriction to possible outcomes , without postulating ordering of any thresholds. When defining Guttman variables after starting from the PCM, one obtains a Guttman space in the latter sense. Andrich sometimes seems to use a different definition, when considering ’the Guttman structure, as a hypothesis of the threshold order’ (p.93). He emphasizes ”that because a Guttman vector could not be chosen without the specification of threshold order, the inference depends on the a priori specification of threshold order in the resolved design, which in turn arises from the specified category order in the standard design ” (p.89). However, starting from the PC without ordered thresholds yields Guttman type vectors without the specification of threshold order. He also concludes that ’the Guttman structure with a hypothesized threshold order is a necessary and sufficient condition for the PRM’(Andrich (2013), p. 99). However, as shown in Section 2 the Guttman structure with a threshold order yields a cumulative model (if one assumes the Rasch model for the single Bernoullis and uses the sum as response). The PCM follows only if one assumes the Rasch model for underlying independent variables and then assumes that a Guttman experiment is performed. It is not established that it is a necessary condition for the PCM because the PCM does not assume thresholds to be ordered.
In summary, the PCM can be derived from a Guttman space but without any threshold ordering involved. Thus the PCM does not assume that thresholds are ordered. At some points Andrich seems to agree:
{addmargin}
[0.5 cm]-0.0 cm Indeed, there is nothing in the structure, nor in the estimation process, that forces the values of thresholds estimates to be ordered. If there were such a feature, then the ordering would be a property of the model (as in the graded response model), and not a property of the data, and therefore could not disclose problems with the category ordering. The consistent argument made previously, and repeated here, is for evidence that the thresholds are ordered in the data, as disclosed by the PRM. (Andrich (2013), p. 103)
.
The statement is peculiar in itself. Data can be measured on an ordinal scale level, but data do not have thresholds, so how can ordering of thresholds be a property of the data. Thresholds are defined through the model and therefore are part of the modelling, not the data.
As stated before the category ordering is not determined by the ordering of thresholds and therefore there are no problems to be disclosed with the category order. Nothing ”goes awry” in the empirical ordering of the categories if the ordering of thresholds is violated. Of course, one has to distinguish between the ”empirical ordering” ordering of thresholds and the structure of the model. The structure of the PCM does not impose an ordering of thresholds. Thus empirically, which can only mean by estimation, one can investigate if (estimated) thresholds are ordered or not. However, a PCM with the additional assumption of ordered thresholds would be a submodel, that is, a more restrictive version of the PCM. Thresholds with an inverse order would be an anomaly in the latter model but not in the PCM.
5 Concluding Remarks
The role of binary variables in classical ordered response models has been investigated. In particular it has been shown that the Guttman structure is found in all three model types. Some caution is warranted if one considers the Guttman structure as a necessary and sufficient condition for a latent trait model since it depends on the definition of the ordered response arising from the Guttman structure.
If one has a Guttman structure and a Rasch model with ordered thresholds holds for the binary variables it can be inferred that the cumulative or graded response model holds for the sum of the Guttman variables. The reverse derivation starts from the cumulative model. If it holds one indeed finds a Guttman structure for the Bernoulli variables that use splits in the space of ordered categories. They follow a Rasch model with ordered thresholds
If one has a Guttman structure and a Rasch model for the conditional distributions it can be inferred that for the sum of the Guttman variables the sequential model holds. If one starts from the sequential model one finds a Guttman structure for the Bernoulli variables that use conditional splits in the space of ordered categories. They follow a Rasch model with thresholds that are not necessarily ordered.
If one has a Guttman structure and a Rasch model for specific conditional distributions (given in (12)) it can be inferred that for the sum of the Guttman variables the PCM holds. If one starts from the PCM one finds a Guttman structure for the Bernoulli variables defined in (11), for them locally follows a Rasch model.
We considered classical ordered latent trait models from the families of cumulative, sequential and adjacent categories models. More recently, alternative models have been proposed, in particular, tree-based models, which were considered among others by De Boeck and Partchev (2012), Böckenholt (2012), Khorramdel and von Davier (2014), Böckenholt (2017) and Böckenholt and Meiser (2017). Tree-based models assume a nested structure with the building blocks given as binary models. For them the role of binary models is obvious since they are built on them.
Appendix
5.1 Proof Proposition 4.1
In Proposition 4.1 it is assumed that for the independent variables the marginals are given by Rasch models,
[TABLE]
Let be defined by
[TABLE]
It denotes the sum of probabilities for all cells that are in the corresponding Guttman space. Then one obtains for the sum of Guttman vectors, which are defined by conditioning on the Guttman space (under the assumption that none of the probabilities are zero),
[TABLE]
Thus, one has , which means that for the PCM holds.
5.2 Proof Proposition 4.3
In Proposition 4.3 it is assumed that for the independent variables the marginals are given by Rasch models,
[TABLE]
Let again denote the sum of probabilities for all cells that are in the corresponding Guttman space, given by
[TABLE]
Then one obtains
[TABLE]
5.3 Proof Proposition 4.4
Let the PCM hold and be the Guttman variables defined by (11). Then, one has
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Adams et al. (2012) Adams, R. J., M. L. Wu, and M. Wilson (2012). The Rasch rating model and the disordered threshold controversy. Educational and Psychological Measurement 72 (4), 547–573.
- 2Agresti (2013) Agresti, A. (2013). Categorical Data Analysis, 3d Edition . New York: Wiley.
- 3Andrich (1978) Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika 43 (4), 561–573.
- 4Andrich (2013) Andrich, D. (2013). An expanded derivation of the threshold structure of the polytomous Rasch model that dispels any ’threshold disorder controversy’. Educational and Psychological Measurement 73 (1), 78–124.
- 5Andrich (2015) Andrich, D. (2015). The problem with the step metaphor for polytomous models for ordinal assessments. Educational Measurement: Issues and Practice 34 (2), 8–14.
- 6Böckenholt (2012) Böckenholt, U. (2012). Modeling multiple response processes in judgment and choice. Psychological Methods 17 (4), 665–678.
- 7Böckenholt (2017) Böckenholt, U. (2017). Measuring response styles in Likert items. Psychological Methods (22), 69–83.
- 8Böckenholt and Meiser (2017) Böckenholt, U. and T. Meiser (2017). Response style analysis with threshold and multi-process irt models: A review and tutorial. British Journal of Mathematical and Statistical Psychology 70 (1), 159–181.
