Towards quantification of incompleteness in the pairwise comparisons method
Konrad Ku{\l}akowski, Anna Prusak, Jacek Szybowski

TL;DR
This paper introduces four simple indices to quantify the impact of missing data in pairwise comparison matrices, linking incompleteness with inconsistency and sensitivity in decision-making.
Contribution
It develops and validates new indices for measuring incompleteness in pairwise comparisons, aiding quick assessment of data quality impact.
Findings
Incompleteness and inconsistency equally affect matrix sensitivity.
Proposed indices effectively estimate the impact of missing data.
Monte Carlo experiments confirm the indices' usefulness.
Abstract
Alongside consistency, completeness of information is one of the key factors influencing data quality. The objective of this paper is to define ways of treating missing entries in pairwise comparisons (PC) method with respect to inconsistency and sensitivity. Two important factors related to the incompleteness of PC matrices have been identified, namely the number of missing pairwise comparisons and their arrangements. Accordingly, four incompleteness indices have been developed, simple to calculate, each of them take into account both: the total number of missing data and their distribution in the PC matrix. A numerical study of the properties of these indices has been also conducted using a series of Montecarlo experiments. It demonstrated that both incompleteness and inconsistency of data equally contribute to the sensitivity of the PC matrix. Although incompleteness is only just one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Towards quantification of incompleteness in the pairwise comparisons
method
Konrad Kułakowski, Anna Prusak, Jacek Szybowski Konrad Kułakowski, (corresponding author), AGH University of Science and Technology, the Department of Applied Computer Science, Poland, e-mail: [email protected].Anna Prusak, Cracow University of Economics, the Department of Quality Management, Poland, e-mail: [email protected].Jacek Szybowski, AGH University of Science and Technology, the Faculty of Applied Mathematics, Poland, e-mail: [email protected].
Abstract
Alongside consistency, completeness of information is one of the key factors influencing data quality. The objective of this paper is to define ways of treating missing entries in pairwise comparisons (PC) method with respect to inconsistency and sensitivity. Two important factors related to the incompleteness of PC matrices have been identified, namely the number of missing pairwise comparisons and their arrangements. Accordingly, four incompleteness indices have been developed, simple to calculate, each of them take into account both: the total number of missing data and their distribution in the PC matrix. A numerical study of the properties of these indices has been also conducted using a series of Montecarlo experiments. It demonstrated that both incompleteness and inconsistency of data equally contribute to the sensitivity of the PC matrix. Although incompleteness is only just one of the factors influencing sensitivity, a relative simplicity of the proposed indices may help decision makers to quickly estimate the impact of missing comparisons on the quality of final result.
Index Terms:
decision making, pairwise comparisons, incompleteness, data quality, AHP
I Introduction
I-A *On comparing alternatives in pairs *
The pairwise comparisons method is referred to as a process of comparing objects in pairs to judge which of them is preferred [34]. In the PC method, the elements in a given set are ranked on a pair-by-pair basis (two at a time), until performing all of the variations. The first evidence of pairwise judgments comes from the XIII-century philosopher Ramon Llull in the context of the election systems and the social choice theory. This system was based on binary comparisons [9]. Specifically, each voting round provides sets of two candidates who should be compared in pairs, and the winner is the one who gathers a majority of voices in the highest number of pairwise comparisons. The PC method proposed by Llull was then reinvented and improved by many other scientists including the XVIII-century French mathematician and philosopher Nicolas de Condorcet [10]. In his election system, the winner (so-called the Condorcet winner) is the one who is always victorious when being compared with any other candidate. However, Condorcet proved that there might be a situation when the winner cannot exist. He provided a three-voters example (the Condorcet triplet) when is preferred over , over , and over , so finally there is no winner [Saari 2009]. The Condorcet method was used in the preference aggregation methods of C. Dodgson in 1876 [22] and A. H. Copeland in 1951, the latter being known as the Copeland’s rule, in which the candidates are ordered by the number of pairwise wins minus the number of pairwise defeats [37, 15].
Another scholar known for his contribution to the PC methodology is an American psychologist and pioneer in psychometric research, Louis L. Thurstone. In 1927 he used Gaussian distribution to analyze pairwise comparisons. His model (also referred to as the law of comparative judgments) was based on three assumptions: 1) whenever a pair of stimuli is presented to a respondent it elicits a continuous preference for each stimulus (which is discriminal process); 2) the stimulus with higher value in the comparison is preferred by the respondent; 3) these unobserved preferences are normally distributed [31]. He linked his approach with the psychophysical theory proposed by the XIX-century scholars E. Weber and G. Fechner. The Thurstonian model was reinvented in 1987 by Y. Takane, who added a random error to each paired comparison (so-called Thurstone-Takane model) [31]. In 1952 R. A. Bradley and M. E. Terry proposed an alternate model to the Thurstonian one. They defined the probability that object () is preferred to object () in a given comparison . In the psychometric approach, the Bradley-Terry model is often called the BTL model, due to its relation to the choice axiom proposed in 1959 by R. D. Luce [12, 40, 42].
The widely known application of PC method is the Analytic Hierarchy Process (AHP) and the Analytic Network Process (ANP), the multi-criteria decision support techniques developed in the 1970s by the American mathematician, T. L. Saaty [39]. Besides the AHP/ANP methods, other multi-criteria decision techniques based on comparisons of alternatives include ELECTRE, PROMETHEE or MACBETH [16]. However, only the AHP/ANP judgments result in real numbers representing the relative strength of preference [26].
Despite its long history, the PC method (especially with relation to the AHP/ANP and the pairwise comparison matrices) is among the prevalent topics in recent studies, exploring problems such as inconsistency [8], rank reversal [32, 45] and incomplete judgments [34]. These characteristics are important indicators of data quality, which plays a critical role in modern decision-making processes, especially at business and governmental level [1].
I-B Quality of data
The literature does not provide a universal set of data quality dimensions. Discrepancies in types and definitions of the quality characteristics are due to the contextual nature of data quality. According to Batini et al. [1], the six most essential classifications of data quality criteria have been provided by Wand and Wang [43], Wang and Strong [44], Redman [36], Jarke et al. [20], Bovee et al. [3], and Naumann [33]. The analysis of these classifications allowed to distinguish a set of four attributes of data quality most commonly described in the literature. They include accuracy, completeness, consistency, and timeliness.
The first of them, accuracy, is “the extent to which the data is correct, reliable and certified” [44]. Redman [36] defines this term as a measure of the proximity of a data value () to other values (). Batini et al. [1] distinguish two types of accuracy, namely syntactic and semantic, specifying that data quality methodologies only consider syntactic accuracy, indicating the closeness of to the corresponding definition domain . In DAMA report [41] data accuracy is defined as “the degree to which data correctly describes the ‘real world’ object or event being described,” and its measure is “the degree to which the data mirrors the characteristics of the real world object or objects it represents.”
Completeness was also defined in multiple ways, for example, “the ability of an information system to represent every meaningful state of a real-world system” [43], or “percentage of real-world information entered in data sources and/or data warehouse” [20]. The authors of [41] defined this criterion as “the proportion of stored data against the potential of complete” measured as “the absence of blank (null or empty string) values or the presence of non-blank values.” In the literature, completeness is often associated with missing values, which exist in the real world but not in the database [1]. This criterion is crucial in pairwise comparison context when some pairs of objects remain with no comparisons, so only partial information is available. This causes other issues such as problems with calculating inconsistency of a partially filled matrix [4]. More information on this criterion is given in Section IV of this paper.
Consistency is one of the fundamental characteristics of data quality but defined in many different ways. Blake and Mangiamelli [2] emphasized that consistency is a multidimensional concept that can be represented by three aspects: representational consistency, integrity, and semantic consistency. Representational consistency refers to the presentation of data in the same format and compatibility with other (e.g., previous) data. Data integrity requires fulfilling four constraints: entity, referential, domain and column. Importantly, violations of entity integrity may lead to redundant or incomplete data. Semantic consistency indicates no contradiction between different data values in a particular set. In the literature, the most commonly discussed are representative consistency in relation to databases and semantic consistency in relation to the introduced values. Concerning the PC method, consistency of data is often regarded in terms of inconsistency indices [6, 27]. The mathematical basis of this measures is provided in Section III.
The last but not least is timeliness. Interpretation of this attribute is different across the literature. Thus, Batini et al. [1] suggested it should be considered in a broader context, as the time-related dimension. According to Wand and Wang [43], timeliness refers to “the delay between a change of a real-world state and the resulting modification of the information system state.” Other time-related dimensions are currency, interpreted as “the degree to which a datum is up-to-date” [36] or “when the information was entered in the sources and/or the data warehouse” [20].
I-C Motivation and the organization of the manuscript
Many scientific articles deal with the inconsistency in the pairwise comparisons method. Thus, also many methods for measuring inconsistency have been proposed and thoroughly investigated. As a guide in this rich literature may serve the works [23, 5, 6]. Amazingly the same does not apply to incompleteness. Although some researchers have proposed methods for calculating the ranking for incomplete paired comparisons, the influence of incompleteness to the final result has not been sufficiently studied. One of the exceptions here can be Harker [18], but even this work does not provide us with the methods to measure incompleteness. Therefore, the purpose of this research is to determine the impact of the incompleteness of data on the correctness of the ranking in the PC method. During the work, we have identified two critical factors related to the incompleteness affecting the quality of data. These are the number of missing pairwise comparisons and the arrangement of missing comparisons. For this reason, we propose four incompleteness indices, where each of them depends on both: the total number of missing data and their distribution in the pairwise comparisons matrix. The performed Montecarlo experiments confirmed their usefulness as fast and quick tests of data quality.
The presented paper is composed of V sections including introduction (Section I) and summary (Section V). Section II outlines the theory of the pairwise comparisons method and the PC matrices, explaining phenomena such as incompleteness, inconsistency, and sensitivity. Section III presents four groups of incompleteness indices (, , tree index, and the compound index) allowing for determining to what extent a given PC matrix based ranking is at risk due to the incompleteness of data. In Section IV, numerical experiments are presented demonstrating relationship between incompleteness, inconsistency and sensitivity.
II Preliminaries
II-A Pairwise comparisons
The pairwise comparisons method very often is used as a way that allows experts to create a ranking based on a series of individual comparisons. The subjects of comparisons are alternatives. Beginning the ranking procedure experts compare alternatives in pairs. Then the results of individual comparisons are used as an input to the appropriate mathematical procedure, which allows computing the final numerical ranking (Fig. 1).
Let be a finite set of alternatives representing options among which a decision maker can choose. Similarly, let be a set of expert judgements about each pair , so that is the result of comparisons against . Assigning the certain real value represents the expert’s opinion that the alternative is times more important than . It is convenient to represent the set of comparisons in the form of a matrix , hereinafter referred to as the PC (pairwise comparisons) matrix. Since a comparison of a given alternative to itself does not indicate the advantage of any of the two alternatives being compared, the diagonal of is composed of ones. Similarly, in most of the cases, it is assumed that if is times more important than than also is times less important then . The latter observation leads to the equality . In such a case it is convenient to use the following definition.
Definition 1**.**
A matrix is said to be reciprocal if for all holds .
The pairwise comparisons method aims to transform the set of paired comparisons (i.e., the PC matrix) into the ranking vector (Fig. 1). Let us define the function that assigns the weight (also called as the importance or the priority) to every single alternative. Every PC matrix can also be naturally presented in the form of a graph.
Definition 2**.**
Let be a labelled, directed graph with the set of vertices , the set of edges , and the labelling function so that . is said to be induced by the matrix .
In such a graph vertices correspond to alternatives and edges correspond to the comparisons among the alternatives.
Definition 3**.**
Let the output degree of be denoted by and be given as
[TABLE]
It is easy to observe that the output degree of vertex is equal to the number of comparisons of alternative with others.
Definition 4**.**
The ranking function for is a function that assigns a positive real number to every alternative .
The role of the ranking computation procedure is to determine the value of concerning every alternative. The list of all values we will often write in the form of a transposed vector :
[TABLE]
Very often is called interchangeably as a priority or weight vector. There are several methods of transforming paired comparisons into the ranking. According to the most popular one, referred to in the literature as eigenvalue method (EVM), the ranking is formed as the appropriately rescaled principal eigenvector [38]. Thus, to calculate in EVM one have to solve equation
[TABLE]
where is the spectral radius (principal eigenvalue) of , then rescale so that all its entries sum up to .
[TABLE]
where
[TABLE]
There are a dozen other weighting methods for PC matrices [19, 46, 47, 25, 13]. Among them, the geometric mean method (GMM) deserves particular attention. According to GMM the priority of i-th alternative is formed as the appropriately rescaled geometric mean of i-th row of the matrix . Due to its relative simplicity and theoretical properties in recent times it has gained many supporters.
Example 5**.**
Consider a pairwise comparison matrix
[TABLE]
Its principal eigenvalue equals and its principal eigenvector is given by
[TABLE]
The sum of its coordinates equals , so, after normalization, we obtain a priority vector . This determines the order of alternatives: . Notice that according to EVM, alternative is slightly better than . However, since geometric means of the second and third rows of are equal, GMM assigns the same weights to both alternatives.
II-B Incompleteness
The priority deriving methods mentioned in the previous section assume that the set of paired comparisons is complete, i.e., every entry of is known and available. In practice, this condition is not always met. It can happen for many reasons. After taking reciprocity into account, the number of all possible comparisons for alternatives is . Thus, when the number of alternatives is large comparing all of them in pairs requires considerable effort. It can not always be possible, for example, because of limited and expensive work time of experts. Harker [17] also points out that an expert, when faced with a comparison between two alternatives and , sometimes would rather not compare them directly. This may happen when, e.g., they do not yet have a good understanding of his or her preferences for this particular pair of alternatives. Sometimes experts evade from the answers, especially when taking a position on the given comparisons is morally or ethically tricky, e.g., comparing mortality risk vs. cost. Finally, some data may be lost or damaged.
In response to the above problems, the methods of calculating the ranking based on an incomplete set of pairwise comparisons arose. Probably one of the most popular (and the first one) is the Harker method [17]. According to the method based on matrix , a new auxiliary matrix is created where
[TABLE]
and means the number of the unanswered questions in the i-th row of . Harker has shown that a non-negative quasi-reciprocal matrix can be used for calculation priority ranking as a replacement for an original PC matrix. The natural limitation of the Harker method is that in there must be a series of comparisons between every two alternatives and such that exist. In other words, every two alternatives must be comparable at least indirectly. Every matrix for which the above condition holds is irreducible and every graph in which the set of vertices correspond to the set of alternatives , and the set of edges so that there exists the edge in if is known and defined, is strongly connected [35]. Let us consider the following example.
Example 6**.**
Let be incomplete PC matrix
[TABLE]
hence the Harker’s auxiliary matrix is
[TABLE]
and
[TABLE]
Thus the rescaled ranking vector obtained by EVM is
[TABLE]
which means that the priority of the first alternative is , and the second and third: , , correspondingly.
The corresponding graph has been shown in Fig. 2.
II-C Inconsistency
As represents the results of the comparisons between i-th and k-th alternative, and expresses the outcome of similitude between k-th and j-th alternative its natural to expect that . However, the entries of the PC matrix represent the subjective opinions of experts and due to human imperfection may happen that . Whenever it happens, we will call such a situation as inconsistency. If the difference between and is small or happens rarely, it probably will not have much impact on the final result. However, if the difference is large and it happens relatively often, then the results of the pairwise comparisons may be considered unreliable, and hence, the ranking results may not be trustworthy. This observation leads to a question about the degree of inconsistency of PC matrix . A popular way of determining the level of inconsistency in a set of pairwise comparisons is the use of inconsistency indexes. Probably the best-known index is one proposed by Saaty in 1977 [38]. It is defined as:
[TABLE]
where is the principal eigenvector of , and is the number of alternatives. It has been proven that CI reaches [math] when the PC matrix is fully consistent, and it gets the higher values, the more inconsistent is [38]. Since then, many different inconsistency indexes have been created. A comprehensive overview of the inconsistency indexes can be found in [6, 5, 29].
II-D Sensitivity
Another factor that affects the credibility of the ranking is the sensitivity of the result. By sensitivity we mean here the extent to which the disturbance of input data can change the final result. If a little disorder can significantly modify the result, then the ranking result is unstable and, therefore, not credible (we can not be sure if the final result is not accidental). Reversely, if reasonably small changes in the input data do not cause noticeable modifications of the result, then we can trust that the result obtained is a consequence of decision-making data deliberately introduced into the system. Simply put, it can be assumed that the sensitivity can be used to determine the data quality. The problem is, however, that the sensitivity is hardly measurable. What does it mean "a small change in input"? What change, how to quantify it? What does it mean “noticeable modifications of the result”? A person who wants to deal with the sensitivity analysis must answer all these questions. For the purpose of this article, we have assumed that the inconsistency index determines the input data disturbance. To measure the extent to which the results have been modified, we use two methods: the Manhattan distance 111In [18] Harker used Chebyshev distance for this purpose. and the rescaled Kendall tau distance.
The Manhattan distance between two priority vectors and is defined as follows:
[TABLE]
This metric provides us with information what is the average difference between two different priorities assigned to the same alternatives. As all the entries of priority vectors sum up to , the result .
Very often the ranking results are interpreted only qualitatively. This means that the decision makers are interested in who is the winner, who is in the second and in the third place but not what are the numerical priorities of alternatives. Let be the mapping assigning to every ranking vector its ordinal counterpart in such a way that i-th element of indicates the position of i-th alternative in the ranking (1). For example, if
[TABLE]
then its ordinal vector is
[TABLE]
Qualitative interpretation of ranking vectors leads to a question to what extent both: and differ from each other. The answer can be the Kendall tau rank distance that counts the number of pairwise disagreements between two ranking lists [21, 14]. Let us define *Kendall tau *distance formally:
[TABLE]
where are ordinal vectors. Since the maximal value of for two n-element vectors is it is convenient to use the rescaled Kendall tau distance, i.e.
[TABLE]
so that . The rescaled Kendall tau distance is the second method used in the article for the purpose of measuring discordance between ranking results. Since, vectors produced by EVM, GMM or Harker method are not ordinal before applying they have to be transformed to their ordinal counterparts using mapping.
Sometimes the Kendall tau distance is called a Bubble sort distance. The reason is that when there are no ties their value represents the number of swaps that are done by the bubble sort algorithm [11] when transforming the first list into the second one.
Example 7**.**
Let us consider two ordinal vectors and . It is easy to observe that as the discordant pairs of indices are: . Indeed there are five binary swaps needed to transform into . They are:
, 2. 2.
, 3. 3.
, 4. 4.
, 5. 5.
.
Assuming , the rescaled value is .
III Indices of incompleteness
III-A *Incompleteness and sensitivity *
According to EVM, the priority vector meets the equation (2). In other words, the weight of every alternative meets the equation
[TABLE]
Hence, the priority of one alternative is expressed by the weighted average of all others alternatives. With this regularity, we also deal with the case of GMM [28]. The equation (5) suggests that the disturbance of one single element , assuming that the other elements have not changed, should not affect significantly the value of . However, in the case of an incomplete PC matrix, the relationships between alternatives are weakened. The priorities of individual alternatives are determined by fewer expressions in the form than normally. It suggests that the susceptibility for disturbances of the rankings calculated based on the incomplete PC matrices is higher than normal. This, of course, should translate to the usually higher sensitivity of such decision models. It means that the completeness of the matrix correlates with the sensitivity of the method. The more comparisons are available, the less vulnerable the model is. One may ask whether the number of missing elements is not enough as an index? To answer this question let us consider the following two PC matrices with three (six, when the reciprocal elements are taken into account) missing comparisons.
[TABLE]
[TABLE]
In the first matrix is compared only with . Thus, disturbance on completely changes the value . In the second matrix is compared with and . Therefore, the same disturbance on will have less impact on the priority . In Section IV this intuition will be confirmed by the Montecarlo experiment. The above consideration leads us to the conclusion that the completeness index, that would be useful in determining the sensitivity of the decision model, should also take into account the arrangement of missing comparisons.
III-B -index
In PC matrix a single alternative can be compared with at most other alternatives. Therefore, the maximal value of for is (see Def. 3). Similarly, the number of missing comparisons is given by . Because the desired behavior is that the newly constructed index should be higher for than for the higher value of the expression for some particular should contribute more to the value of the index than two or more smaller expressions. To achieve this let us raise the expression to a positive real number . Thus, the expression
[TABLE]
combines two features together. It raises when the number of missing comparisons increases and providing that there are two matrices of the same size and with the same number of missing comparisons it is higher for this matrix that has larger irregularities in the distribution of missing values. Let us compute the mean of missing values raised to . As a result, we get the formula:
[TABLE]
which preserves both important features and its value is bounded and varies within the range . Hence, in order to get the final form of the index let us divide (8) by , i.e.
[TABLE]
It is clear that . When the PC matrix is fully incomplete, i.e. there are no comparisons between alternatives, is [math]. Reversely, if is complete, i.e. all the alternatives are defined, equals . Providing that the PC matrix is reciprocal, every alternative has to be compared with at least one different alternative. The maximal value of that allows to create the ranking is reached when just one alternative is compared with all the others. Then it is given by . Condition
[TABLE]
is necessary but it is not sufficient. Hence, there may exist PC matrices for which is smaller than but, in spite of this, one can not create the ranking.
Example 8**.**
Consider matrices and given by (6) and (7). Let us calculate their -indices:
[TABLE]
[TABLE]
As we can see the index of the first matrix is greater than the index of the second one, which reflects the fact that the distribution of the missing items in the rows of is more aligned than in . However, both indices are quite small, as both matrices lack of only elements (out of ).
III-C * index *
According the old adage “a chain is only as strong as its weakest link”. Following this common sense observation the second index does not consider the average number of missing comparisons for all alternatives but it focuses on the maximum number of missing comparisons for a single alternative:
[TABLE]
Because we can not omit the total number of comparisons the “weakest link” in the form of the above formula has to be combined with the sum:
[TABLE]
Thus, the proposed index gets the form:
[TABLE]
where the multiplier is introduced only for the purpose of fitting the index value to the segment . It is easy to observe that is [math] when the matrix is complete. Reversely, if there are no defined values in the matrix except its diagonal.
Example 9**.**
Similarly as before let us consider and given by (6) and (7). Their indices (where ) are
[TABLE]
[TABLE]
Similarly, as in the case of -index, the matrix gets the higher values of the index than the matrix . Both values, however, are quite small as only six elements (out of ) are missing.
III-D Tree index
As [24] shows, the existence of a spanning tree in the graph associated with an incomplete pairwise comparison matrix is a necessary condition to generate its missing values, and what follows, to create the ranking. Of course, the more spanning trees we have, the more reliable the data we obtain. The Cayley’s formula [7] states that the number of all spanning trees in a complete graph with vertices is equal to . If we consider a complete PC matrix, its incompleteness index should be equal to 0. The index should raise with the reduction of the number of spanning trees. However, as we remove the matrix entries one by one, the number of trees decreases exponentially from to [math]. To slow down its drop occurring when we remove the PC matrix elements, it is desirable to divide it over and apply the root to the ratio. These simple observations lead us to the definition of an alternative incompleteness indicator, which we will call the tree index.
Definition 10**.**
The tree-index of a pairwise comparison matrix is defined by the formula
[TABLE]
where denotes the number of the spanning trees in a graph associated with the matrix .
Remark 11*.*
Notice that if and only if is complete i.e. has got all elements. On the other hand, if and only if is disconnected, which means that we cannot create a priority vector based on the elements of .
According to the Kirchoff’s Theorem [30] the number of spanning trees in a connected graph with vertices can be computed as any cofactor of the Laplacian matrix of , whose elements are given by the formula:
[TABLE]
Example 12**.**
Once more, consider matrices and given by (6) and (7). The corresponding graphs and are given in Fig. 3.
Their Laplacian matrices are as follows:
[TABLE]
and
[TABLE]
Let us compute the cofactors of the left upper elements of the above matrices:
[TABLE]
[TABLE]
According to the Kirchoff’s Theorem, graphs and include and, respectively, spanning trees. Since , the tree incompleteness indices of and are
[TABLE]
[TABLE]
Again, the index of is higher than the index of , which reflects the fact that removing six elements of a matrix reduces the number of the respective graph’s spanning trees, which lowers the reliability of the resulting priority vector.
It is important to point out that both indices may be useful as measures of incompleteness. The first one measures the location of a matrix on the line between full and (almost) empty (i.e., having only 1s on the main diagonal) matrices. The latter reflects how far a matrix is from matrices which are useless for ordering the alternatives.
III-E Compound indices
As all the indices have the same domain (PC matrices) and codomain , then their product will also be a function with the same domain and codomain. It allows us to combine one index with the other to obtain the desirable properties of both. In this context, an interesting proposal seems to be combining and ** **indices. Thus, let us define a compound -index as follows:
[TABLE]
As it will turn out in the Section IV, this product allows us to combine together a dynamics of average sensitivity represented by together with differences in sensitivity resulting from different arrangements of missing pairwise comparisons. This second feature seems to be better represented by .
IV Properties of incompleteness indices - a numerical study
IV-A Relationship between incompleteness, inconsistency and sensitivity
An entirely consistent matrix is resistant to reducing the set of paired comparisons. That is because it suffices to compare one alternative with another already ranked to precisely determine the ranking of the former. Hence, as long as it is possible to compute the ranking, i.e., the PC matrix is irreducible, the calculated ranking is the same regardless of which comparisons are missing. However, if a PC matrix is inconsistent, missing comparisons start to matter.
In order to investigate the impact of inconsistency and incompleteness to the sensitivity we randomly prepare complete and consistent PC matrices . Then every matrix from was disturbed so that we obtain sets of matrices with the increasing average inconsistency CI given as
[TABLE]
The average of inconsistencies of those groups starts from , , and finally they reach . Next, we extend every by adding irreducible incomplete matrices randomly obtained from those originally located222As irreducible matrix must have at least comparisons (we are counting only comparisons over the diagonal) then, for every inconsistent matrix , we generate incomplete matrices. there. Let us denote the extended by and its elements by , where means the number of missing comparisons and indicates the consistent PC matrix from which originated. For every we compute incompleteness indices and , the measures of sensitivity i.e. Kendall distance and the Manhattan distance .
In the Figure 4 we can see the relationship between average value of sensitivity for matrices with the given average inconsistency and the average incompleteness given in the form of the three indices , and . When the considered PC matrices are consistent i.e. then also the resulting rankings do not depend on incompleteness. The distance between rankings obtained from consistent complete and incomplete matrices is [math]. However, when inconsistency starts increasing, the impact of incompleteness becomes apparent.
The increase of both inconsistency and incompleteness translates to the increase of the average Manhattan distance. For very small values of inconsistency () the Manhattan distance is about and following the increase of it takes values near . For the larger values e.g. the value of ranges between and , and similarly for the average values of are between and . This observation indicates that the highly incomplete PC matrices are almost four times more vulnerable to the random disturbances than the complete matrices. As the maximal possible value of the Manhattan distance for vectors whose elements add up to 1 is 2, the value of means that this index reaches of its maximal value. The similar behavior can be observed for the other two indices: and TI (Figs. 4b and 4c).
The values of Kendall distance reveals the similar properties (Figs. 5a, 5b and 5c). When the inconsistency is small () the average values of Kendall distance are spanned between and for all indices of incompleteness. Then for moderately inconsistent matrices () they range between and , then for () the values of Kendall index go through to .
It means that for PC matrices with the reasonably high inconsistency we may expect that or more pairs may randomly change their order. Similarly as before the incompleteness may significantly increase (from three to four times) the sensitivity of the PC method.
IV-B Impact of the distribution of missing comparisons to the sensitivity
We may suppose that the more missing comparisons to the given alternative the more vulnerable its weight and the position in the ranking. In the extreme case, if the given alternative is compared to only one other alternative , i.e., except , where all other values in the i-th row and j-th column of are undefined, the ranking of depends primarily on . Any disturbance of can translate into significant changes in the weight of the i-th alternative. On the opposite case, the missing comparisons are evenly distributed between alternatives. It ensures the relative safety of each alternative, providing of course, that the number of missing alternatives is not too high. The above observations allow us to indicate an example of the regular and the irregular PC matrix with a fixed number of missing comparisons.
Let us number the selected entries in the PC matrix in such a way that in the first row corresponds to , and has assigned number . Similarly, in the second row gets the number and the last element in the row gets . Finally, the last element gets the number . Elements directly above the diagonal are not indexed (the above numbering scheme has been shown in the form of a matrix ).
[TABLE]
Then, in order to prepare the highly irregular (and highly sensitive) matrix with missing comparisons it is enough to remove comparisons with assigned numbers from to and their counterparts below the diagonal. For example, the highly irregular by PC matrix with missing comparisons may look like:
[TABLE]
For the purpose of creating the matrices with the most even distribution of missing values we use another numbering scheme. Let assign number to , to , to , and to . The number be assigned to , to and finally to . The last numbered element is with value of index (the regular numbering scheme is shown as the matrix )
[TABLE]
For example, the regular by PC matrix with missing comparisons is as follows:
[TABLE]
It is easy to observe that in all alternatives have two missing comparisons (so each of them is compared with the three others), while in alternative is compared only with and is compared only with and . In the worst case the disturbances of and may lead to significant weight changes of and .
The question arises to what extent the regular and irregular distribution of missing comparisons translates to the measured sensitivity, and of course to the values of incompleteness indices. In order to answer these questions, we prepared random inconsistent and incomplete PC matrices with the average inconsistency then we removed their elements according to both: the regular and the irregular pattern subsequently assuming333Note that for we get . missing elements. Then we measured the average distance of the ranking vectors obtained from and and complete and not disturbed matrix, and, similarly, we computed the average value of all four indices including the compound -index.
In the Figures 6a and 6b we can see two plots. The lower plot on both figures represents the average sensitivity of incomplete PC matrices with the missing values distributed according to the scheme. The upper plot corresponds to the average sensitivity of incomplete PC matrices with the missing values distributed according to . Both plots look quite similar. They grow as the number of missing comparisons increases, but the plot corresponding to the irregular incompleteness scheme grows faster. It is interesting to note that starting from thirteen missing comparisons the difference in sensitivity between matrices in the form and reaches almost . It shows how important for sensitivity the distribution of missing comparisons is.
In the similar way we tested all the indices. In the Figure 7 we can see plots of , , TI and correspondingly.
Although all the indices rise along the increase of the number of missing values their increase differs from plots of sensitivity. For a not very high number of missing values (here which is of all comparisons possible to remove) all the indices seem to mimic the sensitivity charts (Fig. 6). However, for the larger numbers of missing comparisons the differences between PC matrices formed according to and are important. The use of or can help in this case.
IV-C Discussion
The first experiment (Section IV-A) clearly shows that both: inconsistency and incompleteness almost equally contribute to the sensitivity of the given PC matrix. This means that when assessing the quality of the matrix its completeness cannot be ignored. On the other hand, Figures 4 and 5 suggest that when the number of missing elements is small, the impact of this deficiency on the final ranking is almost negligible. However, when a lot of comparisons are missing the ranking can be significantly changed due to incompleteness.
Since the proposed indices aim to determine not only a simple number of missing comparisons but also their distributions in way that allows the user to discover potential risks of vulnerability to disturbances. In the second experiment (Section IV-B) we analyze the influence of the distribution of missing elements to the sensitivity of the PC method and the values of incompleteness indices.
The experiments carried out show that all the indices grow (or at least do not decrease) as the number of missing values increases. Similarly all the indices get the greater values when the distribution of missing values is potentially less favorable. However, despite many similarities the values of indices and the values of sensitivity are not identical. Thus, computing and analyzing the incompleteness indices can not replace the classical sensitivity analysis. Therefore, incompleteness indices should be treated as kind of a yardstick which allows to quickly detect that incompleteness can be a problem and should be improved. The great advantage of incompleteness indices is the ease of their calculation. As all of them use the number of missing comparisons on their inputs for PC matrix we need at most operations. Performing the sensitivity analysis usually is much more time and resource consuming. Even worse, as the sensitivity analysis tries to answer the questions how the changes in the input data translate to the method outcome, it might happen that the incompleteness as an actual source of problems can be overlooked. The indices of incompleteness eliminate danger. Due to their simplicity they are great for quick and simple test of completeness of the paired decision data.
V Summary
This paper has developed four incompleteness indices for using with the quantitative pairwise comparisons method with incomplete set of comparisons. These indices can be used as fast and computationally simple data quality tests. The constructed indices have been tested in Montecarlo experiments. Carried trials showed a significant impact of incompleteness expressed by these indices to the sensitivity of the pairwise comparisons based decision model. Although it is clear that the incompleteness only is just one of the factors affecting sensitivity, the defined indices can help the decision makers to discover the risks to sensitivity having their source in the data incompleteness.
Acknowledgment
The research is supported by The National Science Centre, Poland, project no. 2017/25/B/HS4/01617 and by the Polish Ministry of Science and Higher Education.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Batini, C. Cappiello, C. Francalanci, and A. Maurino, “Methodologies for Data Quality Assessment and Improvement,” Hindawi Publishing Corporation The Scientific World Journal The Scientific World Journal , vol. 41, no. 3, pp. 16:1–16:52, Jan. 2009.
- 2[2] R. H. Blake and P. Mangiamelli, “Evaluating the Semantic and Representational Consistency of Interconnected Structured and Unstructured Data,” Americas Conference on Information Systems , 2009.
- 3[3] M. Bovee, R. P. Srivastava, and B. Mak, “A conceptual framework and belief-function approach to assessing overall information quality,” International Journal of Intelligent Systems , vol. 18, no. 1, pp. 51–74, Jan. 2003.
- 4[4] S. Bozóki, J. Fülöp, and L. Rónyai, “On optimal completion of incomplete pairwise comparison matrices,” Mathematical and Computer Modelling , vol. 52, no. 1–2, pp. 318 – 333, 2010. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S 0895717710001159
- 5[5] M. Brunelli, L. Canal, and M. Fedrizzi, “Inconsistency indices for pairwise comparison matrices: a numerical study,” Annals of Operations Research , vol. 211, pp. 493–509, Feb. 2013.
- 6[6] M. Brunelli and M. Fedrizzi, “Axiomatic properties of inconsistency indices,” Journal of Operational Research Society , pp. –, 2013. [Online]. Available: http://dx.doi.org/10.1057/jors.2013.135 · doi ↗
- 7[7] A. Cayley, “A theorem on trees,” Quart. J. Pure Appl. Math. , vol. 23, pp. 376–378, 1889.
- 8[8] V. Čerňanová, W. Koczkodaj, and J. Szybowski, “Inconsistency of special cases of pairwise comparisons matrices,” International Journal of Approximate Reasoning , vol. 95, pp. 36 – 45, 2018. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S 0888613 X 17307612
