On Pattern Setups and Pattern Multistructures
Aimene Belfodil (DM2L), Sergei Kuznetsov, Mehdi Kaytoue (DM2L)

TL;DR
This paper introduces the concept of pattern multistructures based on multilattices, expanding the formal framework for pattern mining beyond traditional pattern structures, and discusses techniques for transforming pattern setups into pattern structures.
Contribution
It proposes a new model called pattern multistructure that bridges the gap between pattern setups and pattern structures, addressing limitations in existing FCA-based pattern generation.
Findings
Pattern multistructures rely on multilattices for broader pattern representation.
Techniques for transforming pattern setups into pattern structures are discussed.
The framework extends FCA applicability to sequential and graph patterns.
Abstract
Modern order and lattice theory provides convenient mathematical tools for pattern mining, in particular for condensed irredundant representations of pattern spaces and their efficient generation. Formal Concept Analysis (FCA) offers a generic framework , called pattern structures, to formalize many types of patterns, such as itemsets, intervals, graph and sequence sets. Moreover, FCA provides generic algorithms to generate irredundantly all closed patterns, the only condition being that the pattern space is a meet-semilattice. This does not always hold, e.g., for sequential and graph patterns. Here, we discuss pattern setups consisting of descriptions making just a partial order. Such a framework can be too broad, causing several problems, so we propose a new model, dubbed pattern multistructure, lying between pattern setups and pattern structures, which relies on multilattices.…
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Data Mining Algorithms and Applications
On Pattern Setups and Pattern Multistructures
\nameAimene Belfodila, c, Sergei O. Kuznetsovb, Mehdi Kaytouea, d Contact Aimene Belfodil. Email: [email protected] aUniv Lyon, INSA Lyon, CNRS, LIRIS UMR 5205, F-69621, LYON, France; bNational Research University Higher School of Economics, Moscow, Russia; cMobile Devices Ingénierie, 100 Avenue Stalingrad, 94800, Villejuif, France; dInfologic, 99 avenue de Lyon, 26500 Bourg-Lès-Valence, France;
Abstract
Modern order and lattice theory provides convenient mathematical tools for pattern mining, in particular for condensed irredundant representations of pattern spaces and their efficient generation. Formal Concept Analysis (FCA) offers a generic framework, called pattern structures, to formalize many types of patterns, such as itemsets, intervals, graph and sequence sets. Moreover, FCA provides generic algorithms to generate irredundantly all closed patterns, the only condition being that the pattern space is a meet-semilattice. This does not always hold, e.g., for sequential and graph patterns. Here, we discuss pattern setups consisting of descriptions making just a partial order. Such a framework can be too broad, causing several problems, so we propose a new model, dubbed pattern multistructure, lying between pattern setups and pattern structures, which relies on multilattices. Finally, we consider some techniques, namely completions, transforming pattern setups to pattern structures using sets/antichains of patterns.
1 Introduction
Modern order and lattice theory provide convenient mathematical tools for pattern mining, in particular for condensed irredundant representations of pattern spaces and their efficient generation. Different formal tools has been proposed in the literature to model pattern spaces. Formal Concept Analysis (FCA - Ganter and Wille (1999)) has been proposed by Wille (1982) as a well-founded mathematical tool to models hierarchies of concepts related to some formal context (i.e. binary datasets). While basic FCA provides a natural way to analyze binary datasets, datasets with more complex attributes (e.g. numerical or nominal ones) need to be transformed to such before any manipulation. This kind of transformation has been proposed by Ganter and Wille (1989) under the term of conceptual scaling (i.e. binarizing). Yet, even if conceptual scaling is a quite general tool, binarizing a dataset with regard to some pattern search spaces is not always obvious (i.e. Baixeries, Kaytoue, and Napoli (2012), Belfodil et al. (2017)). In response to that, some other more natural tools to formalize complex pattern spaces has been proposed. One could cite Logical Concept Analysis (LCA) proposed by Ferré and Ridoux (2000) and Pattern Structures proposed by Ganter and Kuznetsov (2001). Pattern Structures allow for instance to model in a quite natural way many pattern search spaces. Indeed, itemsets, intervals (Kaytoue, Kuznetsov, and Napoli 2011), convex polygon (Belfodil et al. 2017), partition (Baixeries, Kaytoue, and Napoli 2014) pattern spaces among others (Ganter and Kuznetsov 2001; Kuznetsov 2009) can be modeled within the pattern structure framework. However, since pattern structures rely on meet-semilattices (i.e. conjunction of two patterns belongs to the pattern search spaces), some pattern spaces that are only partially ordered sets (posets) cannot be “directly” defined using such a framework.
Consider for instance the example dataset depicted in Fig. 1 containing objects described by attribute "value" and labeled positive or negative. We are interested by the task of finding “good” rules in this dataset with a description given by attribute value. Rather than considering the usual meet-semilattice of intervals as the one proposed by Kaytoue, Kuznetsov, and Napoli (2011); descriptions are restrained to open intervals of the form and or singleton (see Fig. 1 - right). Patterns (descriptions) form together a poset where is the subsumption order (i.e. if subsumes then if pattern holds for an object then pattern hold to). However, does not form a meet-semilattice. For instance, the set does not have a meet, since lower bounds of have two maximal elements w.r.t. (i.e. and ). Hence, the description space does not induce a pattern structure (Ganter and Kuznetsov 2001). It does form actually a pattern setup (Lumpe and Schmidt (2015)) which rely on a simple poset with no additional properties.
Description spaces like the one depicted in Fig. 1 are numerous. For instance, sequence of itemset patterns (Agrawal and Srikant (1995)) ordered by “is subsequence of” do not form a meet-semilattice (Zaki 2001; Codocedo et al. 2017). Sequential meet-semilattice in FCA (Buzmakov et al. 2016; Codocedo et al. 2017) refers usually to set (i.e. conjunction) of sequences rather than to the poset of sequences itself. Same holds for the graph meet-semilattice from Kuznetsov (1999). In general, the base pattern setup is transformed to a pattern structure using sets (i.e. conjunctions) of descriptions thus providing a richer pattern language. Such transformations are naturally called completions, which refers to well-known completions in ordered sets and lattice theory, like Dedekind-McNeille, Alexandroff or Antichain completions (Davey and Priestley 2002; Boldi and Vigna 2016). For instance, in Fig. 1, set of patterns which is equivalent to belongs to the antichain completion of but and not to the base description language .
Understanding properties of pattern setups independently from their completions is fundamental for answering many practical questions. For instance, consider the question “What are the best descriptions covering all positive instances?”. If better stands for more relevant than as in relevance theory (Garriga, Kralj, and Lavrac 2008), the answer will be the two best incomparable rules and rather than only one in the completion . More generally, we look for maximal common patterns (i.e. support-closed patterns as called by Boley et al. (2010)) of the positive instances which could be multiple.
Outlines and Contributions. In this paper we present the following results:
In Section 3, we start by studying pattern setups that was proposed by Lumpe and Schmidt (2015) as a tool to models pattern search spaces relying only on posets providing hence a better-understanding of this wider-class of framework. Afterwards, we revisit briefly the notion of pattern structures in Section 4. 2. 2.
In Section 5, we point-out the major problem related to pattern setups. In fact, since pattern setups rely only on posets without no additional properties, they are very permissive. Simply put, given some set of objects, they could share some common descriptions in the pattern space but none of them is maximal w.r.t. the subsumption order. We show that this problem is directly linked to the fact that the considered poset is not a multilattice. 3. 3.
In Section 6, we present multilattices and we show that all (doubly) chain-complete posets are complete multilattices but that the converse does not hold. 4. 4.
In Section 7, we propose the framework of pattern multistructures. In a nutshell, analogously to pattern structures, pattern multistructures are based on multilattices. Such a structure provides the fact that covering descriptions of a set of objects are deducible from the maximal common descriptions using the subsumption order. This does not necessarily hold in an arbitrary pattern setup. 5. 5.
Next, in Section 8 we revisit completion (i.e. transformation) of pattern setups to pattern structures and we show that the usual completion using antichain of patterns, namely antichain completion, induce a pattern structure if and only if the pattern setup is a pattern multistructure. 6. 6.
Finally, we wrap-up in section 9 and we discuss some open problems related particularly to the enumeration of extents (i.e. subset of objects that are separable in the description language) in an arbitrary pattern setup.
Please note that this paper is a thorough extension of a first paper published in CLA’18 (Belfodil, Kuznetsov, and Kaytoue 2018). The main difference with CLA’18 paper is more mathematical results and much more detailed explanations of the framework of pattern setups, pattern multistructures and their completions. Note that in this first paper, pattern multistructures were called pattern hyper-structures and were linked with the notion of hyper-lattices briefly investigated by Zaki (2001). Stefan Schmidt attracted our attention to the work of Benado (1955) on multilattices that interestingly was directly linked to pattern multistructure making this newly proposed framework more mathematically founded.
2 On Partially-Ordered Sets
To make this work self-contained, in this section we recall basic definitions and results from order theory and introduce our notations that are largely inspired by Ganter and Wille (1999); Davey and Priestley (2002); Roman (2008). In this paper, we will be using the following base notations:
- •
For any set , the set denotes the powerset of .
- •
For any mapping and any subset , the set denotes the image of w.r.t. , that is:
[TABLE]
- •
The set of all mappings is denoted by .
2.1 Basic Definitions
Definition 2.1**.**
A partial order on a set is a binary relation on that is reflexive ( ), transitive ( if and then .) and antisymmetric ( if and then ). The pair is called a partially ordered set or a poset for short. Two elements and from are said to be comparable if or ; otherwise, they are said to be incomparable. A subset is said to be a chain (resp. an antichain) if all elements of are pairwise comparable (resp. incomparable). The set of all chains (resp. antichains) of is denoted by (resp. ).
Note 1*.*
(Finite) posets are generally presented using the so called Hasse Diagram. Where elements of the poset are represented on the plane from the smallest ones to the largest ones. Only direct neighbors (i.e. two elements are said to be direct neighbors if there is no elements strictly lying between them in the poset) are linked by a segment of line. The poset can indeed be deduced from this diagram by adding reflexivity and transitivity. Fig. 2 (1) depicts the Hasse Diagram of the powerset of a set ordered by set inclusion (i.e. poset ).
Through some misuse of the notion, we will present in this paper some infinite posets using the Hasse Diagram. Fig. 2 (2) presents a poset where:
- •
,
- •
, and
- •
.
In what follows, denotes a poset and denotes an arbitrary subset.
Definition 2.2**.**
The principal ideal (resp. principal filter) of , denoted by (resp. ) is the set of all elements below (resp. above) it:
[TABLE]
Definition 2.3**.**
A set is said to be a lower ideal or a downset if:
[TABLE]
The notion of upper ideal (upset) is defined dually. The set of all lower (resp. upper) ideals of is denoted (resp. ).
Definition 2.4**.**
The down closure (resp. up closure), denoted by (resp. ) associates to a subset the smallest lower-ideal (resp. upper-ideal) enclosing it:
[TABLE]
Definition 2.5**.**
An element is said to be a lower bound (resp. upper bound) of if it is below (resp. above) all elements of . The set of lower (resp. upper) bounds of in , denoted by (resp. ), is the lower ideal (resp. upper ideal) given by:
[TABLE]
Note 2*.*
Note that and .
2.2 Minimal, Minimum, Infimum, …
Informally speaking, not all elements in a subset have the same “position”. Some elements are in the “interior” of , that is such that ; while some others are on its “border”, that is there is no element in strictly below/above them. These border elements are formally defined in Definition 2.6.
Definition 2.6**.**
An element is said to be a minimal (resp. maximal) element in if all its strict lower (resp. upper) bounds are outside . The set of minimal (resp. maximal) elements of is denoted by (resp. ) and given by:
[TABLE]
Note that and are antichains. Moreover, by definition. An intuitive and important relationship between the minimal elements of a subset and the minimal elements in its up-closure is presented lemma 2.7.
Lemma 2.7**.**
We have and
Proof.
We prove by double inclusion the property :
()
Let and suppose that . Since we have , that is s.t. but since . Thus, s.t. but . Thus with which contradicts the fact that (i.e. ). We conclude that . Suppose now that , that s.t. and . Hence, which contradicts the fact that . Thus . We conclude that .
()
Let , thus . Suppose that that is such that and . Thus such that with . Hence, with which contradicts the fact that . We conclude that or more generally .
On can follow the same steps to show . ∎
Definition 2.8**.**
An element is said to be minimum or a smallest element of if it is a lower bound of . Formally: . The notion of maximum or greatest element of is defined dually.
Note 3*.*
One should notice that not all subsets have a minimum or a maximum (see Example 2.9). Moreover, the minimum and the maximum are unique if they exist.
Example 2.9**.**
Consider again Fig. 2 (1) and the subset . It is clear that and . Obviously, subset has a minimum which is . However, it does not have a maximum since no element in is above all the other elements.
Note 4*.*
If a poset has a maximum (resp. minimum) element, we say that the poset is upper-bounded (resp. lower-bounded) and this element is called the top (resp. bottom) and is denoted (resp. ). A poset is said to be bounded if it is both lower-bounded and upper-bounded.
It is very important to distinguish between the minimal elements and the minimum. In fact, “if a subset has a minimum then it has a unique minimal element”. However, the converse of this statement is not true, that is even if a subset have a single minimal element, it can have no minimum. Indeed, the intuition that every element in a subset is at least above one minimal element in does not always hold in infinite posets. Such a property is presented in Definition 2.10.
Definition 2.10** (Definition 2.4 in Martínez et al. (2005)).**
We say that:
- •
is minimum-handle if has a minimum (i.e. ).
- •
is maximum-handle if has a maximum (i.e. ).
- •
is minimal-handle if
- •
is maximal-handle if .
Example 2.11**.**
Consider the poset depicted in Fig. 2 and let . It is clear that . Since , subset is not a maximal-handle. Hence, even if has a single maximal element, it has no maximum. On the other hand, . Since , we can say that is a minimal-handle. Moreover, since the minimal element is unique than is minimum-handle which minimum is .
Note 5*.*
If is minimal-handle, then has a minimum if and only if has a unique minimal element. It is clear that all subsets of a finite posets (i.e. posets with a finite set) are maximal-handle and minimal-handle. Posets where all subsets are minimum-handle are said to be well-founded or equivalently have the minimal condition or the descending chain condition (DCC). Dually, posets where all subsets are maximal-handle are said to be dually well-founded or equivalently have the maximal condition or the ascending chain condition (ACC). A poset having at the same time the ACC and DCC is said to be chain-finite since it has no infinite chain (but, still, could have infinite antichains). A poset is in fact finite if and only if it is chain-finite and antichain-finite.
Definition 2.12**.**
The largest lower bound of (i.e. the maximum of ) if it exists is called the infimum or the meet of and is denoted or . The join or the supremum of is given by the minimum of and is denoted or .
Having a minimum is a very strong property, in fact if a subset has a minimum then the minimum is also the infimum. However, the converse is not true. Indeed a subset can have an infimum without having a minimum. For instance, consider Fig. 2 (2), it is clear that the chain does not have a maximum. However, still have a supremum which is . Indeed, which minimum is .
One interesting property is stated in the following Lemma.
Lemma 2.13**.**
Let be a poset, , we have:
- •
For any . If has a join then .
- •
For any . If has a meet then .
Proof.
Let , we have by definition: , that is . Since is the least upper bound of and all elements of are upper bounds of then: . We conclude that . Same steps can be followed to show the second part of the Lemma. ∎
Note 6*.*
One should note that, in case of existence, we have:
[TABLE]
Moreover, since for a poset we have , then the empty set has a meet (resp. join) if and only if the poset is upper-bounded (resp. lower bounded) and we have and .
2.3 Lattices
Definition 2.14**.**
A poset is said to be:
- •
A meet-semilattice if for all nonempty finite subsets , has its meet.
- •
A join-semilattice if for all nonempty finite subsets , has its join.
- •
A lattice if it is both meet-semilattice and join-semilattice.
- •
A complete lattice if all its subsets including has their meet and join.
Note 7*.*
For any set , the poset is a complete lattice where the meet is set intersection and the join is set union . Such a poset is called a powerset lattice.
Example 2.15**.**
Consider posets depicted in Posets Fig. 2. Poset (1) is a powerset lattice on . Hence, it is a complete lattice. Poset (2) is also a complete lattice where the join of any infinite subset is . Poset (3) of common substrings of “ab” and “ba” ordered by is substring of is neither a meet-semilattice nor a join-semilattice. Indeed, has not an infimum since has two maximal elements and thus no maximum. Dually, subset has not a join since has two minimal elements and thus no minimum.
Note 8*.*
Meet-semilattices have a weaker, yet equivalent, definition characterizing them. In fact, to check if a poset is a lattice, one should only check if:
[TABLE]
That is if all pair of elements have their meets (resp. join) then all nonempty finite subsets of the poset have their meet (resp. join). Another important remark that is related to Note 6 is the fact that all complete semilattices are complete lattices.
One should notice that since in complete lattices, the empty set have also their meet and joins then all complete lattices are bounded. It is also important to note that all finite lattices are complete lattice. However, not all finite meet-semilattices are lattices since they may lack of a top element (i.e. the meet of the empty set is not guaranteed to exist). In fact, a finite meet-semilattice (resp. join-semilattice) is a lattice if and only if it is upper-bounded (resp. lower-bounded).
2.4 Morphisms on Posets
We will often use morphismes (i.e. mappings) between posets in this paper. Definition 2.16 below formulate some properties of morphismes between two posets.
Definition 2.16**.**
Let and be the two posets. A mapping is:
- •
order-preserving or monotone: .
- •
order-reversing:
- •
an order-embedding:
If an order-embedding exist from and then poset is said to be a completion of or embeds . If this order-embedding is surjective (i.e. an order-isomorphism) we say that and are order-isomorphic.
Definition 2.17**.**
A closure operator on is a mapping that is:
- •
monotone: ,
- •
extensive: vee, and
- •
idempotent:
Note 9*.*
The fixpoints of a given mapping is the set of elements s.t. . For idempotent operator, the set of fixpoints is .
Lemma 2.18**.**
Let be a poset, we have and are closures on .
Proof.
Let us show that is a closure operator on . It is clear that (i.e. is extensive) by definition. For in , we have if , then such that . That is, . Thus, (i.e. is order-preserving). Let us show that is idempotent. It is clear that since is extensive. It remains to show that . Let , that is such that . That is such that . We conclude that . One can follow the same steps to show that is a closure operator on . ∎
Note 10*.*
Fr any poset and since and are closure operators, we have:
- •
For all , if is minimal-handle then .
- •
For all , if is maximal-handle then .
Before finishing the section, we draw the reader attention to the tight link existing between closure operators and closure systems on a complete lattice defined below.
Definition 2.19**.**
Let be a complete lattice whose meet is . Given a subset , is said to be a closure system or a meet-structure on iff:
[TABLE]
Clearly, forms a complete lattice which infimum is .
In fact (see Theorem 1 in Ganter and Wille (1999)), if is a complete lattice and is a closure operator on then the poset of fixpoints of is a closure system. Conversely, if is a closure system on a complete lattice then the mapping that takes each element to the smallest element (fixpoint) in enclosing it is a closure operator with .
3 Pattern Setups
Formal Concept Analysis (FCA) were introduced in Wille (1982) as a mathematical framework to analyze and manipulate concepts in databases. FCA starts by a formal context where is a set of objects (i.e. Gegenstände), is a set of attributes (i.e. Merkmale) and is a binary relation on (i.e. Incidence relation). For , holds iff has attribute . Fig. 3 presents an example of a formal context. The basic theorem behinds FCA rely on the observation that any formal context can be transformed to a complete lattice called concept lattice. We invite the reader to read Ganter and Wille (1999) book for more details.
While (basic) FCA gives a tool to analyze datasets in a form of formal context, datasets with more complex attributes (eg. numerical or nominal attributes) needs to be transformed to such a form before any manipulation. Such a transformation is called conceptual scaling (i.e. binarizing) (Ganter and Wille (1989)). Yet, even if conceptual scaling is a quite general tool, binarizing a dataset with regard to the patterns we want to look for is not always straightforward (Baixeries, Kaytoue, and Napoli (2012)).
In response to that, a more natural way to handle complex datasets was introduced in Ganter and Kuznetsov (2001) under the name of pattern structures. Objects in a pattern structure have descriptions (e.g. the equivalent notion to set of attributes in in a formal context) with a meet-semilattice operation on them (e.g. equivalent to set intersection in in a formal context). This framework proved its usefulness in many data analysis tasks (see Kuznetsov (2009)). However, pattern structures demands that the description space to be a (upper-bounded) meet-semilattice which is not the case for all description spaces such as sequence of itemsets patterns (Codocedo et al. (2017)). Pattern setups were introduced in Lumpe and Schmidt (2015) to generalize pattern structures by demanding only a partial order on descriptions. We details in this sections the different notions related to pattern setups.
Definition 3.1**.**
A description space; called also description language, pattern space or pattern language; is any poset . Elements of are called descriptions or patterns. For any , should be read as “ is less restrictive than ” or “ subsumes ”.
Definition 3.2**.**
A pattern setup is a triple where is a set (of objects), is a description space and defines a mapping that takes each object to its description . Let and be an object and a description, respectively. We say that object realizes description or description hold for or cover object iff .
Example 3.3**.**
Consider the pattern setup in Fig. 4. We have . The description space is the set of nonempty words on the alphabet (i.e. ) ordered by the relationship “is substring of” . The mapping associates to each objects in its word in the description space. For instance . The diagram in the center of Fig. 4 depicts the Hasse Diagram of the poset with . In other words, it depicts the set of descriptions holding for at least one object in . It is clear that the description holds for since . However, description does not hold for since is not a substring of . More generally, descriptions holding for is the principal filter of (i.e. ). For instance, the set of descriptions holding for is given by .
Example 3.4**.**
Consider the pattern setup presented in Fig. 5 (left). The set of objects is and the description space is the powerset ordered by set inclusion (i.e. itemsets) with . Again descriptions holding for are all itemsets included in (i.e. ).
3.1 On Extent and Cover Operators
We have seen that the relation “realizes” build a binary relation between objects and descriptions. Based on this binary relation, two key operators, namely extent and cover are derived (see Definition 3.5 and 3.6).
Definition 3.5**.**
The extent operator, denoted by , is the operator that takes each description to the subset of objects in realizing it:
[TABLE]
The size of is called the support of , i.e. .
Note 11*.*
Please note that for any , we denote .
Definition 3.6**.**
The cover operator, denoted by , takes each subset of objects to the set of common descriptions in covering all of them:
[TABLE]
Example 3.7**.**
Consider the pattern setup presented in Example 3.3. We have: and .
Definition 3.8**.**
A subset is said to be:
- •
Definable, Separable or an Extent if .
- •
Coverable if .
The set of definable sets is then given by:
[TABLE]
Note 12*.*
One should note that poset does form a subposet of , that is definable sets are naturally ordered by . The set of coverable sets is naturally given by . In other words, any subset of a coverable set is coverable. Conversely, any superset of a non coverable set is a non coverable set.
Example 3.9**.**
The poset of definable sets associated to the pattern setup presented in Example 3.3 is depicted in Fig. 4 (right). For example, it is clear that is definable since . However, there is no description which extent is exactly , hence will be said non-definable. Still, is coverable since and share at least one common descritpion (i.e. ). One should note also that is non-coverable since they share no common symbol and the empty string is excluded from the pattern space.
An important property arising directly from the definition of both and is given in Proposition 3.10. It tells that: on the one hand, the more restrictive is a description, the less it covers objects in the database. On the other hand, the bigger is a set of objects the less they share descriptions in common.
Proposition 3.10**.**
Operators and are order-reversing:
[TABLE]
Proof.
We have:
Let , let , thus . Since , we conclude that that is . Thus .
Let such that . Let , that is thus ; that is . We conclude that .
This concludes the proof. ∎
Note 13*.*
We have seen in Proposition 3.10 that mappings and are order reversing. Hence, one could think that may form some Galois connection111A Galois connection between two posets and is a pair with and are order-reversing and both operators and are extensive.. However, it is not the case since associate an extent to one description while outputs a set of descriptions rather than one. In other words, these two mappings are not compatibles. Yet, we will see in next section that will be involved into a Galois connection when the considered pattern setup verifies additional properties (i.e. the pattern setup is a pattern structure Ganter and Kuznetsov (2001)). Mapping will also be involved in another Galois connection in Section 8.
Definition 3.11**.**
For , the pattern implication holds if . That is, every object realizing realizes . Dually, for , the object implication holds if . That is, every description covering all object in covers also all objects in .
We say that descriptions are equivalent if and and we have . Dual definition can be given for the equivalence between object sets.
Note 14*.*
Please note that if and since is an order-reversing mapping, we have . Regarding this observation, there is two types of implications between descriptions: (i) implications deduced directly from and (ii) implications that are dependent on the pattern setup. While the former implications are intrinsic to the description space, the latter are more important since they are those who enclose the knowledge hidden in the pattern setup.
Example 3.12**.**
In the pattern setup presented in Example 3.3 and Fig. 4, we have and . Hence, we have or in other words in every string containing in the pattern setup contains also .
Proposition 3.13 gives characterizations of the set and for and . This proposition will be useful later in this paper.
Proposition 3.13**.**
For and :
[TABLE]
Proof.
We show the two equations separately:
(1) Let , we have:
We conclude that
(2) Let , we have:
Thus . ∎
Example 3.14**.**
Consider the pattern setup presented in Example 3.3 and its associated depicted in Fig. 4 (right). We have: and . Hence:
- •
.
- •
.
3.2 A Minimal Representation of a Pattern Setup
An important notion analogous to what is called representation context in pattern structures (see Ganter and Kuznetsov (2001); Buzmakov, Kuznetsov, and Napoli (2015)) is introduced in Theorem 3.17. Technically, such a representation does not provide a practical way to explore definable sets of an arbitrary pattern setups, but helps to simulate definable sets search space of a pattern setup independently from the description space. Before introducing the Theorem, we present Proposition 3.15.
Proposition 3.15**.**
Let be a non empty finite set and let , we have
[TABLE]
Proof.
Before showing the equivalence let us prove the following property:
(1)
Recall that (See proposition 3.13). Let , we have , thus . Let us show that is a lower bound of . We have: . Thus, . Since is an order reversing operator, we obtain: . Thus, is the smallest element of . That is .
We show now the two implications independently:
()
Let for some pattern setup . Using eq. (1), .
()
Let for which we have . Let us now define the following pattern setup:
Let be a description, we have: . Let us show that by showing double inclusion: (1) Let , thus . It follows that . We conclude that . Therefor . (2) Let , thus . Since we have , we have , that is . We conclude that . Both inclusion leads us to have . We conclude that . In other words, .
This concludes the proof. ∎
Example 3.16**.**
Proposition 3.15 tells that not all families of subsets of could be seen as a set of extents of some pattern setup. Consider the poset depicted in Fig. 2 (4) where and can never be seen as a for some pattern setup . Indeed, whose intersection is not in .
Theorem 3.17**.**
For any pattern setup , the pattern setup given by
[TABLE]
is called the minimal representation of and we have .
Proof.
Theorem 3.17 is a corollary of Proposition 3.15. Indeed, the pattern setup is the same as the one built in the proof of Proposition 3.15 since is a set system verifying the property (i.e. implication ()). We have . Moreover, this representation is said to be minimal since any proper subposet of will drop at least one definable set. ∎
4 Pattern Structures
Pattern structures were introduced in Ganter and Kuznetsov (2001). They require that every set of objects has a greatest common description (least general generalization). A formal definition is given in Definition 4.1. Pattern structures provide a very strong tool to formalize a large class of pattern languages (Kuznetsov 2009). For instance, pattern setups over the language of itemsets (Ganter and Wille 1999), intervals (Kaytoue, Kuznetsov, and Napoli 2011), convex polygons (Belfodil et al. 2017), sequence sets (Buzmakov et al. 2016)222Sequence and graphs patterns will be discussed in the next section, since they do not induce pattern structures directly, but the sets of incomparable patterns do., and graph sets2(Kuznetsov 1999) are all pattern structures.
Definition 4.1**.**
A pattern setup is said to be a pattern structure iff every subset of objects has a greatest common description. Formally:
[TABLE]
Example 4.2**.**
The pattern setup presented in Example 3.4 and Fig. 5 is a pattern structure. Indeed, since the description space is the powerset lattice (i.e a complete lattice) then every subsets does have a meet which is the set intersection .
However, the pattern setup presented in Example 3.3 and Fig. 4 is not a pattern structure. Indeed, the set of common descriptions does not have a maximum (i.e. does not have a meet) since has two maximal elements.
One can define a new operator, namely the intent, in a pattern structure thanks to the existence of the meet.
Definition 4.3**.**
The intent operator, denoted by , is the operator that takes each subset of objects to the greatest common description in covering them (i.e. the maximum of ). Formally:
[TABLE]
Note 15*.*
In a pattern structure, the pair of operators forms a Galois connection between posets and . Thus, and form closure operators (cf. Proposition 8 in Ganter and Wille (1999) book) on the two posets respectively. Thanks to this Galois Connection, one can define a complete lattice based on the the closed elements.
Definition 4.4**.**
Let be a pattern structure. The (pattern) concept lattice associated to is the complete lattice denoted by . Elements of are called (pattern) concepts and are given by:
[TABLE]
The concepts are ordered by as follows: .
Note 16*.*
Two complete lattices isomorphic to the concept lattice can be derived:
The poset of definable sets which on a -structures (i.e. Moore family, closure system) in the powerset lattice . Note that definable sets are the fixpoints of the closure operator . 2. 2.
The poset of closed patterns with is a complete lattice. Elements of are called closed patterns since they are fixpoints of the closure operator .
Another important remark about the closure operator is that it takes to a subset of object the smallest definable set enclosing it. Formally:
Proposition 4.5**.**
Let be a pattern structure, we have:
[TABLE]
Proof.
This result is straightforward from the fact that is a closure operator. Indeed, according to Theorem 1 in Ganter and Wille (1999) (page 8), we have is closure system. By application of the theorem we have: . ∎
Example 4.6**.**
Consider again the pattern structure presented in Fig. 5. Since the meet is the set intersection we have:
[TABLE]
The concept lattice is depicted in Fig. 5 (right). One should note the set of definable sets can be deduced directly from the concept lattice by taking extents of the pattern concepts. It is important to highlight the fact that that pattern structure is derived from the formal context presented in Fig. 3 where is given by .
Definition 4.1 follows Lumpe and Schmidt (2015). The original equivalent definition of the pattern structure (Ganter and Kuznetsov 2001) requires that the description space must be a complete lattice. Theorem 4.7 builds a bridge between meet-semilattices and pattern structures over finite set of objects or more generally between pattern structures and complete lattices.
Theorem 4.7**.**
Let be a poset, the following properties are equivalent:
- •
For any finite set and any , is a pattern structure (where denotes the set of all mappings ).
- •
* is a an upper-bounded meet-semilattice (i.e. has also a meet).*
For an arbitrary set , the following properties are equivalent:
- •
For any set and any , is a pattern structure.
- •
* is a complete lattice.*
Proof.
Let us show both implications for a finite :
The empty set has a meet in since has a meet. Thus has a top element . Moreover, let be a finite set, one can build a finite set such that . Since is a pattern structure then has a meet. We conclude that is an upper-bounded meet-semilattice.
Let be a pattern setup. Any subset of is finite subset of and thus has a meet (including the since has its top element).
Let us now consider the case of arbitrary set :
Let , one can build such that . Since is a pattern structure then has a meet. We conclude that is a complete lattice.
Let be a pattern setup. Any subset of is a subset of and thus has a meet (including the since has its top element).
This concludes the proof. ∎
The state-of-the-art abounds with examples of descriptions spaces that are complete lattices that someone can use to build pattern structures:
- •
Itemset pattern structure (Ganter and Wille 1999). The description space is the Boolean lattice where is a non empty finite set of attributes.
- •
Interval pattern structure (Kaytoue, Kuznetsov, and Napoli 2011). The description space is the complete lattice 333 is the set of convex subsetes of . The set is the set of all possible intervals of , is then the set of all axis-parallel -dimensional hyperrectangles., where represents the set of all possible axis-parallel -dimensional hyperrectangles in ( is the number of attributes) and represents the hyperrectangle inclusion.
- •
Convex sets pattern structure (Belfodil et al. 2017). The description space is the complete lattice of all convex sets in ordered by inclusion .
- •
Partition pattern structure (Codocedo and Napoli 2014). The description space is the complete lattice of all partitions of some finite set . The order is finer-than order relation between partitions. That is for two partitions, if and only .
Note 17*.*
Before finishing this section, let us highlight some key differences between arbitrary pattern setups and pattern structures. It is clear that the main difference is the fact that the greatest common description does not necessarily exist for any subsets of objects in an arbitrary pattern setup. This implies that the set of definable sets is not necessarily closed under intersection in an arbitrary pattern setup as shown in Fig. 4. One should also note that, conversely to pattern setups where some subsets can be even non-coverable (see Example 3.9), In Pattern Structures, all subsets are coverables since . In fact, is always definable since .
5 From Closed Patterns to Support-Closed Patterns
In pattern mining, another notion of closedness is generally considered (Yan, Han, and Afshar 2003; Wang and Han 2004). Definition 5.1 defines formally such a notion dubbed support-closedness by Boley et al. (2010).
Definition 5.1**.**
A description is said to be support-closed in a pattern setup iff:
[TABLE]
We will see below that this notion is linked to maximal common descriptions.
5.1 On Maximal Common Descriptions
In pattern structures, support-closed patterns coincide exactly with closed descriptions (i.e. fixpoints of ) since takes a subset of objects to the greatest common description. However, when we consider an arbitrary pattern setup, such a maximum common description may not exist (see Example 4.2). One straightforward generalization is to associate to a subset of object the set of its maximal common descriptions (see Definition 5.2). Proposition 5.3 builds then a bridge between support-closed patterns and maximal common descritpions.
Definition 5.2**.**
The set of maximal covering (common) descriptions of a subset , denoted by , is given by:
[TABLE]
Proposition 5.3**.**
A description is support-closed iff . The set of all support-closed descriptions is given by: .
Proof.
We prove the two implications:
Let be a support-closed description and let . Hence, according to proposition 3.13 we have . Let us show now that . Suppose that that is . Since , there is then at least such that . Thus, in one hand and according to proposition 3.10, . And since , according to proposition 3.13, . Thus . This is contradictory with the fact that is support-closed ( s.t. and ). Therefore, .
Suppose that s.t. . According to proposition 3.13 and since , we have . Let now be such that , we have since is maximal in . According to proposition 3.10 we have . Moreover, using proposition 3.13 and since we have . Since then . Thus such that we have ; that is is support-closed.
The formula of is deduced directly. Please notice also that if there exists s.t. then (use then ). Hence, is support-closed iff . ∎
Example 5.4**.**
Reconsider Example 3.3, we have . Hence, the maximal covering ones are given by .
Note 18*.*
In a pattern structure we have: .
5.2 On Upper-Approximation Extents
Going back to pattern structures, the closure operator takes any subset to the smallest definable set enclosing it as stated by Proposition 4.5. This fact is used to enumerate all definable sets via the closure operator (see Kuznetsov (1993, 1999)). From Rough Set Theory (Pawlak 1982) perspective, the set can be seen as the upper approximation of an arbitrary and potentially non definable set in . However, when it comes to an arbitrary pattern setup, a non-definable set may have many minimal definable sets enclosing it or no one if it is non-coverable (see Example 5.6). Definition 5.5 formalizes this second generalization.
Definition 5.5**.**
The set of upper-approximation extents of a subset , denoted by , is given by the set of minimal definable sets in enclosing :
[TABLE]
Example 5.6**.**
Consider Example 3.3, the upper approximations of subset is given by since is definable. For the set , we have that is has two upper-approximation extents. For , it is clear that there is no definable set in enclosing (see Fig. 4 (right)), thus .
Note 19*.*
According to Proposition 3.13, we have . Moreover, in a pattern structure, for all .
5.3 Linking Upper-Approximation Extents to Support-Closed Patterns.
We have seen before that on the one hand operator is somehow a generalization of pattern structure operator in an arbitrary pattern setup and on the other hand, upper-approximation extents operator is a generalization of pattern structure closure operator . Indeed, in a pattern structure we have for , and . That is: . One judicious question is that, does this property still hold for an arbitrary pattern setup? Let us analyze the following example.
Example 5.7**.**
Reconsider Example 3.3 and the definable set . We have and . Thus on the one hand, we have:
[TABLE]
but on the other hand, since is definable, we have .
Hence, according to Example 5.7, it is clear that the property does not hold in a pattern setup. In fact, things can go even worse when the description space is an arbitrary infinite poset. Let us analyze for that a second Example:
Example 5.8**.**
Let be the poset presented in Fig. 7 where:
- •
,
- •
, and .
Let be the pattern setup such that , and . It is clear that . Yet, since is an infinitely ascending chain. Therefore, there is no maximal common description covering both and . Thus, given it seems that there is no link between and in an arbitrary pattern setup . In fact, this even mean that considering only maximal common descriptions to look for all possible definable sets is totally a wrong idea since maximal covering descriptions do not hold all the information about definable sets. Indeed, while , the extents obtained from the set of support-closed patterns is given by:
[TABLE]
Going back to the case of pattern structures, there is a strong link between and . Indeed, . In other words, knowing the intent of a subset of objects allows us to know every single common description to all objects in . If we had want to generalize such a property for an arbitrary pattern setup we would have expected: . In the sense that knowing maximal covering descriptions allows to deduce all the covering ones. Such a property does not hold for any pattern setup as shown in Example 5.8. This property is directly linked to what is called multilattices which we revisit in the next section.
6 Multilattices
The term multilattice was introduced for the first time by Benado (1955). This notion have not received much interest for a long period, but have been unearthed and revisited in the beginning of the century by Martínez et al. (2001); Cordero et al. (2004); Martínez et al. (2005) for other purposes. We will start here by presenting multilattices following Martinez’s et al. We will then understand the main difference between Benado’s multilattices and (Martinez’s et al.) multilattices afterward.
Before giving the formal definition of multilattices, we start by defining the notion of multi-infimum and multi-supremum. In the following section denotes an arbitrary poset and denotes an arbitrary subset.
Definition 6.1**.**
A multi-infimum (resp. multi-supremum) of is a maximal (resp. minimal) element of (resp. ). The set of multi-infima (resp. multi-suprema) of is denoted by minf(S) (resp. msup(S)) and:
[TABLE]
We say then that:
- •
has all its multi-infima iff: .
- •
has all its multi-suprema iff: .
Multilattices, as their names imply, are related in their definition with lattices. Simply put, multilattices are a relaxation of lattices where rather than demanding that the set of lower (resp. upper) bounds of each nonempty finite subset has its infimum (resp. supremum), multilattices demand that the set of lower (resp. upper) bounds of each nonempty finite subset has all its multi-infima (resp. multi-suprema).
Definition 6.2**.**
A poset is said to be:
- •
A meet-multisemilattice if for all nonempty finite subsets , has all its mutli-infima.
- •
A join-multisemilattice if for all nonempty finite subsets , has all its mutli-suprema.
- •
A multilattice if it is both a meet-multisemilattice and a join-multisemilattice.
- •
A complete meet-multisemilattice if for all subsets , has all its mutli-infima.
- •
A complete join-multisemilattice if for all subsets , has all its mutli-suprema.
- •
A complete multilattice if it is both a complete meet-multisemilattice and a complete join-multisemilattice.
It is clear that all finite posets, or more generally finite-chain posets, are complete multilattices. More precisely, posets having the ascending (resp. descending) chain condition are complete join-multisemilattices (resp. meet-multisemilattices). It is also clear that all (complete) (semi)lattices are (complete) multi(semi)lattices since requiring that a subset to have all its multi-infima is weaker than requiring it to have an infimum. One should note also that since then we have:
- •
If is a complete meet-multisemilattice then .
- •
If is a complete join-multisemilattice then .
When we compare with lattices (cf. note 8), two questions straightforwardly raise:
- •
Are pair of elements the building blocks of a multilattice?
- •
Are all complete semimultilattices complete multilattices?
The followings sections answers negatively to these both questions.
6.1 On Benado’s Multilattices
As said before, multilattices ware introduced for the first time by Benado (1955). However, Benado defined multilattices as follow: a poset is said to be multilattice if and only if all pairs of elements have all their multi-infima and all their multi-suprema. Formally:
[TABLE]
We will call here posets verifying such a property Benado’s multilattices. As clearly explained in Martínez et al. (2005), such a property is not sufficient to have all non empty finite subsets have their multi-infima and multi-suprema. This is in contrast to lattices where it suffices to have a meet and join for all subsets of two elements to have the meet and join for all non empty finite subsets. For instance, one could verify that the poset depicted in Fig. 7 is a Benado’s multilattice. Yet, it is not a multilattice according to Definition 6.2. For instance, the set does not have all its multi-infima. Indeed, , however, . It follows that .
6.2 On Complete Multisemilattices
We have seen that all complete semilattices are complete lattices. However, this property no longer holds for complete multisemilattices. In fact one could have complete meet-multisemilattice that is even not a join-multisemilattice and vice versa. For instance, Fig. 7 depicts a complete join-multisemilattice that is even not a meet-semimultilattice. For instance, the set of lower bounds is an infinitely ascending chain and thus . Thus, . In other words, does not have all its multi-infima.
6.3 On chain-completeness and complete multilattices
Complete multilattices are linked chain-complete posets. Before giving this relationship, let us recall the definition of chain-completeness.
Definition 6.3**.**
A poset is said to be:
- •
Chain-complete if all chains in , including , has its join.
- •
Dually chain-complete if all chains in , including , has its meet.
- •
Doubly chain-complete if it is chain-complete and dually chain-complete.
Since the empty set matches the definition of a chain-complete, all chain-complete posets are bounded. An important theorem linking complete lattices to chain-completeness is given in Theorem 6.4.
Theorem 6.4** (Theorem 3.24 from Roman (2008)).**
A lattice is a complete lattice if and only if it is chain-complete.
One straightforward question is what is the relationship between complete multilattices and chain-complete posets. The answer is given in Theorem 6.5.
Theorem 6.5**.**
Under Axiom of Choice (AC) assumption444I am grateful to Jozef Pócs for attracting my attention to Zorn’s Lemma., we have:
- •
All chain-complete posets are complete meet-multisemilattice.
- •
All dually chain-complete posets are complete join-multisemilattice.
- •
All doubly chain-complete posets are complete multilattices.
Proof.
Before proving the theorem, we attract the reader to Zorn’s Lemma. This lemma states that if every chain in a poset has an upper-bound, then has a maximal element. Formally:
A stronger statement, yet equivalent, of Zorn’s Lemma say even that:
(3)
Zorn’s Lemma need to be considered as an axiom since it is equivalent to the well-known axiom of choice (AC).
Let be now a chain-complete poset , we need to show that is a complete meet-multisemilattice. Let be an arbitrary subset of . We show here that . It is straightforward by definition and independently from any assumption that . It remains to show that . Since is chain-complete, then every has its join . Hence, according to Lemma 2.13 and since then . Thus every chain in the sub-poset has an upper bound . According to Zorn’s Lemma (cf. equation 3), we have . Hence, is a complete meet-multisemilattice.
In order to demonstrate the other statements of the theorem, one can follow the same steps to show that using Zorn’s Lemma on the dual poset of dually chain-complete posets. ∎
Note 20*.*
It is important to note that double chain-completeness is only a sufficient condition (under the Axiom of Choice) to have a complete multilattice but not a necessary one. Indeed, one can show that the poset depicted in Figure 8 is a complete multilattice (Remark that , , and ) but not chain-complete since the chain has not a join. Indeed, which is an antichain (i.e. has two minimal elements).
7 Pattern Multistructure
Definition 7.1 proposes a new structure that lies between pattern setups which rely only on arbitrary posets with no additional property and pattern structures which demand a meet for every subset of .
Definition 7.1**.**
A pattern setup is said to be a pattern multistructure if every subset of has all its multi-infima, i.e. :
[TABLE]
A pattern multistructure adds an additional condition on a pattern setup which is the following: knowing maximal common descriptions covering all elements of a set of objects allows us to deduce using the order every single covering description. Please note that using the notation of multi-infima, is given by . It is clear that all pattern structures are by definition pattern multi-structures.
Graphs ordered by subgraph isomorphism relation introduced in Kuznetsov (1999) induce a pattern multistructure on the set of graphs, but not a pattern structure (a pattern structure is induced on sets of graphs incomparable wrt. subgraph isomorphism). Same remark holds for sequential patterns (Buzmakov et al. 2016; Codocedo et al. 2017). This is under the assumption of the existence of a largest element subsumed by all sequences/graphs (see Example 7.2).
Note 21*.*
Note that in a pattern multistructure the empty set has all its multi-infima. Since , the set has all its maximal elements (i.e. ) or in other words every chain in is upper-bounded.
Example 7.2**.**
Reconsider the pattern setup presented in Example 3.3. Since only finite sequences are considered in the description space, we have: even if . Thus, the considered pattern setup in Example 3.3 is not a pattern multistructure due to the empty set (recall that for a nonempty , is finite and thus has all its multi-infima). The common trick to handle the empty set is to enrich with an additional largest element if it does not exist. In such a case, we have .
Let us now reconsider the question investigated at the end of section 5: “What is the link between maximal covering descriptions and upper-approximations extents in a pattern multistructure”. Before stating Theorem 7.4 answering this question, we shall state the following Lemma.
Lemma 7.3**.**
Let and be two posets and let be an order-reversing mapping. We have for any that .
Proof.
Recall that and are closure operator (cf. Lemma 2.18). Let us start by showing that . Since , we have . Since is monotone, we have . It remains to show that . Let , that is s.t. . Since , then s.t. . Hence s.t. . Using the fact that is an anti-embedding, we obtains that . In other words, s.t. . This is equivalent to say that . We conclude hence that . ∎
Theorem 7.4**.**
For any pattern multistructure we have:
[TABLE]
Proof.
The proof of the theorem is a straightforward application of Lemma 2.7 and Lemma 7.3. Let , since is a pattern multistructure, then:
Since is an order reversing, then using Lemma 7.3 we have:
Using Lemma 2.7 we obtain . That is, .∎
Another important remark related to Example 5.8 is the fact that the support-closed patterns in a pattern setup does not hold all the information about the definable sets. Theorem 7.5 states that this is no longer the case for pattern multistructures.
Theorem 7.5**.**
Given a pattern multistructure for which the set of support-closed patterns is (cf. Proposition 5.3), we have:
[TABLE]
Proof.
Recall that . Since and by definition . It is clear that . It remains to show that . Let , since is a pattern multistructure then . Let s.t. (we have ). Since is a pattern multistructure then we have a support-closed pattern s.t. . Hence, . Moreover, since , we have . Therefore, . We obtain thus , that is . ∎
Similarly to Theorem 4.7 for pattern structures, Theorem 7.6 connects multilattices with pattern multistructures. It state that (complete) meet-multisemilattices are to pattern multistructures what (complete) lattices are to pattern structures.
Theorem 7.6**.**
Let be a poset, the following properties are equivalent:
- •
For any finite set and any , is a pattern multistructure.
- •
* is a meet-multisemilattice having all its maximal elements (i.e. )*
The following properties are equivalent:
- •
For any set and any , is a pattern multistructure.
- •
* is a complete meet-multisemilattice.*
Proof.
Recall that is a pattern multistructure iff any subset has all its multi-infima. The proof of this theorem is almost the same as the one of Theorem 4.7 where the existence of the meet is replaced by the existence of all multi-infima. ∎
Last but not least, we have seen in section 4 that in the case of a pattern structure, is a complete lattice. One can say that the property of having the infimum in the description space has been transferred to the poset of definable sets thanks to extent operator. When it comes to a pattern setup on finite set of objects, it is clear that is a complete multilattice since it is finite. However, does this property still hold for the case of infinite set of objects? Unfortunately, the answer is negative as stated in Proposition 7.7. This proposition tells also that not all definable sets above in a pattern multistructure are above at least one upper-approximation of .
Proposition 7.7**.**
There exists a pattern multistructure such that is not a join-multisemilattice in which: .
Proof.
Consider the pattern setup where is the complete multilattice depicted in Fig. 9. We have:
•
.
•
.
Since is a complete multilattice (i.e. it is chain-finite), then is a pattern multistructure. Consider now an infinite set . The mapping is given by: , and . To show that the poset is not a join-multisemilattice we need to consider two definable sets in and show that the set of their common upper-bounds in does not have all its minimal elements. Let us compute for every :
•
and .
•
and .
Consider now the set of definable sets , it is clear that the set of their common upper-bounds (in is given by:
\displaystyle\{\{g_{\alpha}\},\{g_{\beta}\}\}^{u}=\big{\{}\left\{g_{\alpha},g_{\beta}\right\}\cup\left\{g_{j}\mid j\geq i\right\}\mid i\in\mathbb{N}\big{\}}
The set of upper bounds is hence an infinitely descending chain and hence does not have a minimal element, in other words: . Hence, is not a join-multisemilattice. The proof of the second part of the proposition is straightforward. Indeed, consider the non-definable set . We do have: \uparrow A\cap\mathbb{P}_{ext}=ext[cov(A)]=\big{\{}\left\{g_{\alpha},g_{\beta}\right\}\cup\left\{g_{j}\mid j\geq i\right\}\mid i\in\mathbb{N}\big{\}}. Hence, . That is . ∎
8 Pattern Setup and Pattern Multistructure Completions
Example 3.3 presents a pattern setup which is not a pattern structure. However, in FCA and pattern structure literature, it is recurrent to talk about sequential pattern structures (Buzmakov et al. 2016; Codocedo et al. 2017). In fact, instead of sequences, sets of sequences are considered which induce a richer description space. Same trick has been even used in the first paper introducing pattern structures (Ganter and Kuznetsov 2001) concerning graph description space ordered by subgraph isomorphism. Such a technique that embeds a poset into another is called a completion (see Definition 2.16). Different natural completions exist in the literature. For instance, the Dedekind-MacNeille completion (Davey and Priestley 2002) takes an arbitrary poset to the smallest complete lattice containing it. The usual trick used in FCA and Pattern Structure literature to augment a base pattern setup to a pattern structure is tightly linked to the antichain completion presented below.
Definition 8.1**.**
The antichain completion of is the poset s.t.:
- •
is the set of all antichains of .
- •
The order is given by .555Note that does not induce an order in , but just a pre-order, since the anti-symmetry does not hold (see Crampton and Loizou (2001)). Indeed, consider poset where . Since , we have and . Yet, . Therefor, does not induce an antisymmetric relation on but it is still reflexive and transitive (i.e. induce a preorder on .
- •
The order embedding from to is given by
[TABLE]
Boldi and Vigna (2016) and Crampton and Loizou (2001) had discussed the properties of such a completion. In fact, when have the ACC, is a distributive lattice, where the meet and the join are given by and , respectively. Moreover, is always a -semilattice whatever the nature of the poset , but not necessarily a lattice. Boldi and Vigna (2016) formulated a sufficient and necessary condition in order to have be a lattice: .
We take the opportunity here to underline an important link between the antichain completion and multilattices. Before expliciting this link in Propsosition 8.3, let us take a close look to the following Lemma.
Lemma 8.2**.**
Let be an arbitrary poset and let be the set of its antichain. We have : .
Proof.
The case of is trivial since and . Let be a nonempty set s.t. . Let us show that :
•
: let , suppose that that is s.t. . Since then s.t. . Thus such that which is a contradiction with the fact that is an antichain.
•
: Suppose s.t. . We have , that is: s.t. (since ). However, since then . Thus, s.t. which is in contradiction with the fact that .
This concluds the proof. ∎
Proposition 8.3**.**
Let be an arbitrary poset and let be its antichain completion: If is a lattice then is a meet-multisemilattice. Moreover, if has a top element then .
Proof.
Let us start by showing the first property that is if is a lattice then is meet-multisemilattice. We have is a lattice. Then, following Boldi and Vigna (2016), we have:
More generally:
Since is an antichain, we have (Lemma 8.2), that is:
(4)
Let be be a non empty finite subset. We need to show that has all its multi-infima. That is . We have . Since, is a lattice we have according to equation (4):
For the second part of the proposition, consider that has a top elements. That is . Hence, we have since is an antichain (Lemma 8.2). In other words, has all its multi-infima. ∎
Note 22*.*
One should note that the converse of Proposition 8.3. In fact, one can create complete lattices for which the antichain completion is not even a lattice. Fig. 10 depicts such a complete lattice. Indeed, for antichains and , we have . Hence, there is no antichain s.t. making the antichain completion not a meet-semilattice.
8.1 On Pattern Setups Antichain Completions
The main purpose of transforming a pattern setup to another one is to augment it to a pattern structure in order to use the different results related to this latter structure. We define below the most common trick used in the FCA literature which can be called pattern setup antichain completion.
Definition 8.4**.**
Let be a pattern setup, the antichain completion of is the pattern setup denoted by and given by:
[TABLE]
Earlier in this section, we have mentioned that is a lattice when is a finite poset (i.e. a sufficient condition). However, given an arbitrary pattern setup on an infinite description space, is not always guaranteed to be a pattern structure. Theorem 8.5 gives a necessary and sufficient condition on that makes a pattern structure. Here and denote extent and intent of , respectively.
Theorem 8.5**.**
Let be a pattern setup, the antichain completion of is a pattern structure if and only if is a pattern multistructure. Moreover:
[TABLE]
We have and .
Proof.
Let us show that:
Recall that is a pattern structure iff every subset of has a meet in . For we have:
where and denote respectively the lower bounds of w.r.t. and the lower bounds of of w.r.t. (recall that ). In this proof refers to the down-closure related to .
We show each implication independently:
•
() Let . Thus . Since , so is the meet of in .
•
() is a pattern structure is equivalent to say: , has a meet . That is, for : . Particularly, for with , we deduce that . Thus, . Moreover, since () and is a closure operator on we have by monotony (note that ). We conclude that we have . Using Lemma 8.2 we obtain .
We conclude the equivalence.
Let us now determine and . The previous proof has shown that for the meet of is . i.e.:
For operator, let . We have:
Let us show that . By definition of , the property holds. For the inverse inclusion, it is sufficient to show that (since is closed under intersection). Let . s.t. . Since , and . We conclude that . Hence, . ∎
8.2 On Pattern Setups Direct Completions
There is a completion that transforms any pattern setup to a pattern structure without demanding any additional property.
Theorem 8.6**.**
The direct completion of is the pattern structure:
[TABLE]
Where and . The set of definable sets is given by .
Proof.
Let us show that the pattern setup is a pattern structure. Let . We need to show that has a meet in . We have:
Since , we conclude is the meet of , Hence:
For the extent operator , let . We have
Let us show that . Thanks to the definition of , property holds. For the inverse inclusion, it is sufficient to show that (since is closed under intersection). Let , s.t. . We have . We conclude that and . ∎
Example 8.7**.**
Fig. 11 depicts the concept lattice of the antichain completion of the pattern multistructure considered in Fig. 4 (i.e., the description space is augmented with the top element ). For any concept , descriptions in bold are those which whose . Please notice that while description has for extent , description does belong to the concept related to the extent . Another important remark, are the underlined concepts. They represent concepts that are related to the non definable sets in but still definable in , i.e. and in . For instance, consider the intent of in the completion, each pattern has extent . Extent is non-coverable in and thus .
Note that while in Example 8.7, the size difference between the set of definable sets in the base pattern setup and the set of definable sets in the antichain completion is not large (i.e. = 2). In some cases, the size of can be exponentially larger than . Consider, for instance, the following example:
Example 8.8**.**
Let with . We denote by the subset . Let be the pattern setup with ,
[TABLE]
and the mapping for all . One can verify that we have . Indeed, we have:
- •
.
- •
.
Hence, we have . However, according to Theorem 8.5, we have
[TABLE]
since contains all the coatoms of (i.e. ) and the powerset lattice is coatomistic. It follows that . In other words, is exponentially larger than . One should notice that the new description space associated to is (order-)isomorphic to .
9 Conclusion and Discussion
In this paper, we provided a better understanding of the pattern setup framework. We have shown that while pattern structures demand a strong condition on the partially ordered set of descriptions, pattern setups do not require any additional property on the description space, which makes them rather permissive. We have introduced a new framework, namely pattern multistructure, lying between both structures. Informally, pattern multistructures demands that the set of maximal common description resume properly the set of common descriptions of any subset of objects. Analogously to pattern structures, pattern multistructures are tightly linked to multilattices. We have shown also that the usual antichain completion used in FCA literature to transform pattern setups, like sequence of itemsets ones, to pattern structure is applicable if and only if the considered pattern setup is a pattern multistructure.
An important open problem we are thoroughly working on is the following: “Given an arbitrary pattern setup with a finite set of objects, generate its definable sets exhaustively and irredundantly”. If the pattern setup is a pattern structure, the literature abounds of algorithms solving such a problem (e.g. (Ganter 1984; Kuznetsov 1993; Outrata and Vychodil 2012)). However, no algorithm exists to solve such a problem for an arbitrary pattern setup. Indeed, the usual solution in FCA is to transform the pattern setup to a pattern structure via a completion (e.g. antichain completion for (Kuznetsov and Samokhin 2005; Buzmakov et al. 2016; Codocedo et al. 2017)). Such a solution could create a (exponentially) larger search space. Other algorithms in the literature tackles this problem by enumerating support-closed patterns (e.g. Yan, Han, and Afshar (2003)). These latter algorihms may generate twice the same definable sets, since two support-closed patterns could have the same extent. Solving this problem will be the subject of future research.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Agrawal and Srikant (1995) Agrawal, Rakesh, and Ramakrishnan Srikant. 1995. “Mining sequential patterns.” In Data Engineering , 3–14. IEEE.
- 2Baixeries, Kaytoue, and Napoli (2012) Baixeries, Jaume, Mehdi Kaytoue, and Amedeo Napoli. 2012. “Computing Functional Dependencies with Pattern Structures.” In CLA , Vol. 972, 175–186.
- 3Baixeries, Kaytoue, and Napoli (2014) Baixeries, Jaume, Mehdi Kaytoue, and Amedeo Napoli. 2014. “Characterizing functional dependencies in formal concept analysis with pattern structures.” Ann. Math. Artif. Intell. 72 (1-2): 129–149.
- 4Belfodil, Kuznetsov, and Kaytoue (2018) Belfodil, Aimene, Sergei O. Kuznetsov, and Mehdi Kaytoue. 2018. “Pattern Setups and Their Completions.” In CLA , Vol. 2123, 243–253.
- 5Belfodil et al. (2017) Belfodil, Aimene, Sergei O. Kuznetsov, Céline Robardet, and Mehdi Kaytoue. 2017. “Mining Convex Polygon Patterns with Formal Concept Analysis.” In IJCAI , 1425–1432.
- 6Benado (1955) Benado, Mihail. 1955. “Les ensembles partiellement ordonnés et le théorème de raffinement de Schreier I Théorie des multistructures.” Czechoslovak Math. J. 5 (3): 308–344.
- 7Boldi and Vigna (2016) Boldi, Paolo, and Sebastiano Vigna. 2016. “On the lattice of antichains of finite intervals.” Co RR abs/1510.03675. https://arxiv.org/abs/1510.03675 .
- 8Boley et al. (2010) Boley, Mario, Tamás Horváth, Axel Poigné, and Stefan Wrobel. 2010. “Listing closed sets of strongly accessible set systems with applications to data mining.” Theor. Comput. Sci. 411 (3): 691–700.
