A General Theory of Concept Lattice (I): Emergence of General Concept Lattice
Tsong-Ming Liaw, Simon C. Lin

TL;DR
This paper introduces a general concept lattice framework that unifies and extends traditional lattices like formal concept and rough set lattices, ensuring invariance of information content across different variable presentations.
Contribution
It develops the first part of a comprehensive theory for a general concept lattice that can encompass various categorization methods and guarantees invariance of information content.
Findings
Existence of a general concept lattice compatible with traditional lattices
The general lattice is invariant under different variable presentations
Formal concept and rough set lattices are special cases of the general lattice
Abstract
As the first part of the treatise on A General Theory of Concept Lattice (I-V), this work develops the general concept lattice for the problem concerning categorization of objects according to their properties. Unlike the conventional approaches, such as the formal concept lattice and the rough set lattice, the general concept lattice is designed to adhere to the general principle that the information content should be invariant regardless how the variables/parameters are presented. Here, one will explicitly demonstrate the existence of such a construction by a sequence of fulfillment compatible with the conventional lattice structure. The general concept lattice promises to be a comprehensive categorization for all the distinctive object classes according to whatever properties they are equipped with. It will be shown that one can always regain the formal concept lattice and rough set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Advanced Algebra and Logic · Logic, Reasoning, and Knowledge
A General Theory of Concept Lattice (I):
Emergence of General Concept Lattice
Tsong-Ming Liaw
Institute of Physics, Academia Sinica, Taipei, Taiwan 11529
Simon C. Lin
[email protected], [email protected]
Institute of Physics and ASGC, Academia Sinica, Taipei, Taiwan 11529
Abstract
As the first part of the treatise on A General Theory of Concept Lattice (IV), this work develops the general concept lattice for the problem concerning categorisation of objects according to their properties. Unlike the conventional approaches, such as the formal concept lattice and the rough set lattice, the general concept lattice is designed to adhere to the general principle that the information content should be invariant regardless how the variables/parameters are presented. Here, one will explicitly demonstrate the existence of such a construction by a sequence of fulfilments compatible with the conventional lattice structure. The general concept lattice promises to be a comprehensive categorisation for all the distinctive object classes according to whatever properties they are equipped with. It will be shown that one can always regain the formal concept lattice and rough set lattice from the general concept lattice.
General Concept Lattice; Categorization; Formal Concept Lattice; Rough Set Lattice.
1 introduction
It is commonly accepted that the formal concept analysis (FCA) Wi82 ; GW99 ; Wi05 and the rough set theory (RST) Pa82 ; Pa91 are important approaches to the Big Data Analytics. One can superficially tell such importance from the rapid growth of interest in both fields, e.g., the inclusion of the FCA and/or RST topics in many international conferences and workshops. Although FCA and RST might have been motivated differently, it is believed that they are mutually expressible Ke96 ; DGO01 ; GD02 ; DG03 ; YY04 ; Wa05 . Historically, the formal concept lattice (FCL) is a native part for FCA since FCA had its origin in activities of restructuring mathematics, in particular, mathematics order and lattice theory Wi05 . On the other hand, the rough set lattice (RSL) has also been accomplished DG03 ; YY04 in terms of the modal logic operators which cope with RST in the binary version. Although the two concept lattices may be seen as dealing with different categorisations, one may articulate them of the same object collection and the same attribute set related in a unique information system, i.e. the same formal context. However, the systematic incorporation of the FCL and the RSL in the problem analysis remains unobserved.
The treatise “A General Theory of Concept Lattice (IV)” is initiated by the general idea that the information content should be invariant regardless how the variables/parameters are presented. Notice the fact that the conventional employment of objects and attributes are intuitively different. Unlike the objects, which are individuals, the attributes as the object property description can overlap per conjunction. Thus, for general characterisation of logic problems it is essential also to take into accout the composite attributes based on the operations . There is no reason to stress the priority of the simple attributes in over the composite attributes since one may always rename the composite attributes into new simple (in contrast to composite) ones, which may render those originally simple attributes composite. Instructed by such a principle, the general concept lattice (GCL) thus speaks of the generalised attributes, which incorporate both the simple and the composite attributes constructable out of the members of .
In this part (I) the existence of GCL subject to the formal context will be demonstrated, giving rise to a more informative structure than the conventional RSL and FCL. The GCL provides a comprehensive categorisation for whatever objects discernible by the formal context since in its construction one can exhaust all the generalised attributes, as will be clarified in the part (II) LLJD12-2 . By contrast, the FCL is devised for those attributes given as conjunctions of the members of and the RSL for those disjunctions. Remarkably, the comprehensive categorisation in the GCL then brings about a simple framework for all the implications extractable from the lattice structure. While these implications are considered between two generalised attributes, those implications deducible from the FCL and RSL can be regained when one restricts oneself to the attributes of conjunction type and disjunction type, respectively. The simple framework for all the implications enabled by the GCL in fact inspires the idea to represent any implication relation by one single attribute, thereby rendering the primary deduction system (PDS), see the part (III) LLJD12-3 , which is a simplified systems where logic statements only take into account the properties on a single subject by means of predicates. The PDS is algebraically manipulable in the sense that the deductions are solely Boolean algebras bypassing any of axioms. Note that being able to achieve an algebraically manipulable deduction is rather practical. One may then manage to resolve any logic problem mindlessly if it could be arranged according to the syntax of the algebraically manipulable deduction. Indeed, certain prevalent puzzles will be shown to be resolved in the PDS in this manner. Another point is that tautology in effect occurs whenever the algebraically manipulable deduction results in the Truth 1, which is essentially what all the axioms should end up. It can be shown that the PDS adheres to the classical logic rules since the Hilbert axioms all turns out to be tautology when restricted to the PDS.
In the parts (IV) and (V) of this treatise LLJD12-4 ; LLJD12-5 , efforts are devoted to render the algebraically manipulable deduction more realistic such that it can cope with the conventional reasoning. The following two points are of concern.
- •
Dealing with pure attribute-typed logic statements in the PDS is naïve since in reality every assertion, where a property assignment has to be ascribed to a definite referential attribute set based on which a collection of judgements altogether forms the statement.
- •
The PDS as concerned with the deduction on one sole subject (object class) cannot be sufficiently expressive in formalising a general logic statement; one is looking forward to the further extensions.
For the first point, various attribute-type logic statements, though ascribed to different referential attribute sets, may become the same one. In fact, such a problem reminds one about the conventional indifference of the set unions and so forth. Therefore, one argues LLJD12-4 that the notion of finite resolution enters the syntax of PDS and participates in the deduction as a novelty. The PDS incorporating the finite resolution will then be referred to as the resolvable PDS, which however exhibits anomalies for some of the Hilbert Axioms, entailing that the classical logic comprises certain counter-intuitiveness. As for the second point, the secondary deduction system (2DS) LLJD12-5 is designed to take into account those logic statements relating predicate forms among different subjects. Note that, despite involving the PDS syntax as its particular statements, the 2DS can be developed in parallel to the PDS. Thus, one does not anticipate different deduction rules from the 2DS than from the PDS. Moreover, the resolvable 2DS can likewise arise from the reasoning incorporating finite resolution, which can be traced back to the same origin that elicits the resolvable PDS.
To start with in this part (I), the GCL is a structure ordering lattice nodes marking the general concepts by means of Galois connections. The general concept takes the 2-tuple representation (general extent, general intent), (Ge,Gi) for short, which is similar to the formal concepts employed in FCL and in RSL. Below, one will demonstrate the existence of such GCL through the accomplishment of the following goals.
G1
The general extent runs over all the distinctive object classes recognised by the formal context. The general intent plays the role of properties of a definite object class and can be characterised by a pair of attributes referred to as the generalised rough set property (Grsp) and the generalised formal concept property (Gfcp). The 2-tuples (Ge, Grsp) and (Ge, Gfcp) in effect satisfy the generalisation of lattice conditions, respectively, employed for the RSL DG03 ; YY04 and for the FCL Wi82 ; GW99 ; Wi05 .
G2
The GCL constitutes a Hasse diagram with a power set structure, where the nodes in the Hasse diagram can be related by means of set inclusion relations among the object classes discernible by the formal context.
G3
Both RSL and FCL can be regained from the GCL as particular features.
G4
The GCL allows for deterministic construction based on a particular collection of general concepts to be described later.
G5
The GCL manifests a conjugate relation between its RSL- and FCL- counterparts and consequently emerges as a self-dual lattice.
The formulation of the work at hand differs from the traditional approaches (typically, Wi82 ; GW99 ; DG03 ; YY04 ) mainly by the employment of the generalised attributes. For concreteness, let be the attribute set under consideration then the GCL takes into account generalised attributes which are all the possible compositions of members from by means of all the basic Boolean operations. Thus, the information provided by the composite attributes could be as significant as the information given in terms of simple (in contrast to the composite ones) attributes . In Sec. 2 some preliminaries of the RST and FCA are revisited, where one attempts an equal-footing treatment for both theories based on their original conventions with minimal notation refinements. This is to establish a convenient framework above which one may differentiate the GCL with the RSL and FCL. In Sec. 3, it is shown that both the FCL- and RSL- intents could have been represented by single composite attributes without loss of generality. Hence, the consideration to generalise the attribute domain from to in the problem setting becomes intuitive. Moreover, the sign of the GCL is revealed from some inconsistency observed in the conventional FCL and RSL approach. In order to retain the consistency, one inevitably takes into account the full correspondence between the objects and generalised attribute set , which then gives rise to the GCL. It turns out that the GCL ends up a comprehensive categorisation since its object classes range over all the possible object sets discernible by the formal context. In addition, the nodes on the GCL can be ordered as general concepts. While each object class is identified with a general extent, the corresponding general intent as its associate property may be acquired from the generalisation of FCL-intent and RSL-intent. Although all the general intents can be directly determined per read out from the formal context, as will be demonstrated in the next paper, one chooses here a constructive approach for them by means of certain irreducibility conditions. It is through the design of such irreducibility conditions that one can reveal how the traditional RSL and FCL being regained as part of GCL. In Sec. 4, results are summarised to affirm the accomplishments of goals G1 to G5. There are also discussions concerning further developments for the general theory of concept lattice.
2 preliminaries and beneath
The formal context as stated in the FCA approach Wi05 is defined as a set structure , for which and are sets while is binary relation between and , i.e. ; the elements of and are called (formal) objects and (formal) attributes, respectively, and , i.e. , is read: the object has the attribute . However, for the RST theorists there is a different convention, governing similar functions, where the information system (IS) Pa82 is often employed as a synonym of formal context. Hence, in order to prevent from confusion caused by simultaneously treating both theories there is a compromise that preserves most of the original notations employed in FCA and RST as follows.
Definition 2.1**.**
A formal context is a set structure (let it also be denoted by ), for which and are sets while is binary relation between and , i.e. ; the elements of and are called formal objects and formal attributes, respectively, and , i.e. , is read: the object has the attribute .
Here, the notation is to emphasise that and are related, conceived through the formal context , which is instructive whenever more than one formal contexts are simultaneously treated. In particular, the definition given in Ref. Wi05 is modified into Definition 2.1 in the following manner.
- •
The original notation used in FCA is reserved for the derivation operator, which is typically a map from subset to subset. is a binary operation which relates a single object (attribute) to an attribute-set (object-set), denoted by Wi82 ; GW99 ; Wi05 where is an attribute-set and is an object-set.
- –
(): the object has the attribute ,
- –
(): is one of the objects that possess the attribute .
Based on , one may define all the derivation operators for both FCA and RST, i.e. , and , as will be clarified in Definition 2.2.
- •
The word formal is kept for the purpose to distinguish objects () and attributes (), in contrast to the traditional approaches where only set operations are employed for both objects and attributes.
- –
Members of are to be treated formally like objects. In practice, there is a collection of individuals to be categorized into classes. Thus, the operations for objects are those which are apt to manipulate set relations, say intersection (), union () and complementarity ().
- –
Members of are to be treated formally like attributes. It is natural to extend simple attributes into the composite attributes by means of the Boolean operations. Here,
the product “” () is employed for the conjunction, i.e. the logical AND,
the summation “” () is employed for the disjunction, i.e. the logical OR,
the unary operation “” is employed for the negation, i.e. the logical NOT.
A brief summary in terms of the above new notation for the traditional approaches in FCA and RST is in order.
Definition 2.2** (Wi82 ; GW99 ; Wi05 ; DG03 ; YY04 ).**
Given a formal context , the derivation operators are
[TABLE]
where, notably, is equivalently since iff .
Subsequently, it is known that the maps in Eqs (1-3) manifest the following relations, for and ,
[TABLE]
[TABLE]
where the same relations are also applied to and . For convenience, let one employ the notions extent and intent Wi82 ; GW99 for both FCL and RSL such that the two theories can be handled on an equal footing.
Definition 2.3**.**
Consider and subject to the formal context .
- •
The 2-tuple is called an FCL concept if and , where is the FCL extent, is the FCL intent Wi82 ; GW99 .
- •
The 2-tuple is called an RSL concept if and , where is the RSL extent, is the RSL intent.
Notably, the 2-tuple with has been referred to as object-oriented RSL concept DG03 ; YY04 , in contrast to the case of property-oriented concept defined through . However, one will ignore the consideration of since it can be derived from by means of interchanging the object and attribute. In addition, as a consequence of Eq. (10),
[TABLE]
telling YY04 that if is a concept appropriate for then is a concept appropriate for and vice versa. It is remarkable that Definition 2.3 provides fundamental ingredients for the original FCL and RSL GW99 ; YY04 . Based on Eq. (6), a -tuple that fulfils is guaranteed to exist and can be written as or , while a -tuple that fulfils is guaranteed to be existent and can be written as or . Moreover, the Galois connection composed by the two posets and can be accomplished in terms of the concepts by virtue of Eq. (9).
To proceed with the general theory of concept lattice, it is of crucial importance to clarify the notions contained in formal objects and formal attributes. Since a formal context discerns the objects based on the attributes they possess, the objects which possess the same attributes are grouped into an equivalent class.
Definition 2.4**.**
The discernible object classes in the perspective of are definite. One will call these -distinct subsets of and denote each such class with in the sense that ’s are regarded as distinct entities by . The total number of these -distinct object classes is less than or equal to , which is the cardinality of . In effect, , and iff . Hence, one has the collection of -distinct object classes based on the equivalence relation “ iff ”, which means that for some . Likewise, the -codistinct subsets are gathered as .
In contrast to the formal objects, the formal attributes can also emerge in a composite manner. One is free to construct various new attributes out of the given attributes by means of the standard Boolean operations, thereby forming an extended framework for the general consideration.
Definition 2.5**.**
Given a set of attributes, the set of generalised attributes over is defined as , where iterates over all the possibilities of Boolean functions in which the concerned operations are “” (conjunction), “” (disjunction) and “” (negation). To be concrete, is meant to denote whatever one may construct out of by means of arbitrary compositions provided by the Boolean functions. In contrast to the members of , which may be composite, the attributes in are then referred to as simple attributes.
Intuitively, also comprises the set itself as well as the conventional truth and falsity . Moreover, by means of negation one may define several interesting subclasses of .
Definition 2.6**.**
Let be certain collection of attributes of interest. A conjugate class is referred to as one of the possibilities of . Accordingly, the set of all conjugate classes is denoted with . The symmetrised attribute set is employed to extend by means of negation: .
One could also employ the set of generalised attributes as with any . It is noteworthy that prescribing , either by or by Definition 2.5, may encounter abundant choices of Boolean function . Technically, such abundance can be reduced by means of normal forms, where one is in particular interested in the conjunctive normal form (CNF) and the disjunctive normal form (DNF). The other choice to suppress the redundance is to consider , where differs from by that no further negation is needed (cf. Definition 2.5). In what follows, the particular significance of the generalised attributes over to the structure of will be presented step by step.
Attributes are considered to be further restrained through the attribute-object correspondence provided in the formal context .
Definition 2.7**.**
According to Definition 2.5 any generalised attribute can be regarded as , namely that is a (Boolean) function of , say , for certain and for some . Subject to , the contextual (Boolean) function for any generalised attribute is obtained from the expression of by (1) replacing the concerned attribute with (), (2) replacing the concerned negative attribute with , (3) replacing the conjunction and disjunction operations with and , respectively. Notably, the contextual function is essentially a subset of .
For the extreme case, one has “” such that . The contextual function can represent all the information contained in the formal context since the contextual function assigns every attribute, say , in with a definite object subset which possesses in common. Furthermore, such idea can be extended.
Lemma 2.8**.**
From one can deduce the extended formal context without any additional assumption in the sense that can be determined by means of , which is called “ is well defined in ”.
Proof.
If is well defined in then one can achieve the extended formal context such that each attribute in is equipped with an object set which possess the attribute in common, similar to the case of . Here, the proof of “ is well defined in ” can be carried out in two steps. It is to show that (1) (Definition 2.7) for such that is well defined in and (2) both and are well defined . Once these are done, one will end up with “ is well defined in ” since can be obtained as by means of “” and “” (see the discussion after Definition 2.6).
- (1)
For “” can be interpreted as “”. In other words, , hence, , which gives rise to after taking complementarity. Moreover, if the negated attributes are also taken into account, the same interpretation implies that . This is consistent since coincides with : .
- (2)
Since is well defined in , one may consider that in . Likewise, .
∎
Definition 2.9**.**
A contextual Venn diagram (or Euler diagram) subject to can illustrate the set relations among the contextual functions of attributes in . One can achieve by encircling the object-set within the object collection , where the object-labellings are ignored. In contrast, may also have its own intrinsic logical structure in terms of the (conventional) Venn digram .
In practice, there are two types of ordering systems for attributes. As the first type, “” is concerned with the intrinsic ordering of the entities in . For instance, denotes that the region representing are included in the region representing on . The second type of ordering system “” is employed for because s are in fact object sets. Notably, preserving the object labellings in the contextual Venn digram will restore all the information content of the formal context. The contextual Venn diagram with explicit object labellings turns out to be an alternative representation for the formal context. Once is given, Lemma 2.8 becomes a rather intuitive result since one may read off every for . The other point is that the formal context does not per se suggest any intrinsic relation among members in . Attributes in can be basically independent, hence, any two attributes have an intersection on the corresponding Venn diagram (). However, it is also allowable that one imposes additional constraints on by removing some of the disjoint regions from .
Lemma 2.10**.**
Subject to the formal context ,
- •
,
- •
* but ,*
- •
* but .*
Proof.
- •
If but then the existence of the object set is contradictory.
- •
since from it follows that (Lemma 2.8). If then and/or which means that and/or . Consequently, and/or , which contradicts .
- •
For one may assume in which . Then, according to Lemma 2.8. Moreover, if but , contraction occurs since implies which denies . Thus, .
∎
The above shows that the distinction between and is stronger than the pair . More concretely, the number of disjoint regions on is , since is less discernible than , thus 111 The formal context with will be named degenerate formal context LLJD12-2 , because the Grsp is then equivalent to Gfcp (Proposition 3.4) at each general extent. Although the degenerate formal context is not a practical example for the problem of categorisation, it provides a very useful theoretical tool for inspecting the GCL structure in every detail LLJD12-3 LLJD12-4 ..
Definition 2.11**.**
The contextual equivalent class of attribute subject to is an attribute-set defined in the following manner. Given , one may collect all the attributes which have the same contextual function (Definition 2.7) with as . Accordingly, since , the idea can be employed with respect to definite object-set in the sense that for .
It is noteworthy that the construction is in fact the precursor of the concept on the GCL. Here, the remaining condition lies on determining whether an arbitrary subset of can form a node of the GCL. Remarkably, a subset of with trivial contextual equivalent class, i.e. , can hardly participate in the categorization provided by the formal context since no attribute can be found to label it. The point is that any of the -distinct subsets of (Definition 2.7), say , is readily a smallest possible subset equipped with non-trivial contextual equivalent attributes. In other words, the -distinct subsets correspond to the smallest regions on the contextual Venn diagram (Definition 2.9), hence, there exists no attribute, say , whose contextual function exactly encloses a region which is smaller than the region marked by . Consequently, for that can be written as the union of -distinct subsets () one has non-trivial , otherwise .
A further instructive notion is the smallest yet non-trivial constituents of the generalised attribute set, for which one may develop two independent non-trivial minimisation aspects. Firstly, one may speak of the minimal attributes of which are not the Falsity and thus correspond to the smallest regions on the Venn diagram . Secondly, subject to , there are classes of irreducible attributes that determines the object classes which participate in the categorisation in a least possible manner. Here, the first non-trivial minimisation aspect is a prevalent issue which deserves clarification.
Lemma 2.12**.**
Given an attribute-set , one may define the set of non-trivial infima and the set of non-trivial suprema for the corresponding generalised attribute-set . In the case of , one denotes that without any attribute which fulfils . Both and can be expressed in terms of (Definition 2.6):
- •
, is called -atom, which is considered as an atom for .
- •
, is called -coatom.
Proof.
One may rewrite as “”.
- •
For any one may write down the DNF as , where the range of is up to the given . Hence, . On the other hand, assume but for . However, attribute may take the form in DNF, which apparently implies . One concludes then .
- •
. Subsequently, based on the above result since can be identified as for some , which implies that , i.e. . On the other hand, assume but for . However, attribute may take the form in CNF, which apparently implies . Therefore, .
∎
Corollary 2.13**.**
Whenever there are no intrinsic conditions pre-imposed on the attributes in , the cardinalities are as follows.
- •
The numbers of non-trivial suprema and infima for are equivalent: .
- •
There are distinct generalised attributes in , i.e., .
Proof.
Whenever there are no intrinsic conditions pre-imposed on the attributes in , the elements of can be represented by the disjoint regions on .
- •
Alternatively, by Definition 2.6 one may compute that . The elements of amount to the complementary parts of the disjoint regions on , which demonstrates an one-to-one correspondence; .
- •
Denote the members in with where . An attribute in , say , can take the expression in DNF. In other words, the attributes in may be implemented by where ranges over all the possibilities for . Counting all these possibilities is equivalent to enumerating the number of the power sets of a set with members, i.e., .
∎
The second non-trivial minimisation aspect mentioned above is related to the formal context.
Definition 2.14**.**
A generalised attribute composed of the disjunction of attributes in (Definition 2.6) is called an irreducible disjunction subject to if eliminating any term in the disjunction from , which results in , always causes . Accordingly, an -irreducible disjunction class can be defined as , see Definition 2.11. On the other hand, a generalised attribute composed of the conjunction of attributes in is called an irreducible conjunction in subject to if eliminating any term in the conjunction from , which results in , always causes . The -irreducible conjunction class is given as .
In practice, can be written in terms of for some (Definition 2.6) in which . Likewise, can be written in terms of for some where . Generally speaking, and . The other useful expressions are
[TABLE]
Note that only when and only when . What follows is mainly concerned with regaining the FCL and RSL nodes on the GCL.
Lemma 2.15**.**
- •
* and, furthermore, .*
- •
* and .*
Proof.
- •
Any attribute (Definition 2.6) is non-composite and can be thus regarded as both an one-term disjunction and an one-term conjunction. Therefore, is simultaneously an -irreducible disjunction and an -irreducible conjunction. Moreover, in which and in which .
- •
For , let where for some . Accordingly, but , which implies that but . It then turns out that
[TABLE]
where with . Therefore, . Similarly, starting with , one will end up with .
∎
3 obtaining the general concept lattice
The general theory of concept lattice is capable of providing the categorisation for the formal objects according to members of . First of all, one observes that one could have replaced both FCL and RSL intents in the original constructions in terms of single composite attributes without altering their information contents.
Lemma 3.1**.**
Let , for the formal context . The concepts for FCL and RSL (Definition 2.3) can be both expressed in terms of in the following manner.
- •
For FCL, use , where is referred to as a formal concept property (fcp). Imposing on is indistinguishable from imposing on .
- •
*For RSL, use , where is referred to as a rough set property (rsp). Imposing on is indistinguishable from imposing on . *
Proof.
- •
. Reversely, since both and consist of distinct members in , . Thus, and it is appropriate to employ . Moreover, by Eq. (1) . Consequently, .
- •
. Reversely, since both and consist of distinct members in , . Thus, and it is appropriate to employ . Moreover, by Eq. (2) and (3) . Consequently, .
∎
Since the above proof has invoked some of variants of derivation operators (see Eq. (1)-(3)), it is worthwhile to mention about all of such variants:
[TABLE]
where it is intuitive that interchanging the role of and also gives rise to the other set of derivation operators. Another point is that there are particular object-classes which are fundamental in both the FCL and the RSL.
Proposition 3.2**.**
Given a formal context , the object-set is a common extent for FCL and RSL.
Proof.
coincides with and coincides with , where in both cases is identified as .∎
Note that the result of Proposition 3.2 is rather intriguing, albeit true. Since the formal context conventionally develops a table, Proposition 3.2 in fact entails that the object-class is recognised as an extent for both the FCL and RSL just because is listed in the table. Now, assume that there is a formal concept for FCL subject to . Then, by virtue of Proposition 3.1 one has with . Consider which is obtained by explicitly including the given correspondence as an additional column in . By Lemma 2.8 is something deducible from , thus, should not provide different concept lattices. However, is now listed in the new table , which implies that must also be an RSL-extent when it is known to be an FCL extent. This is denied by the original FCL and RSL (see the comparison in Ref. YY04 ), therefore, it is unnatural to neglect the role the composite attribute may play just because it is not simple (Definition 2.5). Hereafter, the general theory of concept lattice will proceed in a different way from the original ones by attempting a democratic consideration for all the members in .
The concept lattice subject to inevitably makes reference on its accompanied extended formal context, i.e. . Since is well defined (Lemma 2.8), it is straightforward to manipulate in parallel to the conventional formal context.
Definition 3.3**.**
Following from Definition 2.2, the derivation operators appropriate for are given as
[TABLE]
Obviously, Eq. (6) to (10) remain valid after the substitutions “, , ”.
Proposition 3.4**.**
Subject to , the general concept can be obtained as a generalisation of the original FCL and RSL concepts (Lemma 3.1) as follows. If and then is called the general extent, where the allowable general extents are collected as . In addition, is the generalised rough set property (Grsp) and is the generalised formal concept property (Gfcp).
Proof.
The general concept here is the consequence of applying Proposition 3.2 to , which is telling that is a common extent of FCL and RSL in the perspective of . In practice, represents the dual use of in Proposition 3.1 generalised in the sense of Definition 3.3. Accordingly, as well as .∎
The general extent can then take the form of with or , thereby being referred to as an object class with non-trivial contextual equivalent attributes (Definition 2.11).
Proposition 3.5**.**
Let denote the collection of all the object classes with non-trivial contextual equivalent classes of attributes.
- •
The set is the sigma algebra .
- •
* is only concerned with unions of the members in and . *
- •
Given then .
Proof.
The significance of the sigma algebra generated by a collection of sets is that it exhausts whatever constructable out of the collection by means of iterating various steps of set operations such as intersection, union and complementarity. The sigma algebra is essentially closed under intersection, union and complementarity.
- •
(Proposition 3.4), where is well defined according to Lemma 2.8. Hence, can be obtained from by means of intersection, union and complementarity (Definition 2.7). Moreover, upon representing the member of as for some , the closure relations are as follows.
since ,
since ,
since .
Therefore, is closed under intersection, union and complementarity.
- •
Firstly, in that s.t. . For instance, is satisfied by taking . Moreover, due to the fact that is itself a sigma algebra. On the other hand, since, e.g., can be expressed in terms of . Likewise, since is a sigma algebra. Therefore, . Secondly, because only comprises the disjoint members “” where . Accordingly, one obtains that .
- •
is the smallest collection that is closed under intersection, union and complementarity and contains , therefore, because is a sigma algebra.
∎
Proposition 3.6**.**
*The requirement for the general concept in Proposition 3.4 is satisfied if .
is in effect the full collection of general extents for the GCL.*
- •
* and where in both cases .*
- •
Equivalently, and .
Proof.
It is remarkable for any subset that implies otherwise . In what follows, is reserved for the use of such that and can be defined.
- •
According to Definition 3.3,
,
, which satisfies Proposition 3.4,
,
, ditto.
- •
If then as well as .
Therefore,
and .
∎
The GCL then comprises nodes which correspond to general concepts (Proposition 3.5). Specifically, there is no more need to search for the object classes that satisfy the requirements for general concepts. The general extents are nothing but the object classes equipped with non-trivial contextual equivalent classes of attributes. The GCL thus provides a categorisation for whatever object sets that can be explicitly labelled by attributes. Subsequently, the following conjugate relation considerably reduces the complexity for deducing GCL.
Proposition 3.7**.**
* is a general concept if only if is a general concept,
where and .*
Proof.
Since is a sigma algebra by Proposition 3.5, . Moreover, with Proposition 3.6,
. Likewise,
.∎
Interestingly, although the above result has a similar appearance to Eq. (11), it possesses a completely different meaning. The GCL is self-dual in the sense that the general concepts always appear pairwise. Subsequently, in order to furnish the lattice structure, the way to order concepts as nodes on the lattice remains in question. We now proceed to resolve the question.
Proposition 3.8**.**
Given a formal context ,
- •
if and ,
- •
if and .
Proof.
- •
by Proposition 3.6, hence, . In addition, if one can prove that then the statement is established. Since , there exists some subset s.t. and . Consider then . Clearly, for any one has because . Moreover, , which implies that , where notably for (Proposition 3.5). Consequently, , which implies that .
- •
The above result implies that for . Then, by Lemma 3.7, for which is equivalent to for .
∎
All the general concepts, i.e. , can be deduced in a definite manner based on a few fundamental attributes.
Proposition 3.9**.**
Subject to the formal context
- •
if then for some ,
- •
if then for some , where .
Proof.
Since , Proposition 3.5 entails that for some .
- •
If then (Definition 2.4, Lemma 3.8). Meanwhile, entails that and for some .
- •
If then . Meanwhile, entails that and for some .
∎
Thus, once or is obtained, Proposition 3.9 in practice ensures that the full Grsp’s and Gfcp’s can be unambiguously determined. However, the other instructive issue is that certain particular attributes can be determined without or .
Proposition 3.10**.**
Given a formal context , is an -atom and an -coatom (Lemma 2.12) for . It can be identified that and with , where (Definition 2.6).
Proof.
Since , if then . It turns out that , and . Meanwhile, by Lemma 2.12, telling that unless . Note that for since is the empty object set . Therefore, , where . On the other hand, by Proposition 3.7, where notably .∎
Notably, Proposition 3.10 is concerned with and rather than and respectively required in and . It is also remarkable that based on the results of Proposition 3.10 one may further resolve a truly feasible and realistic construction for GCL LLJD12-2 . However, to recover the RSL- and FCL- nodes in terms of the general theory of concept lattice remains the concern for the moment. Subject to this goal, it is more convenient to adopt the irreducible expressions of Definition 2.14 to find the elements of and , which is described in order.
Proposition 3.11**.**
Subject to the formal context the Grsp and Gfcp can be expressed in terms of irreducible attribute classes:
- •
Expressed in terms of DNF, the Grsp can be simplified (Eq. (12)) as
[TABLE]
- •
Expressed in terms of CNF, the Gfcp can be simplified as
[TABLE]
Proof.
- •
Written in DNF, in which is a product of members of and since . Thus, (cf. Proposition 3.6), where is a subset of which comprises only the attributes given in terms of a product of members of . For the members of , if then there must be such that () and (Definition 2.14). Therefore, , hence, .
Moreover, consider a part of expression of which emerges as with . For , if with or then is absent in since . Consequently, .
- •
By Proposition 3.7, . Subsequently, iff by Lemma 2.15. Therefore, .
Similar to the case for , consider in which within the expression . For , if with or then . Consequently, .
∎
Apparently, the above results are consequences of Proposition 3.6.
Proposition 3.12**.**
Given a formal context , one can obtain the contextual truth and falsity and , in contrast to the conventional truth and falsity and in the following way,
- •
* and ,*
- •
* and ,*
* and , .*
Accordingly, since it behaves like the falsity for Grsp, while since it behaves like the truth for Gfcp. In addition, from Proposition 3.7, it follows that . Moreover, for the truth of Grsp , while for the falsity of Gfcp .
Proof.
- •
by Proposition 3.8 and 3.10 since for .
On the other hand, by Proposition 3.7.
- •
On employing Proposition 3.6,
,
,
,
.
∎
Corollary 3.13**.**
- •
* and .*
- •
* and ,*
where and for .
Proof.
These are direct consequences of Proposition 3.12, upon using the formulae in Proposition 3.11.
- •
and .
- •
and .
∎
Hence, one may also replace and by and , respectively. Here, one may recover the full general concepts via Proposition 3.8 in terms of the irreducible attribute classes given in Definition 2.14. According to Proposition 3.9 and based on and , one then obtains Grsp’s in DNF and the Gfcp’s in CNF. We now proceed to consider the corresponding lattice structure.
The ordering of general concepts can be constructed in an unambiguous way, thereby forming the desired Galois connection.
Proposition 3.14**.**
- •
For iff and .
- •
*For iff and . *
Proof.
- •
By Lemma 2.10, . Here, means , which implies . Thus, . On the other hand, if then . Consequently, if or then , i.e., . Moreover, and . Therefore, .
- •
With Definition 3.3, one has the ordering rules
and ,
which is the consequence of Eq. (9) extended to . Therefore,
[TABLE]
by Proposition 3.4.
∎
Proposition 3.15**.**
The conventional FCL and RSL nodes can always be recovered from the GCL:
- •
The criterion for being an RSL extent, which is by , can be satisfied with
.
The criterion for being an FCL extent, which is by , can be satisfied with
.
- •
Whenever is an FCL and/or RSL extent,
*the corresponding RSL intent can be faithfully read out from , *
the corresponding FCL intent can be faithfully read out from .
Proof.
- •
The criteria for being the extent can be rearranged by means of Eq. (13).
For , it turns out that . Note that if then , thus, . Therefore, for . On the other hand, for , one requires that . Likewise, if then , thus, . Consequently, for .
- •
By Proposition 3.11,
[TABLE]
where according to Lemma 2.15. Moreover, by Eq. (13). Therefore, if then . Similarly,
[TABLE]
where by Lemma 2.15 and Eq. (13). Therefore, if then .
∎
Remarkably, the roles the RSL and FCL play within the GCL can be understood easily. The components of an RSL intent are those simple attributes (Definition 2.5) found from the Grsp in DNF, whereas the components of an FCL intent are simple attributes found from Gfcp in CNF. Moreover, the RSL is only able to categorise those object classes which are expressible in terms of a union of ’s for some ’s in , whereas the FCL is only able to categorise those object classes which are expressible in terms of an intersection of ’s for some ’s in .
One may construct the GCL in terms of lattice theory Wi82 ; YY04 ; BH00 .
Proposition 3.16**.**
Based on the collection of all the general concepts , the GCL exhibits a lattice structure , where (meet) and (join) are binary operators and the unary operator provides a kind of duality:
- •
* defines the partially ordered set (poset) with the unambiguous order .*
- •
The lattice operations , and are well defined.
* is self-dual in the sense that iff . reverses the ordering of lattice nodes in the sense that iff . *
- •
* is complete in the sense that the supremum and infimum are found to be and , respectively, where is the full collection of objects.*
Proof.
For all , let .
- •
Because iff (Proposition 3.14), it is natural to define via .
- •
, one has
[TABLE]
where is based on Proposition 3.7.
Moreover, iff as or .
- •
Given , let in which . Then,
[TABLE]
Since , it turns out that
[TABLE]
by Proposition 3.8, 3.12.
∎
It is also noteworthy that within the expression one may regard as a prescription for the general intent corresponding to the general extent . Such a prescription is convenient for the discussion concerning the RSL and FCL in relation to the GCL.
4 discussion
One has proved the existence of the general concept lattice above the conventional framework of RSL and FCL, following the anticipated accomplishments G1-G5 proposed in Sec. 1. The formal context automatically induces the extended formal context (Lemma 2.8). In effect, based on there are intrinsic relations between the objects in and the members of the full generalised attribute set , which all pertain to the indispensable structure of the content information. It is by virtue of that one can assure the 2-tuple general concept expression in terms of for G1, where the general extents emerge as the object classes allowable within GCL and the general intents their corresponding property descriptions. From Proposition 3.4 and 3.6 one notices that the general intent can be represented by the pair 222 This is a fact that is further supported by Proposition 3.14, which entails that both the generalised rough set property and generalised formal concept property are ordered simultaneously.
(generalised rough set property, generalised formal concept property) , i.e. (Grsp,Gfcp).
Moreover, the general extents are found to be the collection in view of Proposition 3.5 and Proposition 3.6, where in particular iff . Note that G3 is achieved as follows. Since in effect comprises whatever distinctive object classes, making the GCL constitute a Hasse diagram with nodes (G2), it is certain on the GCL that one can always find all the nodes corresponding to the object classes categorised according to the RSL and the FCL, respectively. Indeed, Proposition 3.15 also provides systematic ways to regain both the RSL intent and FCL intent from the irreducible expressions of Grsp and Gfcp, respectively. Another point is that the GCL permits deterministic construction (G4), as is the method proposed by Lemma 3.8, 3.11 and Corollary 3.13. Though to complete the general concept construction in such a manner turns out to be inefficient, the method is rather elucidative for the GCL in relation to the conventional lattices. Especially, one may notice that the freedom involved in the formal context can be significantly reduced. The GCL in fact emerges as a self-dual lattice bearing the structure of Proposition 3.16 based on the one-to-one correspondence demonstrated in Lemma 3.7, which is the conjugateness relation needed by G5. Obviously, the self-duality halves the complexity for completing the general intents.
Being an original construction, the present approach of GCL has thus far focused on its comparabibility with the existing approaches. To this end, one found the employment of Grsp in DNF and Gfcp in CNF particularly instructive for carrying out such comparisons.
- •
and can be regarded respectively as the generalisation for RSL intent and the generalisation for FCL intent (Proposition 3.1, 3.4):
Without altering its theoretical content, one may replace the RSL intent by its disjunction while , which can be intuitively arranged into an expression in DNF.
The FCL intent can be represented by its conjunction while , which can be arranged into an expression in CNF.
- •
One has acquired more precise descriptions for the properties of the object class through and than through the RSL intent and FCL intent:
Observe that since obviously . If is an RSL extent, suggests that there are in fact more properties peculiar to the class to be considered, in contrast to .
Likewise, (). Namely, if is an FCL extent then , in contrast to , in fact incorporates more properties the objects in the class can possess in common.
Technically, the general concept construction reflecting the above conventional wisdom of the traditional FCL and RSL can be accomplished via Lemma 3.8, 3.11 and Corollary 3.13, which requires the determination of various irreducible attributes (Definition 2.14). Unfortunately, such a method remains tedious and impractical even though the irreducible attributes are constructable. However, things turn out drastically different when one instead considers the Gfcp in DNF and the Grsp in CNF. In Ref. LLJD12-2 it will be shown that the Gfcp can be identified as components of the contextual truth and the contextual falsity defined in Proposition 3.12. Moreover, the determination of the general intents can be implemented by binary representations which label the object classes. Noteworthy is also that the contextual truth (or falsity) also plays the crucial role for the logical deduction in the frame of classical logics. In practice, all the rules of implication concerned with the attributes will be shown to be derivable based on (or, equivalently, from ). We shall present the tractability in both obtaining the full general concepts and determining all the accompanied rules of logic deduction for the general concept lattice in the next paper LLJD12-2 .
In conclusion, the GCL provides the comprehensive categorisation for whatever distinctive objects according to whatever properties one can refer to in the formal context. The general extents are collected as as opposed to Proposition 3.15, which states 333More precisely, and , as will be further clarified in Ref. LLJD12-2 . Note that in their constructions (the empty object set) is always included as an RSL extent and as an FCL extent in view of the completness of lattice Wi82 ; YY04 . that
[TABLE]
Clearly, is also referred to as (Proposition 3.5), meaning that the categorisation the GCL can accomplish ranges over all the subsets of that are distinct from the perspective of formal context. It is also remarkable that the general intent is more adequately represented by the equivalent class of attributes meaning that refers to all the attributes the objects in possess in common. Here, and simply appear as the bounds of . It can be further shown LLJD12-2 that every general intent is distinctive in the sense that iff and . Thus, the GCL not only provides the categorisation for all the discernible subsets of but also exhausts the full generalised attribute set .
acknowledgements
The authors are grateful to Dr. Arthur Chen of Tamkang University who asked for clarifying some mathematical definitions from the FCA theory in the year 2012, a crucial issue that inspired the present treatise. This paper is partially supported by Ministry of Science and Technology, Taiwan (Grant number: MOST 105-2633-E-001-001).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) R. Wille, Restructuring lattice theory, In Rival, I., editor, Ordered Sets. Reidel, Dodrecht. (1982) 445–470.
- 2(2) B. Ganter, R. Wille, Formal Concept Analysis: Mathematical Foundation, Springer 1999.
- 3(3) R. Wille, Formal Concept Analysis as Mathematical Theory of Concepts and Concept Hierarchies in B. Ganter et al. (Eds.): Formal Concept Analysis, LNAI 3626 (2005) 1–33.
- 4(4) Z. Pawlak, Rough sets. International Journal of Computing and Information Sciences. 18 (1982) 341–356.
- 5(5) Z. Pawlak, Rough sets, Theoretical Aspects of Reasoning About Data, Kluwer Academic Publisher 1991.
- 6(6) R. E. Kent, Rough concept analysis: a synthesis of rough sets and formal concept analysis, Fundamenta Informaticae, 27, 169-181, 1996.
- 7(7) I. Düntsch, G. Gediga, E. Orlowska, Relational attribute systems, International Journal of Human-Computer Studies. 55 (2001) 293–309.
- 8(8) G. Gediga and I. Düntsch, Modal-style operators in qualitative data analysis, Proceedings of the 2002 IEEE International Conference on Data Mining, (2002) 155-162.
