Dire n'est pas concevoir

Christophe Roche (LISTIC)

arXiv:1002.2034·cs.AI·February 11, 2010·1 cites

Dire n'est pas concevoir

Christophe Roche (LISTIC)

PDF

Open Access

TL;DR

This paper discusses the challenges of extracting ontologies from text, highlighting issues like corpus dependence, mismatch with expert ontologies, and linguistic limitations affecting conceptual understanding.

Contribution

It clarifies the distinction between textual knowledge and ontologies, emphasizing the limitations of current text-based conceptual modeling.

Findings

01

Textual conceptualizations are corpus-dependent and not true ontologies.

02

Ontology extraction from text often mismatches expert-defined ontologies.

03

Linguistic features like ellipsis affect the perception of conceptual content.

Abstract

The conceptual modelling built from text is rarely an ontology. As a matter of fact, such a conceptualization is corpus-dependent and does not offer the main properties we expect from ontology. Furthermore, ontology extracted from text in general does not match ontology defined by expert using a formal language. It is not surprising since ontology is an extra-linguistic conceptualization whereas knowledge extracted from text is the concern of textual linguistics. Incompleteness of text and using rhetorical figures, like ellipsis, modify the perception of the conceptualization we may have. Ontological knowledge, which is necessary for text understanding, is not in general embedded into documents.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLinguistics and Discourse Analysis