SMILK, linking natural language and data from the web
C\'edric Lopez (TEXTE), Molka Dhouib (I3S, WIMMICS), Elena Cabrio, (WIMMICS), Catherine Faron Zucker (I3S, WIMMICS), Fabien Gandon (UCA,, WIMMICS), Fr\'ed\'erique Segond

TL;DR
This paper presents SMILK, a system that links natural language data with web data to enrich knowledge bases and improve text analysis, demonstrated through a cosmetics brand information retrieval application.
Contribution
It introduces ProVoc, an ontology for products and brands, and a method for automatically populating a knowledge base from diverse textual sources.
Findings
Effective brand-related information retrieval demonstrated
Successful creation and population of ProVoc ontology
Browser plugin provides enriched knowledge to users
Abstract
As part of the SMILK Joint Lab, we studied the use of Natural Language Processing to: (1) enrich knowledge bases and link data on the web, and conversely (2) use this linked data to contribute to the improvement of text analysis and the annotation of textual content, and to support knowledge extraction. The evaluation focused on brand-related information retrieval in the field of cosmetics. This article describes each step of our approach: the creation of ProVoc, an ontology to describe products and brands; the automatic population of a knowledge base mainly based on ProVoc from heterogeneous textual resources; and the evaluation of an application which that takes the form of a browser plugin providing additional knowledge to users browsing the web.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Web Data Mining and Analysis
