Interpr\'etation vague des contraintes structurelles pour la RI dans des corpus de documents XML - \'Evaluation d'une m\'ethode approch\'ee de RI structur\'ee
Eugen Popovici (VALORIA), Gilbas M\'enier (VALORIA),, Pierre-Fran\c{c}ois Marteau (VALORIA)

TL;DR
This paper introduces specialized data structures and approximate search methods for indexing and retrieving information in heterogeneous XML databases, demonstrating competitive performance in the INEX 2005 evaluation campaign.
Contribution
It presents a novel indexing scheme and approximate search mechanisms tailored for heterogeneous XML data, combining structured and free text information.
Findings
Achieved high retrieval performance in INEX 2005
Ranked among the best XML IR systems for VVCAS task
Effective management of heterogeneous XML data
Abstract
We propose specific data structures designed to the indexing and retrieval of information elements in heterogeneous XML data bases. The indexing scheme is well suited to the management of various contextual searches, expressed either at a structural level or at an information content level. The approximate search mechanisms are based on a modified Levenshtein editing distance and information fusion heuristics. The implementation described highlights the mixing of structured information presented as field/value instances and free text elements. The retrieval performances of the proposed approach are evaluated within the INEX 2005 evaluation campaign. The evaluation results rank the proposed approach among the best evaluated XML IR systems for the VVCAS task.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
