Navigating multilingual news collections using automatically extracted   information

Ralf Steinberger; Bruno Pouliquen; Camelia Ignat (European Commission; - Joint Research Centre)

arXiv:cs/0609053·cs.CL·May 23, 2007

Navigating multilingual news collections using automatically extracted information

Ralf Steinberger, Bruno Pouliquen, Camelia Ignat (European Commission, - Joint Research Centre)

PDF

Open Access

TL;DR

This paper introduces a multilingual news analysis tool that automatically clusters articles, extracts key entities, links related information, and learns relationships over time to facilitate efficient navigation of large news collections.

Contribution

It presents a comprehensive tool set for multilingual news analysis that integrates clustering, entity extraction, linking, and relationship learning, enabling effective exploration of large collections.

Findings

01

Successfully clusters multilingual news articles

02

Automatically extracts and links entities across languages

03

Learns relationships between entities over time

Abstract

We are presenting a text analysis tool set that allows analysts in various fields to sieve through large collections of multilingual news items quickly and to find information that is of relevance to them. For a given document collection, the tool set automatically clusters the texts into groups of similar articles, extracts names of places, people and organisations, lists the user-defined specialist terms found, links clusters and entities, and generates hyperlinks. Through its daily news analysis operating on thousands of articles per day, the tool also learns relationships between people and other entities. The fully functional prototype system allows users to explore and navigate multilingual document collections across languages and time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies