Next Generation Language Resources using GRID
Federico Calzolari, Eva Sassolini, Manuela Sassi, Sebastiana, Cucurullo, Eugenio Picchi, Francesca Bertagna, Alessandro Enea, Monica, Monachini, Claudia Soria, Nicoletta Calzolari

TL;DR
This paper explores the use of Grid computing technology to develop next-generation, open, distributed language resources, testing its potential and limitations through experiments on linguistic pattern extraction.
Contribution
It introduces a model for collaborative language infrastructure leveraging Grid computing and evaluates its feasibility with practical NLP experiments.
Findings
Grid computing shows promise for language resource collaboration
Experiments reveal potential and limitations of Grid in NLP tasks
The approach supports scalable linguistic analysis
Abstract
This paper presents a case study concerning the challenges and requirements posed by next generation language resources, realized as an overall model of open, distributed and collaborative language infrastructure. If a sort of "new paradigm" is required, we think that the emerging and still evolving technology connected to Grid computing is a very interesting and suitable one for a concrete realization of this vision. Given the current limitations of Grid computing, it is very important to test the new environment on basic language analysis tools, in order to get the feeling of what are the potentialities and possible limitations connected to its use in NLP. For this reason, we have done some experiments on a module of Linguistic Miner, i.e. the extraction of linguistic patterns from restricted domain corpora.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Distributed and Parallel Computing Systems · Topic Modeling
