MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction
Tongliang Li, Zixiang Wang, Linzheng Chai, Jian Yang, Jiaqi Bai, Yuwei, Yin, Jiaheng Liu, Hongcheng Guo, Liqun Yang, Hebboul Zine el-abidine, Zhoujun, Li

TL;DR
This paper introduces MT4CrossIE, a multi-stage tuning framework that enhances cross-lingual open information extraction by integrating language-specific modules and prompting techniques, significantly improving performance across multiple languages.
Contribution
The paper proposes a novel multi-stage tuning approach with language-specific modules and prompting, advancing cross-lingual OIE beyond shared models alone.
Findings
Effective multi-stage tuning improves cross-lingual extraction accuracy.
Language-specific modules significantly enhance multilingual performance.
Combining model-based and data-based transfer techniques yields superior results.
Abstract
Cross-lingual open information extraction aims to extract structured information from raw text across multiple languages. Previous work uses a shared cross-lingual pre-trained model to handle the different languages but underuses the potential of the language-specific representation. In this paper, we propose an effective multi-stage tuning framework called MT4CrossIE, designed for enhancing cross-lingual open information extraction by injecting language-specific knowledge into the shared model. Specifically, the cross-lingual pre-trained model is first tuned in a shared semantic space (e.g., embedding matrix) in the fixed encoder and then other components are optimized in the second stage. After enough training, we freeze the pre-trained model and tune the multiple extra low-rank language-specific modules using mixture-of-LoRAs for model-based cross-lingual transfer. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
