Supervised learning model for parsing Arabic language
Nabil Khoufi, Chafik Aloulou, Lamia Hadrich Belguith

TL;DR
This paper presents a supervised machine learning approach using SVMs for parsing Arabic, addressing resource scarcity and demonstrating promising results on the Penn Arabic Treebank.
Contribution
It introduces a novel SVM-based parsing method tailored for Arabic language, leveraging existing annotated corpora for improved syntactic analysis.
Findings
Encouraging parsing accuracy results
Effective SVM-based label selection
Validated on Penn Arabic Treebank
Abstract
Parsing the Arabic language is a difficult task given the specificities of this language and given the scarcity of digital resources (grammars and annotated corpora). In this paper, we suggest a method for Arabic parsing based on supervised machine learning. We used the SVMs algorithm to select the syntactic labels of the sentence. Furthermore, we evaluated our parser following the cross validation method by using the Penn Arabic Treebank. The obtained results are very encouraging.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Algorithms and Data Compression
