Supervised learning model for parsing Arabic language

Nabil Khoufi; Chafik Aloulou; Lamia Hadrich Belguith

arXiv:1410.8783·cs.CL·November 3, 2014·5 cites

Supervised learning model for parsing Arabic language

Nabil Khoufi, Chafik Aloulou, Lamia Hadrich Belguith

PDF

Open Access

TL;DR

This paper presents a supervised machine learning approach using SVMs for parsing Arabic, addressing resource scarcity and demonstrating promising results on the Penn Arabic Treebank.

Contribution

It introduces a novel SVM-based parsing method tailored for Arabic language, leveraging existing annotated corpora for improved syntactic analysis.

Findings

01

Encouraging parsing accuracy results

02

Effective SVM-based label selection

03

Validated on Penn Arabic Treebank

Abstract

Parsing the Arabic language is a difficult task given the specificities of this language and given the scarcity of digital resources (grammars and annotated corpora). In this paper, we suggest a method for Arabic parsing based on supervised machine learning. We used the SVMs algorithm to select the syntactic labels of the sentence. Furthermore, we evaluated our parser following the cross validation method by using the Penn Arabic Treebank. The obtained results are very encouraging.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Algorithms and Data Compression