Loading…

A Comprehensive Context-Free Grammar for the Arabic Language: Including Non-Fundamentalist Phrases

Dixon's assertion regarding the idiosyncratic nature of natural languages initiates an investigation into the unique characteristics of the Arabic language. Contrary to Dixon's viewpoint, some scholars suggest the presence of regularity within Arabic, attributable to its extensive array of...

Full description

Saved in:
Bibliographic Details
Published in:Ingénierie des systèmes d'Information 2023-06, Vol.28 (3), p.611-619
Main Authors: Yahiaoui, Yasser, Bourouis, Azeddine, Al-Dujaili, Ayad Q., Abdulkareem, Ahmed I., Agboola, Olasumbo O., Okagbue, Hilary I., Adedotun, Adedayo F., Humaidi, Amjad J., Imaga, Ogbu F., Odekina, Onuche G.
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Dixon's assertion regarding the idiosyncratic nature of natural languages initiates an investigation into the unique characteristics of the Arabic language. Contrary to Dixon's viewpoint, some scholars suggest the presence of regularity within Arabic, attributable to its extensive array of syntactic rules and formulations. Yet, the copious volume of terminal vocabulary in Arabic poses significant challenges to grammar development. While annotations have offered partial solutions, they bring forth additional difficulties due to the necessity of retrieving data from the annotated corpora. To mitigate these issues, an innovative study was executed that utilized an annotated taxonomy of syntactic roles, coupled with an examination of both fundamentalist and non-fundamentalist phrases. A codification method was applied to a knowledge base employing the Subsumption Hierarchical Attribute (SHA), enabling the integration of Arabic word classes based on their potential syntactic roles. The SHA acts as an annotation method for deriving a grammar class 02, where classes are coded as terminal vocabulary. Its primary objectives are twofold: to moderate the complexity of the parsing system and to automate the generation of over 1490 distinct possible sentence structures. The study culminated in the development of a novel context-free grammar (CFG) for Arabic, broadening the horizons of language processing techniques.
ISSN:1633-1311
2116-7125
DOI:10.18280/isi.280309