CAMeLBERT-UD-parser
Model description
The CAMeLBERT-UD-parser is a neural dependency parsing model for Arabic text, specifically designed for the Universal Dependencies (UD) dependency formalism. It is based on the Biaffine Attention Dependency Parsing model introduced by Dozat and Manning (2017) and implemented in SuPar, which has been shown to be very effective for dependency parsing in many languages. The model is trained on the NUDAR (NYUAD Universal Dependency for Arabic) train set, which is a large Arabic corpus annotated with UD dependency labels. The model uses a CamelBERT-MSA word embedding layer, which is a pre-trained language model that has been trained on a massive dataset of Arabic text. The model was introduced in our paper "CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic". The paper describes the model in detail and evaluates its performance on various Arabic dependency parsing tasks.
Intended uses
The CAMeLBERT-UD-parser is shipped with the CAMeLParser as one of the default parsing models, and can be selected when parsing texts using the UD formalism.
Citation
@inproceedings{Elshabrawy:2023:camelparser,
title = "{CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic}",
author = {Ahmed Elshabrawy and
Muhammed AbuOdeh and
Go Inoue and
Nizar Habash} ,
booktitle = {Proceedings of The First Arabic Natural Language Processing Conference (ArabicNLP 2023)},
year = "2023"
}