riotu-lab commited on
Commit
f2f0bcf
1 Parent(s): faaa9a4

Update readme.md

Browse files
Files changed (1) hide show
  1. README.md +71 -3
README.md CHANGED
@@ -1,3 +1,71 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - ar
5
+ tags:
6
+ - ArabianGPT
7
+ widget:
8
+ - text: "أعلنت وزارة الحج في المملكة العربية السعودية"
9
+ example_title: "مثال ١"
10
+ - text: "يبدو اليوم جميلا، سأقوم بتحضير"
11
+ example_title: "مثال ٢"
12
+ - text: "إن التقنيات الحديثة"
13
+ example_title: "مثال ٣"
14
+ ---
15
+
16
+ # ArabianGPT 1.5B Model Overview
17
+
18
+ ## Disclaimer for the Use of Large Language Models (LLMs) for Text Generation
19
+
20
+ <p style="color: red;">We disclaim all responsibility for any harm, inaccuracies, or inappropriate content generated by ArabianGPT-1.5B, and users engage with and apply the model's outputs at their own risk.</p>
21
+
22
+ > **Important Note:** Currently, we offer a raw pre-trained model. Our team is actively working on releasing instruction-based LLMs that are fine-tuned and augmented with LRHF. The first set of pre-trained models has been made available for community exploration. While we do have models fine-tuned for specific tasks such as summarization and sentiment analysis, they are still in the development phase.
23
+
24
+ ## How you can use this Pre-Trained Model?
25
+ You are invited to utilize this pre-trained, native Arabic language model as an experimental tool to assess its capabilities, aid in its fine-tuning, and evaluate its performance across a variety of downstream tasks. We encourage you to review our technical report for a comprehensive understanding of the model's performance metrics and the specific downstream tasks it has been tested on. This will provide valuable insights into its applicability and effectiveness in diverse applications.
26
+
27
+ ## Introduction
28
+ ArabianGPT-1.5B, part of the ArabianLLM initiatives, is a specialized GPT model optimized for the Arabic language. Developed at Prince Sultan University's Robotics and Internet of Things Lab, this model is a significant advancement in natural language modeling and generation for Arabic, addressing the language's unique challenges.
29
+
30
+ ## Key Features
31
+ - **Architecture**: GPT-2
32
+ - **Model Size**: 1.558 billion parameters
33
+ - **Layers**: 48
34
+ - **Model Attention Layers (MAL)**: 25
35
+ - **Context Window Size**: 1024 tokens
36
+
37
+ ## Training
38
+ - **Dataset**: over 30 billion tokens from a web-scraped dataset
39
+ - **Tokenizer**: Aranizer 64K
40
+ - **Hardware**: 6 NVIDIA A100 GPUs
41
+
42
+ ## Role in ArabianLLM Initiatives
43
+ ArabianGPT-1.5B is crucial for advancing Arabic language processing, addressing challenges unique to Arabic morphology and dialects.
44
+
45
+ ## Usage
46
+ Suitable for Arabic text generation tasks. Example usage with Transformers Pipeline:
47
+ ```python
48
+ from transformers import pipeline
49
+
50
+ pipe = pipeline("text-generation", model="riotu-lab/ArabianGPT-1.5B", max_new_tokens=1024)
51
+ text = ''
52
+ pipe(text)
53
+ ```
54
+
55
+ ## Limitations and Ethical Considerations
56
+
57
+ - The model may have context understanding or text generation limitations in certain scenarios.
58
+ - Emphasis on ethical use to prevent misinformation or harmful content propagation.
59
+
60
+ ## Acknowledgments
61
+
62
+ Special thanks to Prince Sultan University, particularly the Robotics and Internet of Things Lab.
63
+
64
+ ## Contact Information
65
+
66
+ For inquiries: [[email protected]](mailto:[email protected]).
67
+
68
+ ## Disclaimer for the Use of Large Language Models (LLMs) for Text Generation
69
+
70
+ <p style="color: red;">We disclaim all responsibility for any harm, inaccuracies, or inappropriate content generated by ArabianGPT-1.5B, and users engage with and apply the model's outputs at their own risk.</p>
71
+