|
--- |
|
language: |
|
- en |
|
- hi |
|
license: llama2 |
|
library_name: transformers |
|
tags: |
|
- hindi |
|
- 'english ' |
|
- Bilingual |
|
datasets: |
|
- sarvamai/samvaad-hi-v1 |
|
- aaditya/orca_dpo_pairs-Hindi_ |
|
pipeline_tag: text-generation |
|
--- |
|
# Model |
|
|
|
# π Gaja |
|
Gaja is a Hindi/Hinglish chat model, initially trained on SarvamAI's OpenHathi model and further fine-tuned for conversational interactions. |
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/6487239cca30096ea9f52115/sLgskU1h2Tih_bJo0ktYR.jpeg" width="600" alt="Image"> |
|
|
|
# Additional Information |
|
- It outperforms Airavata, AI4Bharat's chat version, on Huggingface OpenLLM benchmark suite. |
|
- It was fine-tuned on only 1k samples |
|
|
|
|
|
# π¬ Prompt template |
|
|
|
``` |
|
<|im_start|>user |
|
{}<|im_end|> |
|
<|im_start|>assistant |
|
{}<|im_end|> |
|
``` |
|
|
|
# π Features: |
|
- Language Support: Gaja is designed to understand and generate responses in both Hindi and Hinglish, catering to a diverse range of users. |
|
- Base Model: Built upon SarvamAI's OpenHathi model, Gaja inherits its foundational capabilities while being optimized for conversational tasks. |
|
- Fine-tuning: Gaja has undergone fine-tuning specifically for chat-based interactions, enhancing its ability to engage in meaningful conversations with users. |
|
- Experimental Platform: With its flexibility and adaptability, Gaja serves as a valuable platform for conducting experiments and exploring innovative approaches to chatbot development. |
|
|