partex-nv commited on
Commit
4a7b1b4
1 Parent(s): 96f521d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -25,7 +25,7 @@ This model aims to bridge the gap in **open-source English to Punjabi translatio
25
  - **Training Data**: 10 million English<>Punjabi parallel sentences from [AI4Bharat's Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2).
26
  - **Evaluation Data**: The model has been evaluated on **1503 samples** from the **IN22-Conv dataset**, which is also available via [IndicTrans2](https://github.com/AI4Bharat/IndicTrans2).
27
  - **Model Architecture**: Based on **LLaMA 3.1 8B** with BF16 precision.
28
- - **Score (chrF++)**: Achieved a **chrF++ score of 28.1** on the IN22-Conv dataset, which is an excellent score for an open-source model. The benchmark chrF++ score for Google Translate is 61.1 (as noted in [this paper](https://arxiv.org/pdf/2305.16307)).
29
 
30
  This is the **first release** of the model, and future updates aim to improve the chrF++ score for enhanced translation quality.
31
 
@@ -133,7 +133,6 @@ Stay tuned for updates, and feel free to contribute or raise issues on Hugging F
133
 
134
  - **Training Data**: [Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2) by AI4Bharat.
135
  - **Evaluation Data**: [IN22-Conv dataset](https://github.com/AI4Bharat/IndicTrans2).
136
- - **Benchmarks**: [Translation Benchmarks Paper](https://arxiv.org/pdf/2305.16307).
137
 
138
  ## Contributors
139
 
 
25
  - **Training Data**: 10 million English<>Punjabi parallel sentences from [AI4Bharat's Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2).
26
  - **Evaluation Data**: The model has been evaluated on **1503 samples** from the **IN22-Conv dataset**, which is also available via [IndicTrans2](https://github.com/AI4Bharat/IndicTrans2).
27
  - **Model Architecture**: Based on **LLaMA 3.1 8B** with BF16 precision.
28
+ - **Score (chrF++)**: Achieved a **chrF++ score of 28.1** on the IN22-Conv dataset, which is an excellent score for an open-source model.
29
 
30
  This is the **first release** of the model, and future updates aim to improve the chrF++ score for enhanced translation quality.
31
 
 
133
 
134
  - **Training Data**: [Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2) by AI4Bharat.
135
  - **Evaluation Data**: [IN22-Conv dataset](https://github.com/AI4Bharat/IndicTrans2).
 
136
 
137
  ## Contributors
138