Commit 35cdaef
Parent(s): 4954f85
Update README.md

README.md CHANGED
@@ -1,6 +1,7 @@
 ---
 language:
 - en
+license: mit
 tags:
 - text-classification
 - zero-shot-classification
@@ -18,11 +19,14 @@ pipeline_tag: zero-shot-classification
 This model was trained on the MultiNLI, Fever-NLI and Adversarial-NLI (ANLI) datasets, which comprise 763 913 NLI hypothesis-premise pairs. This base model outperforms almost all large models on the [ANLI benchmark](https://github.com/facebookresearch/anli).
 The base model is [DeBERTa-v3-base from Microsoft](https://huggingface.co/microsoft/deberta-v3-base). The v3 variant of DeBERTa substantially outperforms previous versions of the model by including a different pre-training objective, see annex 11 of the original [DeBERTa paper](https://arxiv.org/pdf/2006.03654.pdf).
 
+For highest performance (but less speed), I recommend using https://huggingface.co/MoritzLaurer/DeBERTa-v3-large-mnli-fever-anli-ling-wanli.
+
 ## Intended uses & limitations
 #### How to use the model
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 import torch
+device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")
 
 model_name = "MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
@@ -64,11 +68,11 @@ mnli-m | mnli-mm | fever-nli | anli-all | anli-r3
 ## Limitations and bias
 Please consult the original DeBERTa paper and literature on different NLI datasets for potential biases.
 
-
-If you
+## Citation
+If you use this model, please cite: Laurer, Moritz, Wouter van Atteveldt, Andreu Salleras Casas, and Kasper Welbers. 2022. ‘Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning with Deep Transfer Learning and BERT - NLI’. Preprint, June. Open Science Framework. https://osf.io/74b8k.
 
 ### Ideas for cooperation or questions?
 If you have questions or ideas for cooperation, contact me at m{dot}laurer{at}vu{dot}nl or [LinkedIn](https://www.linkedin.com/in/moritz-laurer/)
 
 ### Debugging and issues
-Note that DeBERTa-v3 was released
+Note that DeBERTa-v3 was released on 06.12.21 and older versions of HF Transformers seem to have issues running the model (e.g. resulting in an issue with the tokenizer). Using Transformers>=4.13 might solve some issues.
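The second hunk cuts the "How to use the model" snippet off at the tokenizer line. For context, a minimal sketch of how such an NLI snippet typically continues, using the `device` variable this commit introduces; the premise/hypothesis strings and the label order are illustrative assumptions, not content of this commit.

```python
# Sketch continuing the snippet shown in the diff; the example texts and
# the label order are illustrative assumptions, not part of this commit.
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")

model_name = "MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).to(device)

premise = "I first thought that I liked the movie, but upon second thought it was actually disappointing."
hypothesis = "The movie was good."

# Encode the premise-hypothesis pair and run a forward pass without gradients.
inputs = tokenizer(premise, hypothesis, truncation=True, return_tensors="pt").to(device)
with torch.no_grad():
    logits = model(**inputs).logits

# MNLI-style models output three classes; this particular ordering is an assumption.
label_names = ["entailment", "neutral", "contradiction"]
probs = torch.softmax(logits[0], -1).tolist()
print({name: round(p * 100, 1) for name, p in zip(label_names, probs)})
```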
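The hunk header also carries the card's `pipeline_tag: zero-shot-classification` context line. As a hedged illustration, the same checkpoint can be driven through the standard Transformers zero-shot pipeline; the input text and candidate labels below are made up.

```python
# Hedged sketch: zero-shot classification via the standard pipeline API.
# The sequence and candidate labels are made-up illustrations.
from transformers import pipeline

classifier = pipeline("zero-shot-classification",
                      model="MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli")

sequence = "Angela Merkel is a politician in Germany and leader of the CDU"
labels = ["politics", "economy", "entertainment", "environment"]
print(classifier(sequence, labels, multi_label=False))
```

`multi_label=False` treats the candidate labels as mutually exclusive, which matches the NLI-style entailment scoring this model was trained for.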
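For the `Transformers>=4.13` caveat added under "Debugging and issues", a quick way to check the installed version before debugging tokenizer errors; the 4.13 threshold comes from the diff itself, while using `packaging` for the comparison is my assumption.

```python
# Verify the installed Transformers version against the >=4.13 threshold
# mentioned in this commit's "Debugging and issues" note.
import transformers
from packaging import version  # assumption: packaging is available (it ships as a transformers dependency)

assert version.parse(transformers.__version__) >= version.parse("4.13"), \
    "Upgrade with: pip install --upgrade 'transformers>=4.13'"
```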