---
license: apache-2.0
datasets:
- HiTZ/casimedicos-squad
language:
- en
- es
- fr
- it
metrics:
- f1
library_name: transformers
widget:
- text: The answer is 2, an EMG, which would be the diagnostic test to confirm the diagnostic suspicion of ALS, the imaging tests both CT, but especially MRI would help us to rule out, but would not confirm the diagnosis.
  example_title: Example 1
- text: Although other hemorrhagic diseases can have a prolonged thromboplastin time, due to the intensity of the lesion and the child's sex and family history, the most likely diagnosis is hemophilia.
  example_title: Example 2
- text: Paradoxical pulse is a drop in blood pressure > 10 mmHg during inspiration;it represents an exaggeration of the physiological phenomenon consisting ofinspiratory lowering of BP (normal up to 10 mmHg). In cardiac tamponade,inspiration, which causes an increase in blood flow to the right chambers,increasing their volume, secondarily causes a displacement of theinterventricular septum to the left, so that the left heart lodges andexpels less blood during systole and the pulse, therefore, decreases. In anormal heart this exaggerated displacement, caused by the pressure exertedby the tamponade on the RV free wall, does not occur. Sinus X represents thesystolic collapse of the venous pulse, i.e., the pressure drop due to atrialrelaxation (also partly due to a downward displacement of the RV base duringsystole). Sinus Y represents the diastolic collapse of the venous pulse,i.e., the pressure drop that occurs from the moment blood enters thetricuspid valve into the ventricle. In cardiac tamponade, the deep sinus Xis characteristic. In constrictive pericarditis, the deep Y sinus. For allthese reasons, the correct answer is 5.
  example_title: Example 3
pipeline_tag: token-classification
---

<p align="center">
    <br>
    <img src="http://www.ixa.eus/sites/default/files/anitdote.png" style="width: 45%;">
    <be>


# mDeBERTa-base for Multilingual Correct Explanation Extraction in the Medical Domain


This model is a fine-tuned version of [mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) for a **novel extractive task**
which consists of **identifying the explanation of the correct answer** written by medical doctors. The model
has been fine-tuned using the multilingual [https://huggingface.co/datasets/HiTZ/casimedicos-squad](https://huggingface.co/datasets/HiTZ/casimedicos-squad) dataset,
which includes English, French, Italian and Spanish.


## Performance

The model scores **74.64 F1 partial match** (as defined in [SQuAD extractive QA task](https://huggingface.co/datasets/rajpurkar/squad_v2)) averaged across the 4 languages.

- **tags:** to delimit explanations of the correct answers and the rest at token level.
  -  0: explanation of the correct answer
  -  1: others
  
<!--<img src="https://raw.githubusercontent.com/hitz-zentroa/multilingual-abstrct/main/resources/multilingual-abstrct-results.png" style="width: 75%;"> -->

### Fine-tuning hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 48
- eval_batch_size: 8
- seed: random
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 20.0

### Framework versions

- Transformers 4.30.0.dev0
- Pytorch 2.1.2+cu121
- Datasets 2.16.1
- Tokenizers 0.15.2

## Citation

If you use this model please **cite the following paper**:

```bibtex
@misc{goenaga2023explanatory,
      title={Explanatory Argument Extraction of Correct Answers in Resident Medical Exams}, 
      author={Iakes Goenaga and Aitziber Atutxa and Koldo Gojenola and Maite Oronoz and Rodrigo Agerri},
      year={2023},
      eprint={2312.00567},
      archivePrefix={arXiv}
}
```

**Contact**: [Iakes Goenaga](http://www.hitz.eus/es/node/65) and [Rodrigo Agerri](https://ragerri.github.io/)
HiTZ Center - Ixa, University of the Basque Country UPV/EHU