Edit model card

s2-tdro-Qwen1.5-1.8B-curr

Arxiv | Github

tDRO: Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval. Guangyuan Ma, Yongliang Ma, Xing Wu, Zhenpeng Su, Ming Zhou and Songlin Hu.

This is a fine-tuned tDRO optimized retriever with Sample Ratio Reweighting of tdro-llm/finetune_data.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for tdro-llm/s2-tdro-Qwen1.5-1.8B-curr

Base model

Qwen/Qwen1.5-1.8B
Finetuned
(11)
this model

Dataset used to train tdro-llm/s2-tdro-Qwen1.5-1.8B-curr