Describe CrossEncoder integration with Sentence Transformers
Hello!
Pull Request overview
- Describe CrossEncoder integration with Sentence Transformers
Details
Because `CrossEncoder` already works directly with `AutoModelForSequenceClassification`, it works out of the box. You've got some nice extra behaviour for your rerank method though, so I've placed the Sentence Transformers support below the transformers usage.
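As a rough illustration of the out-of-the-box usage (a minimal sketch; the model name below is a placeholder, not the released checkpoint):

```python
from sentence_transformers import CrossEncoder

# Placeholder model name for illustration only
model = CrossEncoder("your-org/your-reranker-model")

# Score individual (query, passage) pairs
scores = model.predict([
    ("How many people live in Berlin?", "Berlin has a population of about 3.7 million."),
    ("How many people live in Berlin?", "Berlin is the capital of Germany."),
])
print(scores)

# Or rerank a list of documents for a single query
results = model.rank(
    "How many people live in Berlin?",
    [
        "Berlin has a population of about 3.7 million.",
        "Berlin is the capital of Germany.",
    ],
)
print(results)  # list of {"corpus_id": ..., "score": ...}, sorted by score
```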
Congratulations on the release, I think I'll be recommending this model on my GitHub issues. At least, for those for whom the license isn't an issue.
- Tom Aarsen
Hi Tom, thanks for your contribution!
We have tested the capability with `sentence-transformers`. The `model.predict()` works pretty well and returns the desired results. However, `model.rerank()` does not work since we use bfloat16, and `sentence-transformers` returns the ranking scores as a numpy array, which doesn't support bfloat16. We will raise a PR in `sentence-transformers` to fix this issue, and we will merge your PR once the issue is resolved in `sentence-transformers`.
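For illustration, the underlying limitation can be reproduced with plain PyTorch (a minimal sketch, not from the original discussion):

```python
import torch

# bfloat16 scores, as produced by a model loaded with torch_dtype=torch.bfloat16
scores = torch.tensor([0.92, 0.13], dtype=torch.bfloat16)

# numpy has no bfloat16 dtype, so the default tensor -> numpy conversion fails
try:
    scores.numpy()
except TypeError as e:
    print(e)  # e.g. "Got unsupported ScalarType BFloat16"

# Casting to float32 before the conversion avoids the error
print(scores.float().numpy())
```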
Thanks again!
Hello!
I've encountered the same, and so has one of my users: https://github.com/UKPLab/sentence-transformers/pull/2783
I'll be merging @malteos's PR, and then users can use both `predict` and `rank` with the default parameters (e.g. no need for `convert_to_tensor=True`) when installing the bleeding-edge version of Sentence Transformers.
Until then, I've added `convert_to_tensor=True` to the `rank` method. This will cause the method to keep the outputs as bfloat16 rather than convert them to numpy, and it avoids the problem altogether. I think this PR should be ready to go now :)
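For reference, a minimal sketch of what the workaround looks like in use (the model name is again a placeholder):

```python
from sentence_transformers import CrossEncoder

# Placeholder model name for illustration only
model = CrossEncoder("your-org/your-reranker-model")

# convert_to_tensor=True keeps the scores as torch tensors (bfloat16 here),
# so no numpy conversion is attempted and the error above is avoided.
results = model.rank(
    "How many people live in Berlin?",
    [
        "Berlin has a population of about 3.7 million.",
        "Berlin is the capital of Germany.",
    ],
    convert_to_tensor=True,
)
for entry in results:
    print(entry["corpus_id"], float(entry["score"]))
```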
- Tom Aarsen
LGTM, thanks for your work.