Question about model types

by cyt79 - opened Jun 15, 2023

Jun 15, 2023

Hi, thanks for sharing all of these great models! I'm wondering if you can tell a bit more about which model can be used as bi-encoder and which can be used as cross-encoder. For instance, does it make sense to use this model to initialise CrossEncoder of Sentence-Transformers as shown below?

from sentence_transformers import CrossEncoder
model = CrossEncoder('Muennighoff/SGPT-2.7B-weightedmean-nli-bitfit')

Muennighoff

Owner Jun 15, 2023

Hey! That does not make sense; The uploaded SGPT models are all Bi-Encoders.
I havn't experiment with sentence_transformers.CrossEncoder - The SGPT methodology for Cross-Encoders is to use the log probabilities of raw pre-trained GPT models like e.g. https://huggingface.co/EleutherAI/gpt-j-6b. You can check the example scripts here for that: https://github.com/Muennighoff/sgpt#cross-encoder

cyt79

Jun 15, 2023

Ah got it. I wasn't sure what models are Bi-Encoders and what models are cross encoders. Thanks for the clarification!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment