Spaces: mteb/leaderboard

I've been working on another version of this leaderboard, and I want to share some changes that might interest anyone looking to fork the project or add new tabs.

This PR pulls the per-language configuration settings and the model metadata out of app.py and puts them in two separate configuration files: config.yaml and model_meta.yaml.

With this, app.py goes from 2282 lines to 615 lines.

I believe this makes the app easier to debug and maintain, and new rows can be added through config.yaml.
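As a rough illustration of the config-driven idea (not the actual contents of config.yaml), the sketch below builds tab definitions from a parsed configuration. The dictionary stands in for what a YAML loader would return; the keys `boards`, `title`, and `tasks`, the helper `build_tabs`, and the choice of example datasets are all assumptions made for this example.

```python
# Hypothetical structure standing in for the parsed contents of config.yaml.
# The real keys and layout used in this PR may differ.
CONFIG = {
    "boards": {
        "en": {
            "title": "English",
            "tasks": {
                "Classification": ["Banking77Classification"],
                "Clustering": ["ArxivClusteringP2P"],
            },
        },
        "de": {
            "title": "German",
            "tasks": {"Clustering": ["BlurbsClusteringP2P"]},
        },
    }
}


def build_tabs(config):
    """Return one (board_title, task_type, columns) tuple per board and task type."""
    tabs = []
    for board in config["boards"].values():
        for task_type, datasets in board["tasks"].items():
            # Every table starts with the model column, followed by its datasets.
            columns = ["Model"] + list(datasets)
            tabs.append((board["title"], task_type, columns))
    return tabs


TABS = build_tabs(CONFIG)
```

With a layout like this, adding a new board or task type would only require a new entry in the configuration, with no change to the code that builds the tabs.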

Additionally, I modified the get_mteb_data function. Previously, this function looped through the model list and downloaded every model card each time a new tab was instantiated, which made the leaderboard take more than 30 minutes to initialize on my machine. bbfe97ce caches the model card results during initialization, reducing the initialization time to less than 5 minutes. (The refresh button still works.)
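The caching idea can be sketched roughly as follows. This is a minimal example under stated assumptions, not the code from this PR: `ModelCardCache`, its methods, the cache file name, and the injected `fetch` callable (which stands in for whatever actually downloads a model card) are hypothetical names.

```python
import json
import os
import tempfile


class ModelCardCache:
    """Fetch each model card at most once and reuse it across tabs (hypothetical helper)."""

    def __init__(self, fetch, path=None):
        # `fetch` is any callable mapping a model id to JSON-serializable metadata.
        self.fetch = fetch
        self.path = path or os.path.join(tempfile.gettempdir(), "model_card_cache.json")
        self._cache = {}
        # Reuse results saved by a previous run, if a cache file exists.
        if os.path.exists(self.path):
            with open(self.path) as f:
                self._cache = json.load(f)

    def get(self, model_id):
        # Only call the (slow) fetch function on a cache miss.
        if model_id not in self._cache:
            self._cache[model_id] = self.fetch(model_id)
        return self._cache[model_id]

    def save(self):
        # Persist the cache so later initializations can skip the downloads.
        with open(self.path, "w") as f:
            json.dump(self._cache, f)

    def refresh(self):
        # Drop everything so the next get() fetches fresh data, as a refresh button would need.
        self._cache.clear()
        if os.path.exists(self.path):
            os.remove(self.path)
```

Each tab would then call `cache.get(model_id)` instead of downloading the card itself, so the download cost is paid once per model rather than once per model per tab.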

You can see the changes working here: https://huggingface.co/spaces/pt-mteb/mteb_code_refactor (it should have the same interface and results as the current one).

eduagarcia changed pull request status to open May 1

Fix a weird bug that made the clickable model name fail to render in some boards 349b10b0

Fix column order on refresh a20529c6

eduagarcia

May 1 (edited May 1)

349b10b0 and a20529c6 fix some bugs with model name rendering and with column order when clicking REFRESH on some tabs.

Fix missing German clustering 9066f738

Cache model card metadata to a temporary file to speed up initialization 6f8ad2fa

Clean up some invalid tasks and columns when loading the leaderboard and when using the refresh button 879c7e7b

Muennighoff

Massive Text Embedding Benchmark org May 4

Looks great; from my side we can merge this! @tomaarsen do you have thoughts? 😊

tomaarsen

Massive Text Embedding Benchmark org May 6 (edited May 6)

I agree completely; I think these are excellent changes. It's a big step forward in terms of modularity, and the caching is very welcome. Great job @eduagarcia!
I ran it locally, and I see no further issues.

Do/should we give points for this for MMTEB? cc @KennethEnevoldsen

Tom Aarsen

KennethEnevoldsen

Massive Text Embedding Benchmark org May 6

@tomaarsen do give points for this on MTEB; you can just open a PR with the score file called mteb_leaderboard_106.jsonl

Muennighoff changed pull request status to merged May 6
