chore: add jina-v3 evaluation result on mmteb multilingual #41
Hi! Could you run |
@Samoed on it. Am I missing any other tasks? This is what I'm doing:
import mteb
model = mteb.get_model("jinaai/jina-embeddings-v3")
tasks = mteb.get_benchmark("MTEB(Multilingual, beta)")
evaluation = mteb.MTEB(tasks=tasks)
evaluation.run(model) |
This task part of Also currently I'm fixing tests |
I think I can reuse what we submitted to MTEB on MIRACL; it is the same task and should give the same results. |
Currently in |
that's strange, now i'm re-running |
Adding the fix for the tests (#42). Then I believe all tests should pass.
This is also logged in the model_meta.json as well as the file structure |
I wasn't aware it takes that much time (I'm only using 1 GPU). I'll patch the PR once it's ready. |
@Samoed @KennethEnevoldsen @isaac-chung i patched the MIRACLRetrieval.json to the PR :) |
The only thing left. You should run |
@bwanglzu thanks for adding these! In terms of MMTEB multilingual results, they look complete. I'll merge this once the tests pass.
@Samoed this PR is not intended for the RU benchmark, so I do not expect those results to be added here. Also, after adding the jina-v3 model path and running results.py, the updated paths.json contains extra changes that are outside of jina-v3. Would you mind looking into that? We can add jina-v3 into paths in a separate PR.
@KennethEnevoldsen @Muennighoff or whoever has admin rights, would you mind taking a look at changing the required checks? |
Maybe GitHub tries to merge different PRs when checking, because in #40 the tests only run on Python 3.9 |
Looks like it merged. Thanks @bwanglzu again for the PR, and all for the discussion! Note that in this PR I've also updated tests to only run with Python 3.9 at the moment. |
This PR adds jina-embeddings-v3's evaluation results (MMTEB, beta). The evaluation was conducted with the latest checkpoint, 215a6e121fa0183376388ac6b1ae230326bfeaed.