You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The problem is that we're not getting enough votes. People use LMSys ChatbotArena, because it is pretty useful in itself, e.g. to play with models you cannot access & help you solve problems. For our arena this is more difficult as we have fixed corpora and people cannot easily add their own large corpus so it is more constrained.
@shaoyijia suggested incentivizing more people to vote by making it actually useful, e.g. maybe it could be a research/learning partner. For example, if we sell it as a better arxiv search than the native support of arxiv (most people think arxiv search support is bad), people may be curious to try and have more incentive to vote if we also show the top-k recommendation of the winner models you choose to help you know more. Currently, people may not have a lot of incentive to vote/play if they just see the below.
(paraphrasing Yijia's comments here)
Some concrete things we can do:
Add arXiv abs or pdf links to the search result so people can go read the paper
Show top-k results if user asks for it (maybe some way to expand results in the UI)
Maybe improve interface to ease search
Does someone have other concrete ideas?
The text was updated successfully, but these errors were encountered:
I think this is great! These changes could also work for wikipedia as well.
We could have it highlight the answer in the retrieved document either using a specific model (this does introduce a bias), alternatively we could also do it by embedding segments on the answer and see which segments are the best match. This is a non-trivial change though.
The problem is that we're not getting enough votes. People use LMSys ChatbotArena, because it is pretty useful in itself, e.g. to play with models you cannot access & help you solve problems. For our arena this is more difficult as we have fixed corpora and people cannot easily add their own large corpus so it is more constrained.
@shaoyijia suggested incentivizing more people to vote by making it actually useful, e.g. maybe it could be a research/learning partner. For example, if we sell it as a better arxiv search than the native support of arxiv (most people think arxiv search support is bad), people may be curious to try and have more incentive to vote if we also show the top-k recommendation of the winner models you choose to help you know more. Currently, people may not have a lot of incentive to vote/play if they just see the below.
(paraphrasing Yijia's comments here)
Some concrete things we can do:
The text was updated successfully, but these errors were encountered: