-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
instance level metric outputs #6
Comments
Can you expand on what you mean by this? You can interact with the model right now on Huggingface if that helps https://huggingface.co/vectara/hallucination_evaluation_model |
Ah sorry I meant the outputs of the hallucination evaluation model for each instance, e.g. a new column with the model's output in this file: https://github.com/vectara/hallucination-leaderboard/blob/main/leaderboard_summaries.csv Would love to dive in and compare the summarization models more in depth, similar to reports we've published recently on the HF leaderboard and Whisper transcription models: https://twitter.com/gneubig/status/1724872160144171104 |
Hello, I'm sorry I didn't see this earlier. Tragically, Simon passed away over Thanksgiving, and other members of the team are picking this up. We'll try to get the new column added soon. |
Oh no, I'm so sorry! Best wishes to the family and team. Of course, no rush at all! |
This is fantastic work!
I was wondering if you all could release the instance-level outputs from the analysis. We'd love to visualize the results using Zeno
The text was updated successfully, but these errors were encountered: