MixEval-X Image / Video #434
@@ -15,6 +15,7 @@
from lmms_eval.api.instance import Instance
from lmms_eval.api.model import lmms
from lmms_eval.api.registry import register_model
from lmms_eval.models.model_utils.load_video import read_video_pyav_pil |
This currently breaks, as `read_video_pyav_pil` doesn't exist. @pufanyi
Hello, sorry about this. It seems that this change was accidentally brought over from another branch. Thank you for pointing it out! I've reverted it in #447.
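For anyone who hits the broken import before picking up the revert, here is a minimal sketch of a guarded lookup. It assumes only the module path and helper name shown in the diff above; `optional_attr` is a hypothetical helper written for illustration and is not part of lmms_eval.

```python
import importlib


def optional_attr(module_name, attr_name):
    """Return module_name.attr_name, or None if the module or the name is missing.

    Hypothetical helper for illustration; not part of lmms_eval.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The module itself (or one of its dependencies) cannot be imported.
        return None
    # The module imported fine, but the name may still be absent.
    return getattr(module, attr_name, None)


# Module path and helper name are copied from the diff in this PR.
read_video_pyav_pil = optional_attr(
    "lmms_eval.models.model_utils.load_video", "read_video_pyav_pil"
)

if read_video_pyav_pil is None:
    print("read_video_pyav_pil is unavailable; skipping the PIL video path")
```

This only avoids the import-time crash. Any code path that actually calls the helper still needs it to exist, so the revert remains the real fix.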
* reformat mix eval
* video2text fix
* video
* Add image-to-text evaluation tasks and templates
* Refactor image-to-text evaluation tasks and update configurations
* Enhance LlamaVision model with video loading improvements and configuration updates
* fix internvl2
* llava vid default numframe too small
* add max_new_tokens
* remove "with a short phrase" for `gpt4v`
Image (LLaVA-1.5-13B)
official: 50.2
Video (LLaVA-Next-Video-7B)
official: 58.7