Skip to content

Commit

Permalink
fix
Browse files Browse the repository at this point in the history
  • Loading branch information
pufanyi committed Mar 15, 2024
1 parent fde7593 commit d492c6f
Show file tree
Hide file tree
Showing 5 changed files with 20 additions and 21 deletions.
3 changes: 2 additions & 1 deletion lmms_eval/tasks/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -90,9 +90,10 @@ def include_task_folder(task_dir: str, register_task: bool = True) -> None:
# the user defines the appropriate verbosity.
except ModuleNotFoundError as e:
eval_logger.debug(f"{yaml_path}: {e}. Config will not be added to registry.")
print(f"{yaml_path}: {e}. Config will not be added to registry.")
except Exception as error:
import traceback

eval_logger.debug(f"Failed to load config in {yaml_path}. Config will not be added to registry\n" f"Error: {error}\n" f"Traceback: {traceback.format_exc()}")
return 0

Expand Down
12 changes: 0 additions & 12 deletions lmms_eval/tasks/mmbench/_default_template_mmbench.yaml

This file was deleted.

8 changes: 1 addition & 7 deletions lmms_eval/tasks/mmbench/_default_template_mmbench_cn.yaml
Original file line number Diff line number Diff line change
@@ -1,28 +1,22 @@
dataset_path: lmms-lab/MMBench
dataset_kwargs:
token: True
doc_to_target: "answer"
dataset_name: "cn"
output_type: generate_until
doc_to_visual: !function cn_utils.mmbench_doc_to_visual
doc_to_text: !function cn_utils.mmbench_doc_to_text
doc_to_target: "answer"
generation_kwargs:
max_new_tokens: 256
temperature: 0
top_p: 0
num_beams: 1
do_sample: false
process_results: !function cn_utils.mmbench_process_results
metadata:
version: 0.0
gpt_eval_model_name: "gpt-3.5-turbo"
quick_extract: true
sys_prompt: "有如下几个选项:"
model_specific_prompt_kwargs:
default:
pre_prompt: ""
post_prompt: "\n请直接使用所提供的选项字母作为答案回答。"
model_specific_generation_kwargs:
llava:
image_aspect_ratio: original
include: _default_template_mmbench.yaml
13 changes: 12 additions & 1 deletion lmms_eval/tasks/mmbench/_default_template_mmbench_en.yaml
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
dataset_path: lmms-lab/MMBench
dataset_kwargs:
token: True
doc_to_target: "answer"
model_specific_prompt_kwargs:
default:
pre_prompt: ""
Expand All @@ -10,5 +14,12 @@ model_specific_generation_kwargs:
llava:
image_aspect_ratio: original
output_type: generate_until
include: _default_template_mmbench.yaml
dataset_name: "en"
generation_kwargs:
until:
- "ASSISTANT:"
max_new_tokens: 1024
temperature: 0
top_p: 0
num_beams: 1
do_sample: false
5 changes: 5 additions & 0 deletions lmms_eval/tasks/mmbench/mmbench_cn.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,3 +3,8 @@ task:
- mmbench_cn_dev
- mmbench_cn_test
- mmbench_cn_cc
metadata:
version: 0.0
gpt_eval_model_name: "gpt-3.5-turbo"
quick_extract: true
sys_prompt: "有如下几个选项:"

0 comments on commit d492c6f

Please sign in to comment.