-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The test results of lynx on the MSCOCO ITM task are questionable #1
Comments
Hello, thanks for trying our benchmark! I ran the command mentioned above and got the reasonable result in
Can you share some samples of the output json files? When you use |
Sorry for taking so long to reply. Recently, the machine broke down and I have not been able to do the experiment.
The
Maybe the pre-trained model of lynx is not loaded correctly. Are your LYNX Settings the same as mine?
Also, when I run it, it shows that the adapter parameters have been reinitialized. Is this normal?
|
Our ## Data
image_rdir: "./images/"
# put your test file in jsonl format
test_files: [ "./data/Open_VQA_images.jsonl" ]
# change this prompt for different task
prompt: "User: {question}\nBot:"
# the key must match the vision key in test_files
# if you test Open_VQA_videos.jsonl, need to change to "video"
vision_prompt_dict: "image"
output_prompt_dict: "answer"
data: {
num_frames: 5,
}
## Model
vision_encoder: 'eva_vit_1b'
video_encoding: 'concate'
add_frame_pos: True
LLM: 'vicuna-7b'
LLM_base: '/remote-home/share/LLM_CKPT/vicuna-7B-v1.1/'
use_flash_attn: False
use_adapter: True
adapter_freq: 2
bridge: 'resampler'
bridge_depth: 3
num_bridge_tokens: 32
## General
use_left_pad: True
lower_text: True
freeze_vit: True
freeze_llm: True
image_res: 224
image_mean: [ 0.48145466, 0.4578275, 0.40821073 ]
image_std: [ 0.26862954, 0.26130258, 0.27577711 ]
## Testing
checkpoint: "/remote-home/share/multimodal-models/lynx/finetune_lynx.pt"
## infer params
max_input_tokens: 40
batch_size_test: 16
max_new_tokens: 64
min_length: 2
num_beams: 5
length_penalty: -2.0
top_p: 0.9
top_k: 3
no_repeat_ngram_size: 2
apply_lemmatizer: False
use_nucleus_sampling: True Have you put the lynx repository in |
Yes, I clone it from https://github.com/bytedance/lynx-llm.git |
First of all, thank you for a great job! I ran into a few issues while following the tutorial to reproduce:
I first follow tutorial to emersion lynx ACC on MSCOCO_ITM task, that is, Table18 in the paper. I used the following command:
I used
generation
as the inference method, but the results I get were rather strange:If I use
likelihood
as the inference method, the results are still different from that in the paper:I'm at a loss to understand, and I hope you can help to point out where the problem may be.
The text was updated successfully, but these errors were encountered: