qwen2vl support #1042

eaidova · 2024-11-29T11:02:41Z

What does this PR do?

Enable qwen2vl models export and inference with openvino.
Example of usage:

import requests
from PIL import Image
from transformers import AutoProcessor

from optimum.intel.openvino import OVModelForVisualCausalLM


model_id = "Qwen/Qwen2-VL-2B-Instruct"
# Load the model in half-precision on the available device(s)
model = OVModelForVisualCausalLM.from_pretrained(model_id)
processor = AutoProcessor.from_pretrained(model_id)

# Image
url = "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VL/assets/demo.jpeg"
image = Image.open(requests.get(url, stream=True).raw)

conversation = [
    {
        "role": "user",
        "content": [
            {
                "type": "image",
            },
            {"type": "text", "text": "Describe this image."},
        ],
    }
]


# Preprocess the inputs
text_prompt = processor.apply_chat_template(conversation, add_generation_prompt=True)
# Excepted output: '<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\n<|vision_start|><|image_pad|><|vision_end|>Describe this image.<|im_end|>\n<|im_start|>assistant\n'

inputs = processor(text=[text_prompt], images=[image], padding=True, return_tensors="pt")

# Inference: Generation of the output
output_ids = model.generate(**inputs, max_new_tokens=10)
generated_ids = [output_ids[len(input_ids) :] for input_ids, output_ids in zip(inputs.input_ids, output_ids)]
output_text = processor.batch_decode(generated_ids, skip_special_tokens=True, clean_up_tokenization_spaces=True)
print(output_text)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2024-12-04T05:49:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

notebooks/openvino/sentence_transformer_quantization.ipynb

optimum/intel/openvino/modeling_visual_language.py

Co-authored-by: Nikita Savelyev <[email protected]>

AlexKoff88 · 2024-12-06T12:03:49Z

I don't have major comments. Thanks @eaidova!

@echarlaix, @IlyasMoutawwakil, the PR is ready for your review.

echarlaix

Thanks for the addition @eaidova !

optimum/intel/openvino/modeling_visual_language.py

optimum/exporters/openvino/model_patcher.py

optimum/exporters/openvino/model_configs.py

optimum/exporters/openvino/model_patcher.py

Add compression tests for qwen2-vl

optimum/exporters/openvino/model_configs.py

…into ea/qwen2vl

eaidova · 2024-12-17T15:00:31Z

@echarlaix could you please check one more time?
Thanks!

optimum/intel/openvino/modeling_visual_language.py

echarlaix · 2024-12-19T08:35:37Z

Awesome work, thanks a lot @eaidova

qwen2vl support

c26a450

AlexKoff88 requested review from echarlaix and AlexKoff88 December 3, 2024 06:45

eaidova force-pushed the ea/qwen2vl branch 2 times, most recently from ce7789f to 5af0206 Compare December 4, 2024 05:44

fix code style

973f155

eaidova force-pushed the ea/qwen2vl branch from 5af0206 to 973f155 Compare December 4, 2024 05:58

add test case

7af7cdc

eaidova force-pushed the ea/qwen2vl branch from 612475c to 7af7cdc Compare December 4, 2024 07:03

eaidova requested review from IlyasMoutawwakil and nikita-savelyevv December 5, 2024 08:04

nikita-savelyevv approved these changes Dec 5, 2024

View reviewed changes

nikita-savelyevv and others added 4 commits December 5, 2024 16:13

Added compression tests for qwen2-vl

02e6dc9

Remove trust_remote_code

14c186f

Apply suggestions from code review

163dadd

Co-authored-by: Nikita Savelyev <[email protected]>

revert changes in notebook

550d8a6

AlexKoff88 approved these changes Dec 6, 2024

View reviewed changes

echarlaix reviewed Dec 10, 2024

View reviewed changes

Merge pull request #4 from nikita-savelyevv/ns/qwen2vl-quant-test

bf44a19

Add compression tests for qwen2-vl

echarlaix reviewed Dec 16, 2024

View reviewed changes

optimum/exporters/openvino/model_configs.py Outdated Show resolved Hide resolved

optimum/exporters/openvino/model_configs.py Show resolved Hide resolved

eaidova added 4 commits December 17, 2024 18:21

apply review comments

2f92e6e

Merge branch 'main' into ea/qwen2vl

b6deb1a

add comments for patching

f6fdfba

Merge branch 'ea/qwen2vl' of https://github.com/eaidova/optimum-intel …

d7ba440

…into ea/qwen2vl

reuse original methods if possile

797f912

eaidova force-pushed the ea/qwen2vl branch from ad64bdf to 797f912 Compare December 17, 2024 15:01

eaidova requested a review from echarlaix December 17, 2024 15:04

eaidova commented Dec 18, 2024

View reviewed changes

optimum/intel/openvino/modeling_visual_language.py Show resolved Hide resolved

eaidova added 2 commits December 18, 2024 09:20

Update optimum/intel/openvino/modeling_visual_language.py

f791b94

fix typings in patchers

0eb94f5

echarlaix merged commit 93777ec into huggingface:main Dec 19, 2024
22 checks passed

nikita-savelyevv mentioned this pull request Dec 19, 2024

[OV] Fix qwen2-vl tests #1084

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qwen2vl support #1042

qwen2vl support #1042

eaidova commented Nov 29, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 4, 2024

AlexKoff88 commented Dec 6, 2024

echarlaix left a comment

eaidova commented Dec 17, 2024

echarlaix commented Dec 19, 2024

qwen2vl support #1042

qwen2vl support #1042

Conversation

eaidova commented Nov 29, 2024 • edited Loading

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Dec 4, 2024

AlexKoff88 commented Dec 6, 2024

echarlaix left a comment

Choose a reason for hiding this comment

eaidova commented Dec 17, 2024

echarlaix commented Dec 19, 2024

eaidova commented Nov 29, 2024 •

edited

Loading