Tensor dtype error #72

sreedattaSanjay · 2024-03-07T11:01:11Z

Hello
I'm trying to use the bert_mha_layernorm_fuse() function in the examples.py

The problem is when I'm trying to infer the fused model I'm getting a dtype mismatch error
After a little debugging, I found out that in Tensor() class if the instance of tensor is str
you are automatically assigning dtype as numpy.float32

from .node import Node
if isinstance(t, str):
self.name = t
self.proto = None
self.shape = []
self.numpy = None
self.type = DYNAMIC_TENSOR if t != '' else STATIC_TENSOR
self.dtype = numpy.float32

Due to this output tensors that are Dynamic and do not have any value are loaded as float32 tensors

In my case, the output tensor dtype should be int but it is float32

onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : Load model from bertsquad_mha_layernorm.onnx failed:Type Error: Type (tensor(int32)) of output arg (bert/encoder/Shape:0) of node (bert/encoder/Shape) does not match expected type (tensor(int64)).

Here is the original bertsquad-12 model

And here is the fused model

Can you please let me know how to resolve this issue? Is there a function that lets us save model with proper datatypes

sreedattaSanjay · 2024-03-07T11:02:33Z

Is there any script in your local testing that lets us infer the models you have created in the examples.py?
If yes can you please let us use it
Thank you

ThanatosShinji · 2024-03-09T05:12:46Z

The link from the README is out of date, you can use this link to download the BERT model

ThanatosShinji added the question Further information is requested label Mar 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tensor dtype error #72

Tensor dtype error #72

sreedattaSanjay commented Mar 7, 2024

sreedattaSanjay commented Mar 7, 2024

ThanatosShinji commented Mar 9, 2024

Tensor dtype error #72

Tensor dtype error #72

Comments

sreedattaSanjay commented Mar 7, 2024

sreedattaSanjay commented Mar 7, 2024

ThanatosShinji commented Mar 9, 2024