Model size check doesn't seem to work #39
Hi, is your original model in the .onnx file + weight file format? If the original format is a .onnx file plus a separate weight file, the saving operation will follow that format.
My model is just the .onnx file; there is no external weight file.
Do you quantize the model with this API? If so, during optimization it will save external data if the model size exceeds min_size.
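A minimal sketch of the kind of size check being described; the threshold value and the save logic here are assumptions for illustration, not the actual onnx_model.py implementation:

```python
import os
import onnx

# Hypothetical threshold; the real min_size in onnx_model.py may differ.
MIN_SIZE = 2 ** 20  # 1 MB, assumed for illustration

def save_model(model: onnx.ModelProto, path: str) -> None:
    # If the serialized model exceeds the threshold, write the weights
    # to a separate <name>_data file; otherwise keep a single file.
    if model.ByteSize() > MIN_SIZE:
        onnx.save_model(
            model,
            path,
            save_as_external_data=True,
            all_tensors_to_one_file=True,
            location=os.path.basename(path) + "_data",
        )
    else:
        onnx.save_model(model, path)
```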
Yes, I quantize using that API. Why is the size limit so low?
Because onnxruntime can load external data automatically, and we find that models in the external-data format can be processed faster. But given your case, the size limit may need tuning.
Even with my model, which is less than 250 KB, I still get the onnx_data file after quantization.
neural-compressor/onnx_neural_compressor/onnx_model.py, line 245 at commit aabbf96
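One way to confirm this behavior (an illustrative snippet, not part of the reporter's setup; the filename is a placeholder) is to load the quantized model without resolving external data and inspect its initializers:

```python
import onnx
from onnx.external_data_helper import uses_external_data

# Load the graph structure only; do not resolve external tensor files.
model = onnx.load("quantized_model.onnx", load_external_data=False)

# Collect initializers whose data lives in an external onnx_data file.
external = [t.name for t in model.graph.initializer if uses_external_data(t)]
print(f"{len(external)} initializers stored externally")
```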