-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Print properties of compile model #488
Print properties of compile model #488
Conversation
Thanks @wgzintel ! Could you run |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
Done @helena-intel |
Thanks for this PR @wgzintel this is really useful. It would be great if you could add a simple test to https://github.com/huggingface/optimum-intel/blob/main/tests/openvino/test_modeling.py. Nothing complicated, just that the function to print properties is run in the tests so we are alerted if it breaks at some point. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks for the addition @wgzintel
Co-authored-by: Ella Charlaix <[email protected]>
Co-authored-by: Ella Charlaix <[email protected]>
* print properties of compile model * Reference to benchmark_tool/openvino/tools/benchmark/main.py * format code * format code with ruff * print properties of compile model of diffusion and seq2seq * format code with ruff * Update optimum/intel/openvino/utils.py Co-authored-by: Ella Charlaix <[email protected]> * Update optimum/intel/openvino/utils.py Co-authored-by: Ella Charlaix <[email protected]> * refine code --------- Co-authored-by: Ella Charlaix <[email protected]>
print properties of compiled model:
Compiling the model to CPU ...
CPU SUPPORTED_PROPERTIES:
NETWORK_NAME: Model0
OPTIMAL_NUMBER_OF_INFER_REQUESTS: 1
NUM_STREAMS: 1
AFFINITY: Affinity.CORE
INFERENCE_NUM_THREADS: 96
PERF_COUNT: NO
INFERENCE_PRECISION_HINT: <Type: 'bfloat16'>
PERFORMANCE_HINT: LATENCY
EXECUTION_MODE_HINT: ExecutionMode.PERFORMANCE
PERFORMANCE_HINT_NUM_REQUESTS: 0
ENABLE_CPU_PINNING: True
SCHEDULING_CORE_TYPE: SchedulingCoreType.ANY_CORE
ENABLE_HYPER_THREADING: False
EXECUTION_DEVICES: CPU
CPU_DENORMALS_OPTIMIZATION: False
CPU_SPARSE_WEIGHTS_DECOMPRESSION_RATE: 1.0