[Feature]: Compile warmup take too long #201

Open
Zjq9409 opened this issue Aug 26, 2024 · 2 comments
Labels: intel (Issues or PRs submitted by Intel), stale

Comments

Zjq9409 commented Aug 26, 2024

🚀 The feature, motivation and pitch

INFO 08-26 07:31:47 habana_model_runner.py:1192] [Warmup][Prompt][1/56] batch_size:64 seq_len:1024 free_mem:13.93 GiB
INFO 08-26 07:32:25 habana_model_runner.py:1192] [Warmup][Prompt][2/56] batch_size:64 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:32:59 habana_model_runner.py:1192] [Warmup][Prompt][3/56] batch_size:64 seq_len:768 free_mem:13.86 GiB
INFO 08-26 07:33:27 habana_model_runner.py:1192] [Warmup][Prompt][4/56] batch_size:64 seq_len:640 free_mem:13.86 GiB
INFO 08-26 07:33:52 habana_model_runner.py:1192] [Warmup][Prompt][5/56] batch_size:32 seq_len:1024 free_mem:13.86 GiB
INFO 08-26 07:34:08 habana_model_runner.py:1192] [Warmup][Prompt][6/56] batch_size:64 seq_len:512 free_mem:13.86 GiB
INFO 08-26 07:34:25 habana_model_runner.py:1192] [Warmup][Prompt][7/56] batch_size:32 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:34:40 habana_model_runner.py:1192] [Warmup][Prompt][8/56] batch_size:32 seq_len:768 free_mem:13.86 GiB
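
To put numbers on the log above, here is a minimal sketch that parses the quoted lines and computes the gap between consecutive warmup messages. The year 2024 is taken from the issue date, and the names `parse_warmup` and `step_gaps` are made up for this example. Attributing each gap to the bucket on the earlier line assumes the message is logged before that bucket is warmed up, which the log itself does not confirm.

```python
import re
from datetime import datetime

# The eight log lines quoted in this issue, verbatim.
LOG = """\
INFO 08-26 07:31:47 habana_model_runner.py:1192] [Warmup][Prompt][1/56] batch_size:64 seq_len:1024 free_mem:13.93 GiB
INFO 08-26 07:32:25 habana_model_runner.py:1192] [Warmup][Prompt][2/56] batch_size:64 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:32:59 habana_model_runner.py:1192] [Warmup][Prompt][3/56] batch_size:64 seq_len:768 free_mem:13.86 GiB
INFO 08-26 07:33:27 habana_model_runner.py:1192] [Warmup][Prompt][4/56] batch_size:64 seq_len:640 free_mem:13.86 GiB
INFO 08-26 07:33:52 habana_model_runner.py:1192] [Warmup][Prompt][5/56] batch_size:32 seq_len:1024 free_mem:13.86 GiB
INFO 08-26 07:34:08 habana_model_runner.py:1192] [Warmup][Prompt][6/56] batch_size:64 seq_len:512 free_mem:13.86 GiB
INFO 08-26 07:34:25 habana_model_runner.py:1192] [Warmup][Prompt][7/56] batch_size:32 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:34:40 habana_model_runner.py:1192] [Warmup][Prompt][8/56] batch_size:32 seq_len:768 free_mem:13.86 GiB
"""

# Timestamp, source location, then the [Warmup][phase][i/n] tag and the shape.
LINE_RE = re.compile(
    r"^INFO (\d\d-\d\d \d\d:\d\d:\d\d) \S+ "
    r"\[Warmup\]\[(\w+)\]\[(\d+)/(\d+)\] batch_size:(\d+) seq_len:(\d+)"
)


def parse_warmup(log, year=2024):
    """Return (timestamp, batch_size, seq_len) for each warmup line.

    The log carries no year, so one is supplied (assumed from the issue date).
    """
    rows = []
    for line in log.splitlines():
        m = LINE_RE.match(line.strip())
        if m is None:
            continue
        stamp = datetime.strptime(f"{year}-{m.group(1)}", "%Y-%m-%d %H:%M:%S")
        rows.append((stamp, int(m.group(5)), int(m.group(6))))
    return rows


def step_gaps(rows):
    """Seconds elapsed between each warmup line and the next one."""
    return [int((b[0] - a[0]).total_seconds()) for a, b in zip(rows, rows[1:])]


if __name__ == "__main__":
    rows = parse_warmup(LOG)
    for (_, batch_size, seq_len), gap in zip(rows, step_gaps(rows)):
        print(f"batch_size={batch_size:3d} seq_len={seq_len:5d} gap={gap:3d}s")
```

On the eight lines quoted, the gaps are 38, 34, 28, 25, 16, 17 and 15 seconds, 173 seconds in total for the first seven of 56 prompt steps.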

If graph compilation runs on the CPU, could the graphs be compiled with multiple threads? During warmup, CPU utilization on my machine is very low.

Alternatives

No response

Additional context

No response

@mswiniarsk

@Zjq9409 Yes, we are planning to reduce warmup time in several ways:

  • decreasing compilation time, so that a single graph takes less time
  • a more sophisticated way of selecting buckets, so that we do not compile graphs that are not actually needed
  • parallel compilation
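
As a rough illustration of the bucket-selection point only (this is not the maintainers' implementation), the sketch below warms up just the buckets that a set of expected request shapes would round up into. The grid is an assumption: batch sizes 1 to 64 in powers of two and sequence lengths 128 to 1024 in steps of 128. That grid happens to give 56 combinations, the same count as in the log, but the thread does not confirm it is the real one. The names `round_up` and `needed_buckets` are hypothetical.

```python
from bisect import bisect_left

# Assumed bucket grid; not taken from the project source.
BATCH_SIZES = [1, 2, 4, 8, 16, 32, 64]
SEQ_LENS = list(range(128, 1025, 128))  # 128, 256, ..., 1024


def round_up(value, steps):
    """Return the smallest entry of the sorted list `steps` that is >= value,
    or None if value is larger than every entry."""
    i = bisect_left(steps, value)
    return steps[i] if i < len(steps) else None


def needed_buckets(observed_shapes):
    """Map (batch_size, seq_len) request shapes to the buckets they fall into.

    Shapes that exceed the grid are skipped. The result is sorted by
    batch_size * seq_len, largest first.
    """
    needed = set()
    for batch_size, seq_len in observed_shapes:
        b = round_up(batch_size, BATCH_SIZES)
        s = round_up(seq_len, SEQ_LENS)
        if b is not None and s is not None:
            needed.add((b, s))
    return sorted(needed, key=lambda p: p[0] * p[1], reverse=True)


if __name__ == "__main__":
    full_grid = [(b, s) for b in BATCH_SIZES for s in SEQ_LENS]
    shapes = [(3, 200), (60, 1000), (4, 256), (70, 100)]  # made-up traffic
    selected = needed_buckets(shapes)
    print(f"full grid: {len(full_grid)} buckets, selected: {len(selected)}")
    print(selected)
```

With the made-up shapes above, only two of the 56 assumed buckets would need a warmup pass. Where the expected shapes would come from (a traffic trace, a config limit, or something else) is left open here.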

@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@kzawora-intel kzawora-intel added the intel Issues or PRs submitted by Intel label Aug 29, 2024

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

@github-actions github-actions bot added the stale label Nov 28, 2024