hotfix - Revert vllm/attention/layer.py changes from 0f8cafe - fix torch.compile recompilations #518
Job | Run time |
---|---|
3m 2s | |
4s | |
6s | |
8m 39s | |
1h 0m 44s | |
8m 13s | |
11m 39s | |
8m 32s | |
7m 10s | |
8m 28s | |
8m 14s | |
4m 18s | |
9m 37s | |
7m 58s | |
9m 24s | |
9m 39s | |
4m 46s | |
4m 24s | |
5m 9s | |
8m 15s | |
8m 36s | |
10m 3s | |
3h 27m 0s |