Implement PageAllocation as a handle into a PagedAttentionCache, allowing publishing and releasing an allocation via handle rather than cache #35
| Job | Run time |
|---|---|
| | 4m 27s |
| | 4m 6s |
| | 4m 35s |
| | 4m 58s |
| | 4m 12s |
| | 4m 1s |
| | 3m 27s |
| | 5m 25s |
| Total | 35m 11s |