Implement PageAllocation as a handle into a PagedAttentionCache, allowing publishing and releasing an allocation via handle rather than cache #46
| Job | Run time |
|---|---|
| | 4m 19s |
| | 4m 12s |
| | 4m 12s |
| | 4m 0s |
| | 4m 7s |
| | 4m 37s |
| | 3m 40s |
| | 4m 59s |
| Total | 34m 6s |