Implement PageAllocation as a handle into a PagedAttentionCache, allowing publishing and releasing an allocation via handle rather than cache #52
Job | Run time |
---|---|
5m 58s | |
4m 18s | |
4m 39s | |
4m 8s | |
4m 21s | |
4m 23s | |
4m 53s | |
5m 7s | |
37m 47s |
Job | Run time |
---|---|
5m 58s | |
4m 18s | |
4m 39s | |
4m 8s | |
4m 21s | |
4m 23s | |
4m 53s | |
5m 7s | |
37m 47s |