Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
There are equivalent tests now on the TinyLlama model, that run faster, use the KV cache and sharding. The only test that does not have an equivalence is the continuous batching one, but the test was not working for most other models, so I prefer to remove it anyway, as having it passing was not representative anyway of the current state.
- Loading branch information