-
Notifications
You must be signed in to change notification settings - Fork 19
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
🦙 Llama3 on TGI - Jetstream Pytorch (#90)
* fix(engine_loader): correct n_reps and cache_shape settings * feat(tokenizer): donwload all json files when fetching model This will ensure that tokenizer_config.json is loaded if needed. * feat(jetstream pt): relax llama compatibility requirements This change will allow Llama3 models to be loaded. * test(jetstream pt): move Llama2-7b test to runslow/nightly * fix(jetstream pt): clone weight before mapping This is a workaround to avoid a core dump observed when testing on TinyLLama-v0 model. It should allow to prevent other similar problems later. This allows to add again the basic test (not slow) that will run on PRs and check Jetstream/Pytorch. * test(jetstream pt): add test showing support of Llama3-8B * review: fix imports for type checking * fix: correct type hint * fix(pyproject): correct jetstream git revision
- Loading branch information
1 parent
fa24cc4
commit b25e973
Showing
6 changed files
with
56 additions
and
20 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters