Commit

llama tokenizer
Andrei Panferov committed Jan 21, 2024
1 parent ebbeaba commit 36a8308
Showing 5 changed files with 93,456 additions and 0 deletions.
7 changes: 7 additions & 0 deletions transformers/llama/generation_config.json
@@ -0,0 +1,7 @@
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 0,
  "transformers_version": "4.36.2"
}
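
As a quick sanity check on the file above, the following minimal sketch parses the same JSON with Python's standard library and reads back the special token IDs. The inlined string `GENERATION_CONFIG` and the helper `special_token_ids` are illustrative assumptions, not part of this commit; in practice the text would be read from `transformers/llama/generation_config.json`.

```python
import json

# Contents of generation_config.json as added in this commit,
# inlined here so the sketch runs without the repository checked out.
GENERATION_CONFIG = """
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 0,
  "transformers_version": "4.36.2"
}
"""


def special_token_ids(config_text: str) -> dict:
    """Return only the *_token_id entries from a generation config JSON string.

    Hypothetical helper for illustration; it is not a transformers API.
    """
    config = json.loads(config_text)
    return {key: value for key, value in config.items() if key.endswith("_token_id")}


ids = special_token_ids(GENERATION_CONFIG)
print(ids)
```

The three IDs are distinct here, so padding positions can be told apart from BOS and EOS by ID alone.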
23 changes: 23 additions & 0 deletions transformers/llama/special_tokens_map.json
@@ -0,0 +1,23 @@
{
  "bos_token": {
    "content": "<s>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "</s>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "<unk>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}
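
The structure of the map above can be inspected with a short sketch like the following, which extracts each token's `content` string and checks that every boolean flag is `false`. The inlined string `SPECIAL_TOKENS_MAP` and the helpers `token_contents` and `all_flags_false` are hypothetical names introduced for illustration; only the JSON itself comes from this commit.

```python
import json

# Contents of special_tokens_map.json as added in this commit, inlined for illustration.
SPECIAL_TOKENS_MAP = """
{
  "bos_token": {"content": "<s>", "lstrip": false, "normalized": false,
                "rstrip": false, "single_word": false},
  "eos_token": {"content": "</s>", "lstrip": false, "normalized": false,
                "rstrip": false, "single_word": false},
  "unk_token": {"content": "<unk>", "lstrip": false, "normalized": false,
                "rstrip": false, "single_word": false}
}
"""

FLAG_NAMES = ("lstrip", "normalized", "rstrip", "single_word")


def token_contents(map_text: str) -> dict:
    """Map each special token role (e.g. 'bos_token') to its literal string."""
    tokens = json.loads(map_text)
    return {role: spec["content"] for role, spec in tokens.items()}


def all_flags_false(map_text: str) -> bool:
    """Check that no token enables stripping, normalization, or single-word matching."""
    tokens = json.loads(map_text)
    return all(spec[flag] is False for spec in tokens.values() for flag in FLAG_NAMES)


contents = token_contents(SPECIAL_TOKENS_MAP)
print(contents)
print(all_flags_false(SPECIAL_TOKENS_MAP))
```

Note that the map defines no `pad_token` entry, even though `generation_config.json` sets `pad_token_id` to 0; how padding is resolved depends on tokenizer files not shown in this excerpt.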