Rare Tokens #1779
Closed
JustAnOkapi
started this conversation in
General
Rare Tokens
#1779
Replies: 4 comments 3 replies
-
the text encoder and vocab.json go together I think, you can't simply add a token to vocab |
Beta Was this translation helpful? Give feedback.
0 replies
-
Ill try it
|
Beta Was this translation helpful? Give feedback.
0 replies
-
oh yea to update, |
Beta Was this translation helpful? Give feedback.
1 reply
-
trying again with |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
https://gist.github.com/JustAnOkapi/80be6959476599f1ab899f948a140eba
I filtered and organized the list of all tokens sd can read. It includes longer than one token words (merged).
wouldnt the best training option be to add a word to vocab.json that it doesnt know?
the word ive seen used ohwx is number 943, it is known just not very well.
Beta Was this translation helpful? Give feedback.
All reactions