why first_bytes plus one in path_trie.cpp #1806
Answered
by
reuben
zh794390558
asked this question in
Q&A
-
STT/native_client/ctcdecode/path_trie.cpp Lines 129 to 141 in f2e9c85 |
Beta Was this translation helpful? Give feedback.
Answered by
reuben
Mar 23, 2021
Replies: 1 comment
-
In bytes output mode, label values 0-254 correspond to byte values 1-255. Label value 255 is the CTC blank symbol. The +1 here is hardcoding that assumption into the decoder code. This should definitely have a comment explaining it, so thanks for raising. See also https://stt.readthedocs.io/en/latest/Decoder.html#bytes-output-mode |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
zh794390558
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
In bytes output mode, label values 0-254 correspond to byte values 1-255. Label value 255 is the CTC blank symbol. The +1 here is hardcoding that assumption into the decoder code. This should definitely have a comment explaining it, so thanks for raising.
See also https://stt.readthedocs.io/en/latest/Decoder.html#bytes-output-mode