-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Potentially confusing API next16
when operating on values with sizes < 2
#126
Comments
I guess it would help to issue a compiler error if the value type is not the right size. I am not sure how I accomplish that, though. Any ideas? |
What makes this slightly "ugly" is that template<typename word_iterator>
typename std::enable_if<sizeof(decltype(*std::declval<word_iterator>())) == sizeof(char16_t), utfchar32_t>::type next16(
word_iterator& it, word_iterator end) {
... With this one can only invoke |
My goal is to keep the core of the library C++ 98 compliant. I played a little with SFINAE but nowadays I am not a fan of template magic. But to take a step back - how did you even end up with UTF-16 encoding in std::string? |
We are using
Ah, I didn't realize that when looking at the test suite. If you believe this is not worthwhile fixing, please feel free to close this issue. It is not impossible to work around, just a potential gotcha. |
Actually, I just discovered std::iterator_traits::value_type |
Added static asserts for the size of UTF-16 code units: 65701fe |
I am implementing a custom error handling strategy for converting from UTF-16 to UTF-8. My raw bytes are held in a
std::string
.I naively attempted to iterate this
std::string
by usingnext16
. This fails since the iteration undernext16
now iterates single bytes instead of pairs.With the right warnings enabled this raises a sign-conversion warning in the bowels of utfcpp. I however naively expected that
next16
would already pick the right iteration strategy (it could in principle detect the sizes of the values behind theword_iterator
passed tonext16
, but does not).I get the correct behavior if I additionally wrap the string in a
u16string_view
which ensure the iterator advances like expected bynext16
.I think it would be great if utfcpp could either automatically pick the right iteration strategy based on the value type behind its iterators, or at least reject cases where an iterator over values of smaller size is passed than is expected.
The text was updated successfully, but these errors were encountered: