On 3/23/2014 5:32 PM, Mike wrote:
> This example only considers encodings of up to 4 bytes, but UTF-8 can encode
> code points in as many as 6 bytes. Is that not a concern?

It's not anymore. The 5 and 6 byte encodings are now illegal.
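
For reference, RFC 3629 (2003) limits UTF-8 to code points up to U+10FFFF, so a valid sequence is at most 4 bytes long. Here is a rough sketch in Python of classifying a lead byte under that rule. The function name `utf8_sequence_length` is made up for illustration and is not taken from the example under discussion.

```python
def utf8_sequence_length(lead: int) -> int:
    """Return the sequence length implied by a UTF-8 lead byte.

    Returns 0 for bytes that cannot start a valid sequence under RFC 3629.
    Hypothetical helper, for illustration only.
    """
    if lead < 0x80:
        return 1  # 0xxxxxxx: ASCII
    if 0xC2 <= lead <= 0xDF:
        return 2  # 110xxxxx (0xC0 and 0xC1 could only start overlong forms)
    if 0xE0 <= lead <= 0xEF:
        return 3  # 1110xxxx
    if 0xF0 <= lead <= 0xF4:
        return 4  # 11110xxx (0xF5..0xF7 would exceed U+10FFFF)
    # Continuation bytes (0x80..0xBF), 0xC0, 0xC1, and 0xF5..0xFF.
    # This includes the old 5-byte (0xF8..0xFB) and 6-byte (0xFC..0xFD) leads.
    return 0
```

With this, `utf8_sequence_length(0xF8)` and `utf8_sequence_length(0xFC)` both return 0, so a decoder built on it only has to handle the 1 to 4 byte cases and can treat everything else as an error.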