Ezio Melotti wrote:
>
> Ezio Melotti <ezio.melotti@gmail.com> added the comment:
>
> Here is an incomplete patch. It seems to solve the problem but I still have to add more tests and check it better.
Thanks. Please also check whether it's worthwhile unrolling those
loops by hand.
> I also wonder if the sequences with the first byte in range F5-FD (start of 4/5/6-byte sequences, restricted by RFC 3629) should behave in the same way. Right now they just "eat" the following 4/5/6 chars without checking.
I think we need to do this all the way, even though 5 and 6 byte
sequences are not used at the moment.
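
For reference, here is a minimal Python sketch of the lead-byte ranges under discussion. It is not the patch itself, and the helper names (implied_length, allowed_by_rfc3629) are made up for illustration. It shows which sequence length each lead byte implies under the original 1-6 byte scheme, and which lead bytes RFC 3629 still permits.

```python
def implied_length(lead: int) -> int:
    """Return the sequence length implied by a lead byte under the
    original 1-6 byte UTF-8 scheme, or 0 if the byte cannot start
    a sequence at all. (Hypothetical helper, for illustration only.)"""
    if lead < 0x80:
        return 1          # ASCII
    if lead < 0xC0:
        return 0          # 80-BF: continuation byte, never a lead byte
    if lead < 0xE0:
        return 2          # C0-DF
    if lead < 0xF0:
        return 3          # E0-EF
    if lead < 0xF8:
        return 4          # F0-F7
    if lead < 0xFC:
        return 5          # F8-FB
    if lead < 0xFE:
        return 6          # FC-FD
    return 0              # FE, FF: never used in UTF-8


def allowed_by_rfc3629(lead: int) -> bool:
    """True if the byte may start a sequence under RFC 3629:
    ASCII, or C2-F4 (C0/C1 would only encode overlong forms,
    F5 and above would encode values past U+10FFFF)."""
    return lead < 0x80 or 0xC2 <= lead <= 0xF4


if __name__ == "__main__":
    # F4 is the last lead byte RFC 3629 allows; F5-FD still imply a
    # 4/5/6-byte length under the old scheme but are now invalid.
    for lead in (0xF4, 0xF5, 0xF8, 0xFC, 0xFE):
        print(hex(lead), implied_length(lead), allowed_by_rfc3629(lead))
```

The point of separating the two checks is that a decoder can reject F5-FD on the lead byte alone, without first consuming the number of bytes the old scheme would have implied.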