Today's programming challenge - How's your Range-Fu ?

Walter Bright via Digitalmars-d digitalmars-d at puremagic.com
Sat Apr 18 03:27:08 PDT 2015


On 4/18/2015 1:26 AM, Panke wrote:
> On Saturday, 18 April 2015 at 08:18:46 UTC, Walter Bright wrote:
>> On 4/18/2015 12:58 AM, John Colvin wrote:
>>> On Friday, 17 April 2015 at 18:41:59 UTC, Walter Bright wrote:
>>>> On 4/17/2015 9:59 AM, H. S. Teoh via Digitalmars-d wrote:
>>>>> So either you have to throw out all pretenses of Unicode-correctness and
>>>>> just stick with ASCII-style per-character line-wrapping, or you have to
>>>>> live with byGrapheme with all the complexity that it entails. The former
>>>>> is quite easy to write -- I could throw it together in a couple o' hours
>>>>> max, but the latter is a pretty big project (cf. Unicode line-breaking
>>>>> algorithm, which is one of the TR's).
>>>>
>>>> It'd be good enough to duplicate the existing behavior, which is to treat
>>>> decoded unicode characters as one column.
>>>
>>> Code points aren't equivalent to characters. They're not the same thing in most
>>> European languages,
>>
>> I know a bit of German, for what characters is that not true?
>
> Umlauts, if combined characters are used. Also words that still have their
> accents left after import from foreign languages. E.g. Café

That doesn't make sense to me, because the umlauts and the accented e all have 
Unicode code point assignments.



More information about the Digitalmars-d mailing list