What's left for 1.0?
BCS
BCS at pathlink.com
Fri Nov 17 12:04:16 PST 2006
Nahon wrote:
> Marcin Kuszczak wrote:
>
>> I think that one thing which is missed in phobos right now is string
>> class
>> which encapsulates utf-8/utf-16/utf-32 handling and issues connected with
>> utf-8 strings e.g.:
>>
>> char[] foo = "hög";
>> assert(foo.length == 3); // Sorry UTF-8, this is == 4
>> assert(foo[1] == 'ö'); // Not a chance!
>
>
> My Win version doesn't even start to parse the source file if an ASCII
> >127 character is present even if it is in a comment!
>
> If I create an a.d (ö is #f6):
> void main()
> {
> //hög
> }
>
> And then run:
> $ dmd a
>
> The result is:
> a.d(3): invalid UTF-8 sequence
>
> I think that would be nice to somehow tell the parser which format is
> the source file. It could be a command line parameter
> -encoding:ANSI|UTF-8|etc.
> or the first line of the file should contain that like
> //!Encoding: ANSI
>
> Regards,
> Nahon
http://www.digitalmars.com/d/lex.html
look for BOM
However, I don't known how to put in a BOM.
More information about the Digitalmars-d
mailing list