toString issue

Derek Parnell derek at nomail.afraid.org
Mon Oct 2 00:11:40 PDT 2006


On Mon, 02 Oct 2006 00:52:44 -0600, Hasan Aljudy wrote:

> Sean Kelly wrote:
>> 
>> How about toUtf8() for classes and structs :-)
>> 
>> Sean
> 
> I think there's a fundamental problem with the way D deals with strings.
> The spec claims that D natively supports strings through char[], at the 
> same time, claims that D fully supports Unicode.
> The fundamental issue is that UTF-8 is one encoding for Unicode strings, 
> but it's not always the best choice. Phobos mostly only deals with 
> char[], and mixing code that uses wchar[] with code that uses char[] 
> isn't very straight forward.
> 
> Consider the simple case of reading a text file and detecting "words". 
> To detect a word, you must first recognize letters, no .. not English 
> letters; letters of any language, and for that purpose, we have 
> isUniAlpha function. Now, If you encode the string as char[], then how 
> are you gonna determine whether or not the next character is a Unicode 
> alpha or not?
> 
> The following definitely shouldn't work:
> //assuming text is char[]
> for( int i = 0; i < text.length; i++ )
> {
>      bool isLetter = isUniAlpha( text[i] );
>      ....
> }

  foreach(int i, dchar c; text)
  {
       bool isLetter = isUniAlpha( c );
       ...
  }


-- 
Derek
(skype: derek.j.parnell)
Melbourne, Australia
"Down with mediocrity!"
2/10/2006 5:10:26 PM



More information about the Digitalmars-d mailing list