Ping: Daniel Keep

Thomas Kuehne thomas-dloop at kuehne.cn
Sun Jan 7 16:18:35 PST 2007


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Daniel Keep wrote on 2007-01-07:
> Thomas Kuehne wrote:
>> Daniel Keep wrote on 2007-01-07:

>>>http://www.prowiki.org/wiki4d/wiki.cgi?DanielKeep/TextInD

>> The following sentence is incorrect:
>> #
>> # In fact, it does, but there's a teensy problem that some Unicode
>> # 'enabled' editors have: they forget the Byte Order Mark.
>> #
>> 
>> D doesn't require a BOM, and the presence of BOMs is application/system
>> defined (-> Unicode.org). More often than not the user simply used the
>> "save as text" feature. Especially on MS Windows, most editors use some
>> installation-dependent codepage instead of UTF unless asked explicitly to
>> store the text as Unicode.
>> 
>> The only situation where a BOM is required is documented here:
>> http://d.puremagic.com/issues/show_bug.cgi?id=430

> Thanks for the heads up.  Is this accurate?

<snip>

Yes, apart from this section:
>   The Byte Order Mark (or BOM) is a special character sequence at the 
> beginning of any UTF text file that tells the application which UTF 
> encoding is being used, and in some cases what the byte order is (ie: 
> Little Endian/Big Endian.)

Any UTF text file _may_ start with a BOM. On Windows systems the
convention is to start with a BOM; on *nix systems the convention is to
start without one.
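
Since a BOM is optional, a reader of UTF text has to handle both cases.
A minimal sketch in Python, not taken from any D tool: the names
sniff_bom and decode_source are made up for illustration, and falling
back to UTF-8 when no BOM is found is an assumption of this sketch, not
a statement about what a particular compiler or editor does.

```python
import codecs

# Checked in this order because the UTF-32 LE BOM (FF FE 00 00) begins
# with the same two bytes as the UTF-16 LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data: bytes):
    """Return (encoding, bom_length), or (None, 0) when no BOM is present."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return None, 0


def decode_source(data: bytes, default: str = "utf-8") -> str:
    """Decode text, honouring a BOM if there is one.

    Assumption: text without a BOM is decoded with `default`.
    """
    encoding, skip = sniff_bom(data)
    return data[skip:].decode(encoding or default)
```

With this sketch, decode_source(b"\xef\xbb\xbfint x;") and
decode_source(b"int x;") both give "int x;". The first input follows the
Windows convention, the second the *nix one.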

Thomas


-----BEGIN PGP SIGNATURE-----

iD8DBQFFoZp/LK5blCcjpWoRAvNYAJ9dxmYue0bO198RwI1PthHDFq70GwCfTrJX
XhjpCOeBoFCddefVXhkeL7c=
=Lj9W
-----END PGP SIGNATURE-----



More information about the Digitalmars-d mailing list