> I've got some XML that contains non ASCII chars in CDATA sections, both IE6
> and the XML library I'm using (perl XML::Simple) spew when they get to the
> non ASCII chars.
> The default encoding is UTF-8 right? how do I know if a char is UTF-8 or an
RIGHT
Quote:> invalid binary char?
SEE TABLE BELOW
THE PARSER MUST TELL IT FOR YOU (YOU DON'T HAVE TO CARE ABOUT THAT)
Quote:> one of the characters it is failing on is a degree (as in degrees Celsius)
> symbol, will I have to create an entity reference for this or use the
> Unicode   format .
USING AN ENTITY THAT REFERS TO UNICODE CHAR OR THAT CHAR DIRECTLY IS THE
SAME
Hi Joe,
  stands for non-breaking space
° stands for degree char
Unicode address UTF-8 sequence
from 0000 0000 to 0000 007F 0xxxxxxx
from 0000 0080 to 0000 07FF 110xxxxx 10xxxxxx
from 0000 0800 to 0000 FFFF 1110xxxx 10xxxxxx 10xxxxxx
from 0001 0000 to 001F FFFF 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
Notice that an UTF-8 encoding document can't contains FE and FF bytes
see RFC2279 for more
I think you should specify which encoding you use in your document, even
if it is UTF-8
--
Cordialement,
///
(. .)
-----ooO--(_)--Ooo-----
| Philippe Poulard |
-----------------------