[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [syndication] Entities and Char sets



2001-11-27 16:04:58, Julian Bond <julian_bond@voidstar.com>:

> But when you're trying to get the content
> down an XML-RPC pipe and the toolkits don't interoperate
> because they barf on whitespace between tags or tell you
> the name-value is missing when actually it's just empty,

Just curious, could you name a few toolkits that barf on
whitespace between tags? (I'm not sure what you mean by 'name-
value' missing.)

> So what is the *correct* way of dealing with reserved and
> high order characters in RSS content, such that the maximum
> number of clients can read it and get the content to the
> reader?

Use the 'UTF-8' encoding. Write all ASCII characters directly,
and write all other characters (i.e. all characters at
codepoint > 127) as decimal character references. Then I
believe you should be pretty safe. Simplified example:

<?xml version="1.0" encoding="UTF-8"?>

<title>Trademark symbol: &#8482;</title>

-- 
Karl Ove Hufthammer