[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [syndication] Sneak Preview: my.info
On Fri, Jul 14, 2000 at 02:09:48PM +0200, Daniel Docekal wrote:
> Well, that's a kind of pain that fact that our language needs something more
> than pure ascii and we fight with it in computers since they were created
> long time ago.
>
> I finally got time to take a look what is happening. The first thing is good
> - you are not chaning anything that has been received from RSS feed. But
> what's wrong - you are explicitly telling that HTML page (at my.theinfo.org)
> is in 8859-1 coding, and that's where it goes wrong.
It's not, see below.
>
> This is of course interesting - our feed is in Windows-1250 (Sort of Latin-2
> and very close to 8859-2). But what if i subscribe to feed in Big Chinese?
> When listed in one page (at my.theinfo.org) together, that would make big
> mess :(
Presuming you correctly declare the encoding in your RSS file's XML
declaration, and Aaron's parser can handle that encoding, the XML parser
in will grok it OK and convert it into Unicode. Aaron's code then
translates that Unicode into ISO-8859-1 explicitly for publication on
his web page. Thus his indication of the ISO-8859-1 as the charset is
still correct.
Of course, it will probably still all blow up horribly if the equivalent
characters aren't available in ISO-8859-1. The only sure way is to for
Aaron to make his web page use the Unicode encoding.
-- Edd