Re: Translate non-structured documents into Xml RSS format
- To: <syndication@egroups.com>
- Subject: Re: Translate non-structured documents into Xml RSS format
- From: Aaron Swartz <aswartz@swartzfam.com>
- Date: Mon, 25 Sep 2000 18:32:20 -0500
- In-reply-to: <080101c0273f$49587250$33a1dc40@murphy2>
- User-agent: Microsoft-Outlook-Express-Macintosh-Edition/5.02.2022
Dave Winer <dave@userland.com> wrote:
> It could easily work in any other CMS, or even from HTML text. The key is
> that the author follow some regular pattern for news items, and then the
> script harvests the information from the source text.
Hmm, this sounds a lot like the work on ASCII->RSS[1] and XHTML->RSS[2]. If
anyone is interested in a service, I might be persuaded to program it for
them. However, both of these require specific formatting -- a more general
converter would be harder to write (and less accurate). I'm also providing[3]
conversions to RSS for specific web sites which don't produce RSS files on
their own.
[1] http://4xt.org/downloads/rss/text-to-10.txt
[2] http://4xt.org/downloads/rss/xhtml-to-10.txt
[3] http://my.theinfo.org
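
To make the "regular pattern" idea concrete, here is a rough sketch in Python.
It is not the format used by [1] or [2]; the layout is an assumption made up
for illustration: items are separated by blank lines, the first line of each
item is the title, the second is the link, and anything after that is the
description. The function names (parse_items, to_rss) and the example.com
URLs are likewise hypothetical.

```python
import re
import xml.etree.ElementTree as ET


def parse_items(text):
    """Harvest items from text that follows the assumed pattern:
    blank-line-separated blocks of title, link, then description lines."""
    items = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        if len(lines) < 2:
            continue  # block does not follow the pattern, so skip it
        items.append({
            "title": lines[0],
            "link": lines[1],
            "description": " ".join(lines[2:]),
        })
    return items


def to_rss(items, channel_title, channel_link):
    """Serialize the harvested items as a minimal RSS-style document."""
    rss = ET.Element("rss", version="0.91")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = channel_title
    ET.SubElement(channel, "link").text = channel_link
    for item in items:
        node = ET.SubElement(channel, "item")
        for field in ("title", "link", "description"):
            if item[field]:  # leave out empty descriptions
                ET.SubElement(node, field).text = item[field]
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    sample = (
        "First story\nhttp://example.com/1\nSome text\nmore text\n\n"
        "Second story\nhttp://example.com/2\n"
    )
    print(to_rss(parse_items(sample), "Example news", "http://example.com/"))
```

Blocks that don't match the pattern are silently dropped here, which is the
accuracy trade-off mentioned above: the stricter the expected formatting, the
simpler and more reliable the harvesting script can be.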
--
Aaron Swartz |"This information is top security.
<http://swartzfam.com/aaron/>| When you have read it, destroy yourself."
<http://www.theinfo.org/> | - Marshall McLuhan