[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Translate non-structured documents into Xml RSS format



Dave Winer <dave@userland.com> wrote:

> It could easily work in any other CMS, or even from HTML text. The key is
> that the author follow some regular pattern for news items, and then the
> script harvests the information from the source text.

Hmm, this sounds a lot like the work for ASCII->RSS[1] and XHTML->RSS[2]. If
anyone is interested in a service, I might be persuaded to program it for
them. However, both of these require specific formatting -- a more general
converter would be harder to do (and not as accurate). I'm also providing[3]
specific conversions to RSS for specific web sites which don't produce the
RSS files on their own.

[1] http://4xt.org/downloads/rss/text-to-10.txt
[2] http://4xt.org/downloads/rss/xhtml-to-10.txt
[3] http://my.theinfo.org

-- 
        Aaron Swartz         |"This information is top security.
<http://swartzfam.com/aaron/>|     When you have read it, destroy yourself."
  <http://www.theinfo.org/>  |             - Marshall McLuhan