[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Translate non-structured documents into Xml RSS format



ben@ubiquick.com <ben@ubiquick.com> wrote:

> I would like to know if anybody has already worked on a bot that
> could grab unstructured documents and translate them into RSS format.

I'm not quite sure I follow. You mean a spider that would crawl the website
and output a channel with a listing of all the pages on that site? I've
never heard of such a thing, it does sound like an interesting possibility,
however.

What would you use this for, since the site map would rarely change (making
it not very useful for news)?

-- 
        Aaron Swartz         |"This information is top security.
<http://swartzfam.com/aaron/>|     When you have read it, destroy yourself."
  <http://www.theinfo.org/>  |             - Marshall McLuhan