
Re: [syndication] Contacting Aggregators



Steven Roussey <sroussey@network54.com> wrote:

> The thing is, I'd like to warn our aggregators first, but I don't know who
> they are. Is there already a mechanism for aggregators to register that I
> have yet to set up? Or should aggregators send a courtesy email to
> webmaster@.. when they pull more than X number of feeds from a single site?

Perhaps aggregators should add a header in the HTTP request, identifying
themselves? Or maybe a special User-Agent field? Something of the form:

Aggregator: name/version (http://www.site.for.more.info/)

would work well. Also, is anyone running a registry of aggregators? It's
getting a little hard to keep track of them all. I know that XMLtree has a
listing of some of them; perhaps we could set something up on dmoz.org. A
listing like the Robots list (http://info.webcrawler.com/mak/projects/robots/
IIRC), which records User-Agent strings and email addresses for various web
spiders, would be another way of dealing with the issue.
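
To make the header idea above a bit more concrete, here is a rough Python
sketch. Everything specific in it is an assumption for illustration: the
"Aggregator" header is only the proposal from this message (not a standard),
and the aggregator name, version, and example.org URLs are made up. It builds
a request carrying the identity string and shows how a site might parse it
back out of its logs.

```python
import re
import urllib.request


def identity(name, version, info_url):
    """Format an identity string as: name/version (info_url)."""
    return f"{name}/{version} ({info_url})"


def build_request(feed_url, ident):
    """Build (but do not send) a feed request that identifies the aggregator.

    The same string goes into the standard User-Agent header and into the
    hypothetical "Aggregator" header proposed in this thread.
    """
    headers = {"User-Agent": ident, "Aggregator": ident}
    return urllib.request.Request(feed_url, headers=headers)


# Site side: pull name, version, and info URL back out of a header value.
IDENTITY_RE = re.compile(
    r"^(?P<name>[^/\s]+)/(?P<version>\S+) \((?P<url>[^)]+)\)$"
)


def parse_identity(value):
    """Return a dict with name, version, url, or None if the form differs."""
    match = IDENTITY_RE.match(value)
    return match.groupdict() if match else None


if __name__ == "__main__":
    # Hypothetical aggregator and URLs, for illustration only.
    ident = identity("ExampleAggregator", "0.1",
                     "http://www.example.org/bot-info")
    req = build_request("http://www.example.org/feed.rss", ident)
    print(req.get_header("Aggregator"))
    print(parse_identity(ident))
```

Sending the string in User-Agent as well means a site could spot the
aggregator in ordinary access logs even if it never looks at the extra header.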

If there's interest, I'll set up a site where people can register their bot.

-- 
        Aaron Swartz         |"This information is top security.
<http://swartzfam.com/aaron/>|     When you have read it, destroy yourself."
  <http://www.theinfo.org/>  |             - Marshall McLuhan