[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

discovery vs information

To: syndication@yahoogroups.com
Subject: discovery vs information
From: Julian Bond <julian_bond@voidstar.com>
Date: Wed, 15 Oct 2003 11:56:23 +0100
User-agent: Turnpike/6.02-U (<NihPaT1eq7QPEjqHWIMIZRyCL3>)

More thoughts on mypublicfeeds.

- I wonder if it would help to separate the discovery part of this fromthe format part and try and solve these independently.

- I was thinking about use cases. A classic one for me is this. I waslooking at Zdnet UK, wireless section.(http://news.zdnet.co.uk/communications/wireless/). I'm sure I rememberreading that there was some RSS on zdnet somewhere but I don't knowwhere, so what should I do next? There's no obvious XML link on thevisible page. So do I "view source" and look for <link> and/or startlooking for likely files containing lists in:-

http://news.zdnet.co.uk/communications/wireless/
http://news.zdnet.co.uk/communications/
http://news.zdnet.co.uk/
http://www.zdnet.co.uk/

In the absence of a *really widely* implemented standard like robots.txtlooking for files will just give me loads of 404s. Incidentally, there'sno robots.txt or favicon.ico in any of those zdnet directories either.

- There's enough different reasons for having lists and enough differentthings to list that it feels to me like we need to solve this generallyand not just for rss. Even for rss, I feel the need to spec whichflavour each entry refers to.

- I think I've got three or four things I'd like to put in the header ofevery html file. They're all optional and there might be more than oneof each.

1) Here's a pointer to single alternate version of the same content

2) Here's a pointer to a machine-readable file containing lists ofalternate versions of this content or related content. eg RSS0.92,RSS1.0, RSS2.0, Atom, WML, Author's FOAF, Assorted metadata3) Here's a pointer to a machine-readable file containing lists of filesrelated to this section of the site4) Here's a pointer to a machine-readable file containing lists of filesrelated to this whole site.

1) needs more work on identifying the type of the target file. Not typeas in text/plain vs text/xml, but type as in RSS0.92 vs FOAF vs Atom

2), 3) and 4) need work on the markup approach and standard. I don'tthink any of RDFS:seeAlso, OPML or OCS are actually good enough orcomplete enough yet. If this is going to be general it needs to solve awhole load of cases now and it absolutely must handle new file types.And it had better be really simple to parse and produce too. There'sgoing to be a temptation to start adding all sorts of meta-meta-dataabout each entry. Please resist this. It should be a simple list of filetype, name, URI. Any additional meta-data about each file should becontained in the files themselves and available by collecting them.

3) and 4) are actually about metadata describing the directorystructure. I think there is a case here for the W3C to come up with astandard way for this to be found and created. If they haven't already.It feels like there's a case for a standard file with a standard name ineach directory. This would actually help robots because it could containsitemap lists of pages.

However, I think we actually already have a standard here. And that'sthat web requests to directories with no file name should returnsomething via http and with a mime type. Either a web page, an index, a404, a graphic or whatever. Now if the returned doc is of type text/htmlthen we're back to <link>


Aside: I wonder if creating new http headers is out of the question? ;-)

So. I think we need the following:-
1) Some standards for specifying target file content type in <link>

2) As well as RSSx.xx, Atom, NITF, FOAF etc, create some content typesfor cases 1,2,3,4 above.

3) A defined way of creating new content types.

4) A standard file format for lists of <links> This needs to include asection which is metadata about this list. We can probably do this in away that the entries can be inserted into all sorts of other types offiles.

Once we've got this far, then we can move to stage 5) viz. evangelising;writing toolkits; writing apps; writing validators; arguing about commonlocations and file names; arguing about whether it should be xml or RDFor both; arguing about what it all means; and all that other stuff we'reso good at.

This seems to me to be a bare minimum. Once we've done that if the fixedfilename camp want to create these files with a fixed filename on theirwebservers, then they can go ahead. Just as long as they do <link> too.The fixed filename standard can then succeed or fail on it's own meritswithout killing the file format standard in the process.


--
Julian Bond Email&MSM: julian.bond@voidstar.com
Webmaster:              http://www.ecademy.com/
Personal WebLog:       http://www.voidstar.com/
M: +44 (0)77 5907 2173   T: +44 (0)192 0412 433

Follow-Ups:
- Re: [syndication] discovery vs information
  - From: "Bill Kearney" <wkearney@syndic8.com>
- A little different approach to discovery
  - From: Roy Osherove <fireride@netvision.net.il>

Prev by Date: seeAlso
Next by Date: Re: [syndication] RFC: myPublicFeeds.opml
Previous by thread: Just a reminder...
Next by thread: Re: [syndication] discovery vs information
Index(es):
- Date
- Thread