[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [syndication] Syndic8.com
- To: syndication@yahoogroups.com
- Subject: Re: [syndication] Syndic8.com
- From: Mark Nottingham <mnot@mnot.net>
- Date: Mon, 1 Oct 2001 16:00:12 -0700
- In-reply-to: <006601c14acc$779c2470$070d0dc0@monster>; from jeff@vertexdev.com on Mon, Oct 01, 2001 at 03:57:37PM -0700
- References: <20011001143357.I1343@mnot.net> <006601c14acc$779c2470$070d0dc0@monster>
- User-agent: Mutt/1.2.5i
Any way to make it possible to check a submitted URI (for example, if
someone didn't want to list their feed yet)?
Cheers,
On Mon, Oct 01, 2001 at 03:57:37PM -0700, Jeff Barr wrote:
> Mark asks,
>
> > Just out of curiosity, what's the engine you're using to find the XML
> > errors and highlight them? Is it homebrew, or open source?
>
> I wrote my own RSS parser in PHP, using PHP's XML module. It uses the
> Expat parser. If it returns an error message then I grab it and keep it,
> along with the line number.
>
> The display is just a little bit of PHP. Again, nothing amazing.
>
> Jeff;
>
> -----Original Message-----
> From: Mark Nottingham [mailto:mnot@mnot.net]
> Sent: Monday, October 01, 2001 2:34 PM
> To: syndication@yahoogroups.com
> Subject: Re: [syndication] Syndic8.com
>
>
>
> Oh, SWEET! That's really cool.
>
> Just out of curiosity, what's the engine you're using to find the XML
> errors and highlight them? Is it homebrew, or open source?
>
> re: transient nature of feeds; yeah, that's one of the big challenges
> we face.
>
> Cheers & thanks,
>
>
>
> On Mon, Oct 01, 2001 at 01:20:42PM -0700, Jeff Barr wrote:
> > Mark asks:
> >
> > > Is there any way to break down the errors further?
> >
> > There are no more detailed stats yet on the page. Here is a quick
> > breakdown.
> > The NULL parse_error represents a working feed.
> >
> > mysql> select parse_error, count(*) from polldetail where
> > mysql> summaryid=21 group by parse_error order by parse_error;
> >
> >
> +------------------------------------------------------------+----------
> > +
> > | parse_error |
> count(*)
> > |
> >
> +------------------------------------------------------------+----------
> > +
> > | NULL |
> 2513
> > |
> > | junk after document element |
> 2
> > |
> > | mismatched tag |
> 46
> > |
> > | no element found |
> 14
> > |
> > | not well-formed |
> 148
> > |
> > | syntax error |
> 61
> > |
> > | undefined entity |
> 6
> > |
> > | xml processing instruction not at start of external entity |
> 2
> > |
> >
> +------------------------------------------------------------+----------
> > +
> >
> > The site retains the XML from each poll to each feed, as well as the
> > line number of the error. So here is a broken feed with a full
> > indication
> > of the problem:
> >
> >
> http://www.syndic8.com/feedinfo.php?FeedID=884&Section=xml&SummaryID=21
> >
> > This solves what has been a very tough issue with syndicated
> headlines.
> > Things change quickly enough that it can be hard to track down
> something
> >
> > that is broken for a few hours or so.
> >
> > Jeff;
> >
> > -----Original Message-----
> > From: Mark Nottingham [mailto:mnot@mnot.net]
> > Sent: Monday, October 01, 2001 11:43 AM
> > To: syndication@yahoogroups.com
> > Subject: Re: [syndication] Syndic8.com
> >
> >
> >
> > This is great info.
> >
> > Wow, a 10% error rate... not good. I have a feeling that almost all
> > of them are entity encoding errors in links, etc.; is there any way
> > to break down the errors further?
> >
> > Cheers,
> >
> >
> >
> >
> > On Mon, Oct 01, 2001 at 07:37:35PM +0100, Julian Bond wrote:
> > > Http://www.syndic8.com/polllist.php
> > > Current statistics
> > >
> > > Feeds polled: 2792
> > > Live: 2458
> > > Dead: 61
> > > Parse errors: 279
> > >
> > > These are all native RSS feeds.
> > >
> > > --
> > > Julian Bond email: julian_bond@voidstar.com
> > > CV/Resume: http://www.voidstar.com/cv/
> > > WebLog: http://www.voidstar.com/
> > > HomeURL: http://www.shockwav.demon.co.uk/
> > > M: +44 (0)77 5907 2173 T: +44 (0)192 0412 433
> > > ICQ:33679568 tag:So many words, so little time
> > >
> > >
> > >
> > > Your use of Yahoo! Groups is subject to
> > http://docs.yahoo.com/info/terms/
> > >
> > >
> >
> > --
> > Mark Nottingham
> > http://www.mnot.net/
> >
> >
> >
> >
> > Your use of Yahoo! Groups is subject to
> > http://docs.yahoo.com/info/terms/
> >
> >
> >
> >
> >
> > Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/
> >
> >
>
> --
> Mark Nottingham
> http://www.mnot.net/
>
>
>
>
> Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
> Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
>
>
--
Mark Nottingham
http://www.mnot.net/