Chris Babcock <cbabcock@[EMAIL PROTECTED]> writes:
>I don't know whether it constitutes a "plague" or not, but I have some
>preliminary figures from the URL checker that I set up to spider the
>diplom.org site. After seven and a half hours, the program has checked
>over 29,000 links. At least 50,000 remain to be checked. About 3% of the
>links checked so far are bad. For those about to strain their brains
>doing the math, the program is already reporting over 1,000 broken
>links. I don't think that I'll be shelling in and fixing all of them
>with my text editor of choice (vi) tonight. ;-)
>The good news is that I'm starting to see some instances of "Item
>already checked" flashing by. This would mean that I probably don't
>have to wait another 16 hours or more for final results and that there
>may not be 3,000 broken links by the time it's finished checking the
>site...
>Chris
Hi Chris,
What you mean, I think, is that you're double-counting some of the
"repeated" broken links, where the same link exists in many places and of
course each copy is broken? Some things are important here: how HUGE the
site is, and how many links there are!!! And is 3% large or small? It's
about what I would have expected. I think that's neither large nor small,
just the legacy you'd expect from such a huge site, originally built
over ten years ago.
Jim-Bob
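
For anyone who wants to see the double-counting question in concrete terms, here is a minimal sketch in Python. It is not the URL checker Chris is running; the function name summarize_broken, the tuple layout, and the sample paths are all made up for illustration. It only shows how the number of broken link occurrences can differ from the number of distinct broken targets.

```python
from collections import Counter

def summarize_broken(results):
    """Tally link-check results.

    `results` is an iterable of (source_page, target_url, is_ok) tuples.
    This layout is an assumption for the sketch, not the real checker's output.
    """
    checked = 0
    broken_by_target = Counter()  # broken target URL -> number of pages linking to it
    for _source, target, is_ok in results:
        checked += 1
        if not is_ok:
            broken_by_target[target] += 1
    broken_occurrences = sum(broken_by_target.values())
    return {
        "checked": checked,
        "broken_occurrences": broken_occurrences,
        "distinct_broken_targets": len(broken_by_target),
        "broken_rate": broken_occurrences / checked if checked else 0.0,
    }

# Hypothetical sample: one missing page linked from three places,
# one other missing page, and one good link.
sample = [
    ("/a.html", "/missing.html", False),
    ("/b.html", "/missing.html", False),
    ("/c.html", "/missing.html", False),
    ("/a.html", "/index.html", True),
    ("/b.html", "/gone.html", False),
]

summary = summarize_broken(sample)
print(summary)
```

In this made-up sample there are four broken occurrences but only two distinct broken targets, so restoring two pages would clear all four reports. Whether the real figures behave the same way depends on how the checker counts repeats.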

