May 10, 2003

Crash update, Google insight

It's now been a little less than two weeks from the website crash. The new installation of the blog is debugged, and I've fixed any critical file-not-found errors. Old provider http://www.phpwebhosting.com/ remains very apologetic that they didn't have one bit of backup. But sorry and lost mailing-list data still leaves me with painful loss of mailing-list data. That's another personal discouragement.

The other notable after-effect was that I was no longer being visited by My Friend The Freshbot (the Google crawler which checks certain sites for daily updates). It turns out that the daily Google crawler still thinks that my site is hosted on the old location (http://www.phpwebhosting.com/), even though it's now been moved for many days.

That's interesting, as it indicates that the daily Google crawler is rather slow to update its DNS. I've got a log full of errors. That log shows a pattern which seems to confirm that the highest PR or most-linked pages are what forms the basis of the daily crawl (which makes sense).

I'm seeing a brief daily visit from a Google crawler on my new location (Project Geek). But it's just checking the front page and robots.txt. This is probably Google's general crawler to keep track of what websites exist and what shouldn't be searched on them (robots.txt).

As an interim measure, I put some pages back on the old host, so hopefully Google/Freshbot will find them soon. I'm going to keep track of when Freshbot Comes Home (new home, that is).

By Seth Finkelstein | posted in google , website | on May 10, 2003 11:53 PM (Infothought permalink) | Followups
Seth Finkelstein's Infothought blog (Wikipedia, Google, censorware, and an inside view of net-politics) - Syndicate site (subscribe, RSS)

Subscribe with Bloglines      Subscribe in NewsGator Online  Google Reader or Homepage