Cool features of the Slurp! Bot

In: smp

11 Feb 2005

Yahoo Search Blog has a great posting on how the Slurp Bot tries to conserve bandwidth by making use of compression and cache-control headers. [here]

As a Web performance fanatic, it is heartening to see that these folks have taken such care, and put such thought into their indexing crawler. They want it to be accurate, but they don’t want to slam your site.

A while back, I had to write a robots.txt file for WebPerformance to keep the MSNBot from stomping the site on a daily basis. This site uses frames and query variables to produce the various performance graphs. Well, the MSNBot was indexing every page and every variation almost daily. Finally, I said go away, just to that crawler. All the others are fine. Maybe MSN Search should take a page from the Yahoo! (Inktomi) Bot development team.

Spread the Love:
  • Facebook
  • Twitter
  • Ping.fm
  • Digg
  • StumbleUpon
  • LinkedIn
  • Reddit
  • Slashdot
  • Netvouz
  • Identi.ca
  • Technorati
  • del.icio.us
  • email

Related Posts

  • Yahoo's search engine HTTP advises are of high interest
    Yahoo! Search give recently in their blog some known advises, but worth recalling again and again for optimizing bandwidth needs of websites and accelerating them as a side effect.

    Basically, they advertize that their robot/crawler/spider (i.e. the s
blog comments powered by Disqus

About this blog

Stephen Pierzchala is one of a 10-year veteran of the Web performance field who also writes on topics that interest his non-linear world-view.

Contact

stephen@pierzchala.com

+1 (508) 410-3865