Tesugen

Answers from Aaron

A couple of days ago, Aaron Swartz posted an entry on his Google Weblog, requesting questions about Google. I sent off some questions I had regarding how Google indexes pages as it crawls around the net.

He answered me tonight that the threshold (that I’ve suspected to be used to determine which pages go into the main index) is the PageRank. Obviously! Why have another measurement for pages? I didn’t think of that.

He also wrote that the system of crawling pages every day is something Google began doing recently, and that he suspects that the “Similar pages” links use “their linking index, which likely is only part of the permanent crawl (since they don’t care about temporary pages that much)”.

The above was posted to my personal weblog on April 27, 2002. My name is Peter Lindberg and I am a thirtysomething software developer and dad living in Stockholm, Sweden. Here, you’ll find posts in English and Swedish about whatever happens to interest me for the moment.

Tags:

Related posts:

Posted around the same time:

The seven most recent posts:

  1. Tesugen Replaced (October 7)
  2. My Year of MacBook Troubles (May 16)
  3. Tesugen Turns Five (March 21)
  4. Gustaf Nordenskiöld om keramik kontra kläddesign (December 10, 2006)
  5. Se till att ha två buffertar för oförutsedda utgifter (October 30, 2006)
  6. Bra tips för den som vill börja fondspara (October 7, 2006)
  7. Light-Hearted Parenting Tips (September 16, 2006)
Bloggtoppen.se