{"id":234,"date":"2010-06-10T02:05:51","date_gmt":"2010-06-10T02:05:51","guid":{"rendered":"http:\/\/samwebman.wordpress.com\/?p=234"},"modified":"2010-06-10T02:05:51","modified_gmt":"2010-06-10T02:05:51","slug":"google-new-search-index-caffeine","status":"publish","type":"post","link":"https:\/\/www.intentrust.com\/?p=234","title":{"rendered":"Google new search index: Caffeine"},"content":{"rendered":"<blockquote>\n<div>6\/08\/2010 05:00:00 PM<\/div>\n<div><em>(Cross-posted on the <\/em><a href=\"http:\/\/googlewebmastercentral.blogspot.com\/2010\/06\/our-new-search-index-caffeine.html\"><em>Webmaster  Central Blog<\/em><\/a><em>)<\/em><\/div>\n<p>Today, we&#8217;re  announcing the completion of a new web indexing system called Caffeine.  Caffeine provides 50 percent fresher results for web searches than our  last index, and it&#8217;s the largest collection of web content we&#8217;ve  offered. Whether it&#8217;s a news story, a blog or a forum post, you can now  find links to relevant content much sooner after it is published than  was possible ever before.<\/p>\n<p>Some background for those of you who  don&#8217;t build search engines for a living like us: when you search Google,  you&#8217;re not searching the live web. Instead you&#8217;re searching Google&#8217;s  index of the web which, like the list in the back of a book, helps you  pinpoint exactly the information you need. (Here&#8217;s a <a href=\"http:\/\/www.google.com\/howgoogleworks\/\">good explanation<\/a> of how  it all works.)<\/p>\n<p>So why did we build a new search indexing system?  Content on the web is blossoming. It&#8217;s growing not just in size and  numbers but with the advent of video, images, news and real-time  updates, the average webpage is richer and more complex. In addition,  people&#8217;s expectations for search are higher than they used to be.  Searchers want to find the latest relevant content and publishers expect  to be found the instant they publish.<\/p>\n<p>To keep up with the  evolution of the web and to meet rising user expectations, we&#8217;ve built  Caffeine. The image below illustrates how our old indexing system worked  compared to Caffeine:<\/p>\n<div>\n<a href=\"http:\/\/1.bp.blogspot.com\/_7ZYqYi4xigk\/TA7I2hFm20I\/AAAAAAAAGQA\/nbajoe0ibHA\/s1600\/caffeine.jpg\"><img src=\"http:\/\/1.bp.blogspot.com\/_7ZYqYi4xigk\/TA7I2hFm20I\/AAAAAAAAGQA\/nbajoe0ibHA\/caffeine.jpg\" border=\"0\" alt=\"\" \/><\/a><br \/>\nOur  old index had several layers, some of which were refreshed at a faster  rate than others; the main layer would update every couple of weeks. To  refresh a layer of the old index, we would analyze the entire web, which  meant there was a significant delay between when we found a page and  made it available to you.<\/p>\n<p>With Caffeine, we analyze the web in  small portions and update our search index on a continuous basis,  globally. As we find new pages, or new information on existing pages, we  can add these straight to the index. That means you can find fresher  information than ever before\u2014no matter when or where it was published.<\/p>\n<p>Caffeine  lets us index web pages on an enormous scale. In fact, every second  Caffeine processes hundreds of thousands of pages in parallel. If this  were a pile of paper it would grow three miles taller every second.  Caffeine takes up nearly 100 million gigabytes of storage in one  database and adds new information at a rate of hundreds of thousands of  gigabytes per day. You would need 625,000 of the largest iPods to store  that much information; if these were stacked end-to-end they would go  for more than 40 miles.<\/p>\n<p>We&#8217;ve built Caffeine with the future in  mind. Not only is it fresher, it&#8217;s a robust foundation that makes it  possible for us to build an even faster and comprehensive search engine  that scales with the growth of information online, and delivers even  more relevant search results to you. So stay tuned, and look for more  improvements in the months to come.<\/p>\n<p>Posted  by Carrie Grimes, Software Engineer<\/p><\/div>\n<\/blockquote>\n<div>Reference from: http:\/\/googleblog.blogspot.com\/2010\/06\/our-new-search-index-caffeine.html<\/div>\n","protected":false},"excerpt":{"rendered":"<p>6\/08\/2010 05:00:00 PM (Cross-posted on the Webmaster Ce &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/www.intentrust.com\/?p=234\" class=\"more-link\">\u95b1\u8b80\u5168\u6587<span class=\"screen-reader-text\">\u3008Google new search index: Caffeine\u3009<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[9],"tags":[109],"_links":{"self":[{"href":"https:\/\/www.intentrust.com\/index.php?rest_route=\/wp\/v2\/posts\/234"}],"collection":[{"href":"https:\/\/www.intentrust.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.intentrust.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.intentrust.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.intentrust.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=234"}],"version-history":[{"count":0,"href":"https:\/\/www.intentrust.com\/index.php?rest_route=\/wp\/v2\/posts\/234\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.intentrust.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=234"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.intentrust.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=234"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.intentrust.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=234"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}