<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.3.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: Yahoo Search Wants to Be More Like Google, Embraces Hadoop</title>
	<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/</link>
	<description>Startup and Tech News</description>
	<pubDate>Mon, 12 May 2008 08:18:21 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.3</generator>
		<item>
		<title>By: 14153f82220c</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2286561</link>
		<dc:creator>14153f82220c</dc:creator>
		<pubDate>Sat, 10 May 2008 06:53:35 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2286561</guid>
		<description>&lt;strong&gt;14153f82220c...&lt;/strong&gt;

14153f82220cdd728696...</description>
		<content:encoded><![CDATA[<p><strong>14153f82220c&#8230;</strong></p>
<p>14153f82220cdd728696&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan Ward</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2010899</link>
		<dc:creator>Ryan Ward</dc:creator>
		<pubDate>Fri, 29 Feb 2008 01:25:35 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2010899</guid>
		<description>Reading through the shear size and amount of data is mindboggling.</description>
		<content:encoded><![CDATA[<p>Reading through the shear size and amount of data is mindboggling.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mapreduced</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001448</link>
		<dc:creator>mapreduced</dc:creator>
		<pubDate>Thu, 21 Feb 2008 07:23:55 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001448</guid>
		<description>Thanks for this post and the clarification. Yahoo itself played a little bit with the words map and reduce so ordinary people understand and remind it. Good to see that Yahoo made some decissions in the past that now add value to the web. Combine this with some tough Microsoft deal makers and distribution opportunities and the other Company will get some serious competition the first time. Yahoo, you made my day ;</description>
		<content:encoded><![CDATA[<p>Thanks for this post and the clarification. Yahoo itself played a little bit with the words map and reduce so ordinary people understand and remind it. Good to see that Yahoo made some decissions in the past that now add value to the web. Combine this with some tough Microsoft deal makers and distribution opportunities and the other Company will get some serious competition the first time. Yahoo, you made my day ;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001385</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Thu, 21 Feb 2008 05:56:42 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001385</guid>
		<description>You guys are the lamest of all Google fanboys. Yahoo! follows google's footsteps? Yahoo! has been one of the largest proponents and developers on Hadoop. 

Google has nothing to do with Hadoop. They do closed source software. They should be marching in Yahoo!'s footsteps ... but, wait. Why would google do anything in open source? They are a black box *non* evil empire. 

Happy 1984 to all the fanboys.</description>
		<content:encoded><![CDATA[<p>You guys are the lamest of all Google fanboys. Yahoo! follows google&#8217;s footsteps? Yahoo! has been one of the largest proponents and developers on Hadoop. </p>
<p>Google has nothing to do with Hadoop. They do closed source software. They should be marching in Yahoo!&#8217;s footsteps &#8230; but, wait. Why would google do anything in open source? They are a black box *non* evil empire. </p>
<p>Happy 1984 to all the fanboys.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: chris smith</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001229</link>
		<dc:creator>chris smith</dc:creator>
		<pubDate>Thu, 21 Feb 2008 02:23:04 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001229</guid>
		<description>Interesting article, looks like you missed another similar story at the begining of FEB when Hypertable.org launched.....Seeing that they have had nearly 300 downloads and are talking with some very hip Valley companies.... why would you not of seen or covered this...(http://onotech.blogspot.com/)

Fred could not possibly be right could he....</description>
		<content:encoded><![CDATA[<p>Interesting article, looks like you missed another similar story at the begining of FEB when Hypertable.org launched&#8230;..Seeing that they have had nearly 300 downloads and are talking with some very hip Valley companies&#8230;. why would you not of seen or covered this&#8230;(http://onotech.blogspot.com/)</p>
<p>Fred could not possibly be right could he&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Markus Thomson</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001083</link>
		<dc:creator>Markus Thomson</dc:creator>
		<pubDate>Wed, 20 Feb 2008 23:07:31 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001083</guid>
		<description>Haha, today's XKCD comic is just so perfect for this thread:

http://xkcd.com/386/

awesome.</description>
		<content:encoded><![CDATA[<p>Haha, today&#8217;s XKCD comic is just so perfect for this thread:</p>
<p><a href="http://xkcd.com/386/" rel="nofollow">http://xkcd.com/386/</a></p>
<p>awesome.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erick Schonfeld</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001074</link>
		<dc:creator>Erick Schonfeld</dc:creator>
		<pubDate>Wed, 20 Feb 2008 22:58:46 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001074</guid>
		<description>Thank you Qian Wang for setting me straight. I've updated the post with a clarification.</description>
		<content:encoded><![CDATA[<p>Thank you Qian Wang for setting me straight. I&#8217;ve updated the post with a clarification.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric Baldeschwieler</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001066</link>
		<dc:creator>Eric Baldeschwieler</dc:creator>
		<pubDate>Wed, 20 Feb 2008 22:48:34 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001066</guid>
		<description>Hi Folks, thanks for the write-up.  A couple of comments:

Comparing a single job to all jobs run at google is not too informative.  Google clearly has a larger plant than we do and has undoubtedly run much larger jobs than ours, but we have been producing a comparably sized web search index for many years and the point of our announcement was that Hadoop can now support jobs of the scale needed to build Yahoo or Google scale services.  Doing some math on the Google numbers you quoted...

Their average job produces 14,000TB/2,217,000 jobs -&#62; 0.0063TB / job compared to our 300TB job.  So we can conclude that we have run a job 47,000 times larger than their average job!  Again, the point is that we can do full web work on Hadoop, not to compare our plant to theirs.

Also our team is very proud of our investment in the Hadoop platform.  It is our contributions to the Hadoop project that have made it possible to do full scale web search work on it.  We've been working towards this milestone for several years.

http://developer.yahoo.net/blog/archives/2007/07/yahoo-hadoop.html
http://radar.oreilly.com/archives/2007/08/yahoos-bet-on-h.html</description>
		<content:encoded><![CDATA[<p>Hi Folks, thanks for the write-up.  A couple of comments:</p>
<p>Comparing a single job to all jobs run at google is not too informative.  Google clearly has a larger plant than we do and has undoubtedly run much larger jobs than ours, but we have been producing a comparably sized web search index for many years and the point of our announcement was that Hadoop can now support jobs of the scale needed to build Yahoo or Google scale services.  Doing some math on the Google numbers you quoted&#8230;</p>
<p>Their average job produces 14,000TB/2,217,000 jobs -&gt; 0.0063TB / job compared to our 300TB job.  So we can conclude that we have run a job 47,000 times larger than their average job!  Again, the point is that we can do full web work on Hadoop, not to compare our plant to theirs.</p>
<p>Also our team is very proud of our investment in the Hadoop platform.  It is our contributions to the Hadoop project that have made it possible to do full scale web search work on it.  We&#8217;ve been working towards this milestone for several years.</p>
<p><a href="http://developer.yahoo.net/blog/archives/2007/07/yahoo-hadoop.html" rel="nofollow">http://developer.yahoo.net/blo.....adoop.html</a><br />
<a href="http://radar.oreilly.com/archives/2007/08/yahoos-bet-on-h.html" rel="nofollow">http://radar.oreilly.com/archi.....-on-h.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ONLINE SERVICES/INTERACTIVE MEDIA &#171; Daily Marauder</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001053</link>
		<dc:creator>ONLINE SERVICES/INTERACTIVE MEDIA &#171; Daily Marauder</dc:creator>
		<pubDate>Wed, 20 Feb 2008 22:17:57 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2001053</guid>
		<description>[...] and “reduces” them to a map of the Web so that ranking algorithms can be run against them. (http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] and “reduces” them to a map of the Web so that ranking algorithms can be run against them. (http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Colin Brumelle</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000996</link>
		<dc:creator>Colin Brumelle</dc:creator>
		<pubDate>Wed, 20 Feb 2008 21:02:57 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000996</guid>
		<description>For those interested, there is a well written explanation (in classic Joel style) of how "Map" and "Reduce" work here:
http://www.joelonsoftware.com/items/2006/08/01.html</description>
		<content:encoded><![CDATA[<p>For those interested, there is a well written explanation (in classic Joel style) of how &#8220;Map&#8221; and &#8220;Reduce&#8221; work here:<br />
<a href="http://www.joelonsoftware.com/items/2006/08/01.html" rel="nofollow">http://www.joelonsoftware.com/.....08/01.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dyde</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000991</link>
		<dc:creator>Dyde</dc:creator>
		<pubDate>Wed, 20 Feb 2008 20:58:54 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000991</guid>
		<description>Yahoo can do one thing to gain on Google, get rid of that slow page design on the search pages. Get rid of Javascript, gazillion cute images. Show only 7 results and 3 more links on the bottom.  That will make it load faster than Google. And they should not index subdomains and biz and info sites. Most of these are spam. But of course, I'm probably not the first to suggest these improvements. I'm sure their engineers do suggest similar changes but these suggestions die in huge bureaucracy Yahoo has become.</description>
		<content:encoded><![CDATA[<p>Yahoo can do one thing to gain on Google, get rid of that slow page design on the search pages. Get rid of Javascript, gazillion cute images. Show only 7 results and 3 more links on the bottom.  That will make it load faster than Google. And they should not index subdomains and biz and info sites. Most of these are spam. But of course, I&#8217;m probably not the first to suggest these improvements. I&#8217;m sure their engineers do suggest similar changes but these suggestions die in huge bureaucracy Yahoo has become.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: TechBeta &#187; Blog Archive &#187; Yahoo Search Wants to Be More Like Google, Embraces Hadoop</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000964</link>
		<dc:creator>TechBeta &#187; Blog Archive &#187; Yahoo Search Wants to Be More Like Google, Embraces Hadoop</dc:creator>
		<pubDate>Wed, 20 Feb 2008 20:22:43 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000964</guid>
		<description>[...] more: techcrunch.com  addthis_url = [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] more: techcrunch.com  addthis_url = [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: till</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000962</link>
		<dc:creator>till</dc:creator>
		<pubDate>Wed, 20 Feb 2008 20:21:09 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000962</guid>
		<description>@Metabass: And apparently it takes even less to comment on here. 

@Erick: Still an interesting post. Thanks. ;) I don't get the fuss.</description>
		<content:encoded><![CDATA[<p>@Metabass: And apparently it takes even less to comment on here. </p>
<p>@Erick: Still an interesting post. Thanks. <img src='http://www.techcrunch.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> I don&#8217;t get the fuss.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Qian Wang</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000952</link>
		<dc:creator>Qian Wang</dc:creator>
		<pubDate>Wed, 20 Feb 2008 20:11:47 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000952</guid>
		<description>Erick, the confusion comes from your use of the words "map" and "reduce" in a different way than what they mean in MapReduce.  In MapReduce, "map" refers to mapping a function over a set of data.  This is an operation that can be quite easily parallelized, which is why Google is able to bring such a large array of processors to bear.   The "reduce" part takes all the results of the mapping step and recombines them into the end result (or set of results).

Your paraphrasing would be fine if you just didn't use the words "map" and "reduce".</description>
		<content:encoded><![CDATA[<p>Erick, the confusion comes from your use of the words &#8220;map&#8221; and &#8220;reduce&#8221; in a different way than what they mean in MapReduce.  In MapReduce, &#8220;map&#8221; refers to mapping a function over a set of data.  This is an operation that can be quite easily parallelized, which is why Google is able to bring such a large array of processors to bear.   The &#8220;reduce&#8221; part takes all the results of the mapping step and recombines them into the end result (or set of results).</p>
<p>Your paraphrasing would be fine if you just didn&#8217;t use the words &#8220;map&#8221; and &#8220;reduce&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: some guy</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000938</link>
		<dc:creator>some guy</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:53:22 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000938</guid>
		<description>the map step breaks the problem down into many small chunks and sends them to individual boxes for computation.

each box then does computation on it's small chunk of data

the reduce step take the result from all the small chunks and combines them back into one big solution.

that's the rough explanation of what's going on.</description>
		<content:encoded><![CDATA[<p>the map step breaks the problem down into many small chunks and sends them to individual boxes for computation.</p>
<p>each box then does computation on it&#8217;s small chunk of data</p>
<p>the reduce step take the result from all the small chunks and combines them back into one big solution.</p>
<p>that&#8217;s the rough explanation of what&#8217;s going on.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: YDRIVE</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000935</link>
		<dc:creator>YDRIVE</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:52:09 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000935</guid>
		<description>@19 - 

1) http://feedblog.org/2008/01/06/mapreduce-simplified-data-processing-on-large-clusters/
&lt;b&gt;(and click on the pdf file)&lt;/b&gt;

2) http://wiki.apache.org/lucene-hadoop/HadoopMapReduce</description>
		<content:encoded><![CDATA[<p>@19 - </p>
<p>1) <a href="http://feedblog.org/2008/01/06/mapreduce-simplified-data-processing-on-large-clusters/" rel="nofollow">http://feedblog.org/2008/01/06.....-clusters/</a><br />
<b>(and click on the pdf file)</b></p>
<p>2) <a href="http://wiki.apache.org/lucene-hadoop/HadoopMapReduce" rel="nofollow">http://wiki.apache.org/lucene-.....pMapReduce</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Metabass</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000931</link>
		<dc:creator>Metabass</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:45:13 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000931</guid>
		<description>TechCrunch... It takes all the technologies on the Web 2.0 found by Erick and "crunches" them down into inaccurate blog posts.</description>
		<content:encoded><![CDATA[<p>TechCrunch&#8230; It takes all the technologies on the Web 2.0 found by Erick and &#8220;crunches&#8221; them down into inaccurate blog posts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000930</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:43:33 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000930</guid>
		<description>In other news... Yahoo's YUI 2.5 is released.

http://yuiblog.com/blog/2008/02/20/yui-250-released/</description>
		<content:encoded><![CDATA[<p>In other news&#8230; Yahoo&#8217;s YUI 2.5 is released.</p>
<p><a href="http://yuiblog.com/blog/2008/02/20/yui-250-released/" rel="nofollow">http://yuiblog.com/blog/2008/0.....-released/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erick Schonfeld</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000918</link>
		<dc:creator>Erick Schonfeld</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:37:13 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000918</guid>
		<description>@3 and @6, I clearly state in the post that Hadoop "also works for large-scale computing problems beyond search"

Also the "It" in that sentence you both cite refers to Hadoop, not MapReduce, and is a paraphrase of Yahoo's own description of what Hadoop does:

http://developer.yahoo.com/blogs/hadoop/2008/02/yahoo-worlds-largest-production-hadoop.html

"The Yahoo! Search Webmap is a Hadoop application that runs on a more than 10,000 core Linux cluster and produces data that is now used in every Yahoo! Web search query.

"The Webmap build starts with every Web page crawled by Yahoo! and produces a database of all known Web pages and sites on the internet and a vast array of data about every page and site. This derived data feeds the Machine Learned Ranking algorithms at the heart of Yahoo! Search."

I read that to mean that Hadoop creates a map of the Web and reduces (i.e. compresses) it into a manageable set of data that can be placed into a database so that the ranking algorithms can do their work.

If this is wrong, someone please explain (and, no, that does not mean linking to Wikipedia).  Explain it in English.</description>
		<content:encoded><![CDATA[<p>@3 and @6, I clearly state in the post that Hadoop &#8220;also works for large-scale computing problems beyond search&#8221;</p>
<p>Also the &#8220;It&#8221; in that sentence you both cite refers to Hadoop, not MapReduce, and is a paraphrase of Yahoo&#8217;s own description of what Hadoop does:</p>
<p><a href="http://developer.yahoo.com/blogs/hadoop/2008/02/yahoo-worlds-largest-production-hadoop.html" rel="nofollow">http://developer.yahoo.com/blo.....adoop.html</a></p>
<p>&#8220;The Yahoo! Search Webmap is a Hadoop application that runs on a more than 10,000 core Linux cluster and produces data that is now used in every Yahoo! Web search query.</p>
<p>&#8220;The Webmap build starts with every Web page crawled by Yahoo! and produces a database of all known Web pages and sites on the internet and a vast array of data about every page and site. This derived data feeds the Machine Learned Ranking algorithms at the heart of Yahoo! Search.&#8221;</p>
<p>I read that to mean that Hadoop creates a map of the Web and reduces (i.e. compresses) it into a manageable set of data that can be placed into a database so that the ranking algorithms can do their work.</p>
<p>If this is wrong, someone please explain (and, no, that does not mean linking to Wikipedia).  Explain it in English.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owen O'Malley</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000905</link>
		<dc:creator>Owen O'Malley</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:27:10 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000905</guid>
		<description>It really runs the other way. Yahoo needed to update their infrastructure and decided to invest in an open source infrastructure rather than a proprietary one. Yahoo now has the advantage that undergraduates at schools like UCB and UW are being taught to program using Hadoop.

The Yahoo webmap is one Hadoop application, but it is far from the only one at Yahoo. One of the side benefits of having a good distributed computing infrastructure is that it becomes much easier to write ad-hoc programs that run on a large cluster of machines. There are more than 40,000 cores running Hadoop at Yahoo.

Hadoop was the name of Doug Cutting's *son's* stuffed elephant.</description>
		<content:encoded><![CDATA[<p>It really runs the other way. Yahoo needed to update their infrastructure and decided to invest in an open source infrastructure rather than a proprietary one. Yahoo now has the advantage that undergraduates at schools like UCB and UW are being taught to program using Hadoop.</p>
<p>The Yahoo webmap is one Hadoop application, but it is far from the only one at Yahoo. One of the side benefits of having a good distributed computing infrastructure is that it becomes much easier to write ad-hoc programs that run on a large cluster of machines. There are more than 40,000 cores running Hadoop at Yahoo.</p>
<p>Hadoop was the name of Doug Cutting&#8217;s *son&#8217;s* stuffed elephant.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: YDRIVE</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000891</link>
		<dc:creator>YDRIVE</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:11:39 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000891</guid>
		<description>Yahoo! does still have lots of bright engineers.. only a few noisy guys at the top make themselves look stupid.</description>
		<content:encoded><![CDATA[<p>Yahoo! does still have lots of bright engineers.. only a few noisy guys at the top make themselves look stupid.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: WebSide Ventures</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000886</link>
		<dc:creator>WebSide Ventures</dc:creator>
		<pubDate>Wed, 20 Feb 2008 19:04:50 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000886</guid>
		<description>Hey, I totally bought the MapReduce quote...looks like you can't slip much past this crowd though.</description>
		<content:encoded><![CDATA[<p>Hey, I totally bought the MapReduce quote&#8230;looks like you can&#8217;t slip much past this crowd though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000880</link>
		<dc:creator>Nick</dc:creator>
		<pubDate>Wed, 20 Feb 2008 18:58:14 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000880</guid>
		<description>Well, this is not like an impulse idea from Yahoo!. It was obvious this day would come already when they hired Doug Cutting over a year ago. Yahoo!, with Doug leading the way, have since been the single most proponents and developers of Hadoop.</description>
		<content:encoded><![CDATA[<p>Well, this is not like an impulse idea from Yahoo!. It was obvious this day would come already when they hired Doug Cutting over a year ago. Yahoo!, with Doug leading the way, have since been the single most proponents and developers of Hadoop.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owen</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000877</link>
		<dc:creator>Owen</dc:creator>
		<pubDate>Wed, 20 Feb 2008 18:54:52 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000877</guid>
		<description>Interesting that it is more efficient - should increase overall search throughput - shouldn't affect quality of results though. Not that that matters much. As far as I (a sample of one) can tell, Google's constant tinkering with search algorithms has led to worse and worse results over the past two years. Yahoo's results have been better for well over a year now. It may not be Google's fault entirely. There is so much attempted manipulation of its results that playing defense against that may be hurting the overall quality.</description>
		<content:encoded><![CDATA[<p>Interesting that it is more efficient - should increase overall search throughput - shouldn&#8217;t affect quality of results though. Not that that matters much. As far as I (a sample of one) can tell, Google&#8217;s constant tinkering with search algorithms has led to worse and worse results over the past two years. Yahoo&#8217;s results have been better for well over a year now. It may not be Google&#8217;s fault entirely. There is so much attempted manipulation of its results that playing defense against that may be hurting the overall quality.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RJacobsen</title>
		<link>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000870</link>
		<dc:creator>RJacobsen</dc:creator>
		<pubDate>Wed, 20 Feb 2008 18:43:50 +0000</pubDate>
		<guid>http://www.techcrunch.com/2008/02/20/yahoo-search-wants-to-be-more-like-google-embraces-hadoop/#comment-2000870</guid>
		<description>This is great news.. as just this morning I was wondering how come Google is referring 10 times the amount of traffic to my site than Yahoo is. I hope that this new algorithm will help my targeted audience to be able to locate my site easier, when entering certain keywords pertaining to my site.</description>
		<content:encoded><![CDATA[<p>This is great news.. as just this morning I was wondering how come Google is referring 10 times the amount of traffic to my site than Yahoo is. I hope that this new algorithm will help my targeted audience to be able to locate my site easier, when entering certain keywords pertaining to my site.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 1.428 seconds -->
