Yahoo BOSS Might Be Bigger Than Bing
by Erick Schonfeld on August 18, 2009

One of the least appreciated, but smartest, moves Yahoo has made in the past year is to launch Yahoo BOSS, its open search APIs which lets developers create their own custom search engine using Yahoo’s algorithms. We use it to power search across the TechCrunch network. And we are not alone. It has become immensely popular.

By last May, Yahoo BOSS was serving up 30 million search queries a day. You can see the rapid growth in search volume in the chart above, which comes from a Technology Review profile of Vik Singh, the 24-year-old engineer who was one of the main champions behind Yahoo BOSS. (He is one of Technology Review’s 2009 Innovators Under 35)

At 30 million queries a day, that comes out to about 900 million queries a month, which would make Yahoo BOSS the fourth largest search engine in the U.S. with about a 6 percent share. That is just below the 9 percent share (and 1.2 billion queries a month) comScore estimates for Bing.

But that is based on data from April. The chart is missing the last three months because Yahoo won’t update the numbers (I asked). Depending on the growth rate of Yahoo BOSS search queries since then, BOSS should now either be at par with Bing or even slightly larger. Between February, 2009 and April, 2009, search queries grew 50 percent (from 20 million a day to 30 million). Here are three different growth scenarios for the period May through July and the corresponding search volume numbers they would imply:

    Growth Scenarios
  1. 50 percent growth = 45 million queries a day = 1.35 billion queries a month
  2. 40 percent growth = 42 million queries a day = 1.26 billion queries a month
  3. 30 percent growth = 39 million queries a day = 1.17 billion queries a month

Remember, Bing was at an estimated 1.21 billion queries a month for the month of July. So BOSS might very well be as big as Bing. In fact, a couple months before the Bing-Yahoo deal, I kept hearing from people connected to Yahoo that BOSS on its own was bigger than Microsoft search. Now Microsoft owns BOSS as part of its deal to take over Yahoo’s search operations.

On the day the deal was announced, Microsoft SVP Yusuf Mehdi told me that he wants to keep BOSS alive because “there is a lot of goodness there.” I’ll say. It adds about another six points to Bing’s overall search volume market share. (Click the market share table below to enlarge).

Update: After some reflection (and sleep), I think it is less likely that BOSS is bigger than Bing. First, as has been noted in comments, the comScore numbers are only for the U.S., while the BOSS numbers are worldwide. So best case scenario (assuming the U.S. accounts for only half of Bing searches, which is the industry norm), BOSS is really only half the size of Bing proper. That is still quite significant.

But there is something else to consider as well. Bing also has an API, which back in March served 3 billion queries. Microsoft tells me that number has increased by more than 50 percent, so Bing’s own unmetered APIs are responsible for many more searches than BOSS, and are also not counted in the comScore data. All of this underlines the larger point here, which is that opening up search to outside developers is a powerful way to gain search query share.

july-search-volume-tbale

Advertisement

Responses

Comments rss icon

  • Big difference here, which is always an issue with Techcrunch is you’re using Yahoo’s internal numbers and comparing them to comScore’s, which every engine (and company, including you guys) would say is pretty faulty reasoning.

    Not saying BOSS isn’t huge, it is, and is doing *fantastically well*… and not even saying it’s not as big as Bing (which I’m a fan of), just saying you’re drawing conclusions that the data doesn’t support since we don’t know the correlation between internal queries and comScore recorded queries.

    This also begs an even more fundamental question though: if all these queries are happening *via* Yahoo, why aren’t they being counted *somewhere* by comScore?

  • Funny that the “Climbing the Charts” source is…..Yahoo.

  • Erick, you have a short memory. Don’t you remember when Carol said Yahoo was never a search company?

  • I’m not aware of Google having anything similar to BOSS; so sites use custom search instead (using url: in the query, or whatever).

    Thus, are the figures for Google’s queries/month inclusive of these searches?

    Obviously Yahoo won’t be equal to Google even if you sum it with BOSS but I guess it narrows the gap?

  • If I’m not mistaken, beyond the internal/external difference, the BOSS numbers are world-wide, while the Bing numbers are US-only, no? This seems like a significant effect, not everyone is in the US, remember.

  • Dear Microsoft,

    BOSS is one of best API’s on the internet for developers, and a significant competitive advantage you have over Google. Please don’t throw it out. Thanks.

  • Boss business model?

  • maybe one of these days I’ll understand what all the talk about search is? I get that its important, to find what you’re looking for, but its also important to know what you are looking for.

  • If Yahoo wants to remain relevant, long term…
    1) Ditch microsoft
    2) Ditch paid inclusion (what idiot thinks it makes sense? Microsoft won’t keep the program alive, either…Qiu Lu, I’d bet money _hates_ it…MS removed it as soon as they could, even while they used YST results).
    3) Add revenue share (a decent adsense competitor to BOSS & the Yahoo network)
    4) *lower* your ridiculous 10 cent per click cost…
    5) Start to see marketshare expand, again.

    To start with, Carol isn’t prioritizing or doing any of these things…*fire* her, and then move on with the action plan to make Yahoo a great company…OR watch as she kicks Y! where it hurts and they see their market share, position and scale crumble till there is next to nothing left (remember Iwon.com? Excite? Lycos? Yahoo’s next if they continue the way they are going…)

  • One of the things that seems to be getting lost in this discussion, is the reason BOSS is so great, namely, Yahoo’s license is extremely open and flexible, much more so than Bing or Google. People mistakenly think its the inconvenience of switching to another API format that is the issue for start ups, or the loss of particular functionality. The former is a few days work, the latter is minimal as most functionality is in alternative APIs. The real problem lies in Bing or Google’s API licenses. For example, Bing’s license is oriented around getting users to make new UIs for their search; this totally eliminates myriad back end uses.

  • Not exactly the same as BOSS, but Google offers Custom Search Engine at http://www.google.com/cse. Google is free for the foreseeable future, while the BOSS website says they will be pricing the service soon.

    • Google custom engine is free but not flexible. As Deep Dhillon mentioned above, it is about licensing. BOSS API licensing is pretty flexible. You need not arrange the results in the same order as in the output of the API. Google Custom Engine does not provide that flexibility.

      • yeah, google is just trying to keep *their* market growing with free search. you can’t make any money with google custom search, whereas BOSS is (Build your Own Search Service). You can build your own search engine on BOSS and monetize it. You can’t do that with Google’s API.

      • I just tried out google custom search. Yuck, free version has 3 ads in main column, that too at start. My charge on goog always been against main column ads with “sponsored link” written in far corner, that novice users click thinking as results.

  • The open question is monetization. Give something a comparable product away for free when your competitors charge for it and you are certainly going to grab some share.

    Yet, lets be clear without a monetization solution this is unsustainable. Until BOSS can be monetized it was pricing experiment (does free beat paid?). In talking to the guys who are really running BOSS day-in and day-out they are correctly trying to transition this into a business. With the scale of AdCenter+Panama (30% market share) they just may have the monetization capability to make this work. I hope they do.

  • Nice Information.Good News for all users.

  • Erick – check out Sphinx Search (http://www.sphinxsearch.com) It’s an amazing open source search engine powering such sites as craigslist.org … Sphinx website claims craigslist alone is making 50 million queries a day. Add to that all the other Sphinx users (many of which are not publicly announcing it).

    • Jani, Sphinx is just a search engine, same as Lucene. The scale of the search engine is not as much as the web search engine. Sphinx, Lucene, Solr are good for small set of data (say 10s of millions of rows), but not good @ web search scale (say 10s of billions of documents)

      • Yogish – Sphinx can handle billions of rows with no problem. Of course it is just the backend engine, you need to seed it with data, but it is not true to say it only works for small data sets.

        • does anyone know of an open index of the web, like BOSS? the data is the valuable part, not the engine.

        • When I use BOSS I do not have to worry about “spidering” the web…with Sphinx and Lucene I do…

          BOSS allows you to build your own search engine with NO YAHOO branding, and without the overhead of “spidering” or even maintaining the index…as far as I am aware there is no comparable offering on web.

  • Someone please expain WHY BOSS number ARE NOT included in the monthly Comscore numbers for Yahoo… I still haven’t read a single explanation on this…

    Vetinary

  • Yahoo! BOSS might be the best web service in existence today.

  • So what is the flexibility of the Bing API like in comparison to BOSS? I haven’t checked it out yet. Does it let you re-order results? I do remember reading that it was going to be free for now but eventually there would be some charge to use it based on no. of queries.

    • Steven,

      The problem with the Bing Api is that its TOU doesn’t allow monetization, except through their own ads. But they don’t easily accept you into the revenue-sharing program…

  • One thing everybody forgets is that its a search API – that is most of the queries can be automated by computer programs / bots / various tools for SEO / datamining etc. This can inflate the figures by a huge amount – I know there might be policies in a way to prevent that but from experience I know its not difficult to overcome these.

  • The question whether Yahoo Boss remains as a part of MS-Y! deal is still open. What happens to the developers who’ve built their apps on Boss and MS decides to discontinue it. Will Bing API be the only alternative left?

    • Well, the question whether MS-Y will ever happen is also still open. There are some heavy anti-trust issues here. As a side note, maybe openly stating that they would leave BOSS alive would help them strenghten their claim that competition would not suffer…

  • Indeed, Janusz is right. Last time I was presented by YahooBoss some of the projects that were relying on it, I would say that about half of them were services wehere the queries were processed in th background as a job (think stuff like matching queries results with domain name expiring to assess value…).
    Nonetheless, always got pretty impressed by YahooBOSS, as much from a tech perspective as from the approach they had to it as team: Very open minded, always willing to help and to see how they could contribute to our projects with YahooBoss.

  • too many of this things will leave the users confused, however the new development is welcome

  • The people using Bing and Yahoo are probably almost all computer illiterates who don’t know how to change their homepages. I doubt anyone can overcome Google’s quality, even with people working on BOSS.

  • This is interesting, if a bit unclear. Can someone detail a project that uses BOSS as underpinning? What were they using before BOSS came along, or was the project not possible pre-BOSS?

  • The note about Yahoo’s provided figures for BOSS probably being worldwide and the comScore figures being US only are important.

    I’ve also asked Yahoo in the past, at least twice, for a breakdown of who exactly is generating what searches on BOSS. IE — is OneRiot the big BOSS customer? Or someone else?

    Got nada. Until we get decent figures about what a BOSS “query” actually is and where it’s at, I’m not that excited. And when we get those, be interesting to see how they measure up against Google custom search queries, which comScore probably doesn’t count, either.

  • I love how little attention is paid to what kind of queries these are. Apples and Oranges, people.

  • …but it still is beyond me why the company that made VISTA goes out and try to market themselves as making decisions for people…decision engine my ass….

  • Here’s a good example of the BOSS api in use.

    http://www.linguos.com

    Linguos handles all the multilingual algorithms and uses BOSS to perform the actual search.
    Linguos enables transcription based search of non-English, non-roman languages, enabling access to the non-English web.
    Private labeling allows results to be blended into the site much better than Google or Bing APIs.

Leave Comment

Commenting Options

Enter your personal information to the left, or sign in with your Facebook account by clicking the button below.

Alternatively, you can create an avatar that will appear whenever you leave a comment on a Gravatar-enabled blog.

Trackback URL
Short URL
bugbugbug
Techcrunch on Facebook