<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>TechCrunch &#187; lingoz</title>
	<atom:link href="http://www.techcrunch.com/tag/lingoz/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.techcrunch.com</link>
	<description>Startup and Technology News</description>
	<lastBuildDate>Fri, 27 Nov 2009 02:17:31 +0000</lastBuildDate>
	
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='www.techcrunch.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>TheRarestWords: Intriguing Semantic SEO Project from Russia</title>
		<link>http://www.techcrunch.com/2008/05/23/therarestwords-intriguing-semantic-seo-project-from-russia/</link>
		<comments>http://www.techcrunch.com/2008/05/23/therarestwords-intriguing-semantic-seo-project-from-russia/#comments</comments>
		<pubDate>Fri, 23 May 2008 17:41:39 +0000</pubDate>
		<dc:creator>Erick Schonfeld</dc:creator>
				<category><![CDATA[Web 2.0 News & Ideas]]></category>
		<category><![CDATA[lingoz]]></category>
		<category><![CDATA[Therarestwords]]></category>
		<category><![CDATA[Wiktionary]]></category>

		<guid isPermaLink="false">http://www.techcrunch.com/2008/05/23/therarestwords-intriguing-semantic-seo-project-from-russia/</guid>
		<description><![CDATA[
A mysterious yet intriguing project from Russia has come across our inbox.  It is a search-engine optimization analysis tool for Websites called TheRarestWords.  For any given URL, like Microsoft&#8217;s or Techcrunch&#8217;s, it shows you the rarest keywords on the homepage (i.e., the ones most likely to give your site some search-engine juice), other [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://therarestwords.com/"><img class="shot2" src='http://cache0.techcrunch.com/wp-content/rarest-words-logo.png' alt='rarest-words-logo.png' /></a></p>
<p>A mysterious yet intriguing project from Russia has come across our inbox.  It is a search-engine optimization analysis tool for Websites called <a href="http://therarestwords.com/">TheRarestWords</a>.  For any given URL, like <a href="going to ">Microsoft&#8217;s</a> or <a href="http://therarestwords.com/techcrunch.com">Techcrunch&#8217;s</a>, it shows you the rarest keywords on the homepage (i.e., the ones most likely to give your site some search-engine juice), other sites with related keywords, and a list of categories the site would fit under based on those keywords.  For Microsoft, some the rare keywords it identifies are &#8220;silverlight,&#8221; &#8220;biztalk,&#8221; &#8220;onecare,&#8221; &#8220;skydrive, &#8220;popfly,&#8221; &#8220;ballmer,&#8221; and &#8220;ozzie.&#8221;  You can try your site by going to http://therarestwords.com/YOURSITE.com.</p>
<p><img class="shot" src='http://cache0.techcrunch.com/wp-content/rarestwords-1.png' alt='rarestwords-1.png' />TheRarestWords then tries to tap into crowd intelligence by letting anyone add a 100-character definition for each keyword, which could give it a semantic edge in trying to categorize each site.  This could also be gamed pretty easily, but this looks to be just a Web project at this point.  It could also be used to create a Wiki dictionary like <a href="http://wiktionary.org/">Lingoz</a> or <a href="http://wiktionary.org/">Wiktionary</a>, but that does not seem to be the focus of the project. </p>
<p>The developer is a mysterious Russian who does not want to give out his name.  You can find more info on his blog and on this <a href="http://forums.site-reference.com/topic/54804/#p54804">forum post</a>.  Mircea Goia from <a href="http://www.mytestbox.com">MyTestBox</a> dug into it for us and reports:</p>
<blockquote><p><em>The author and the sole founder – who is from Russia and wants to have a low profile for now &#8211; says it is just a hobby that was started in December 2007 and he calls it a “linguistic experiment”. </p>
<p>Their spider (called TheRarestParser/0.2a) started scouting the internet in May and extracted words from many websites. It looked at which one are used most often on those websites and which ones are rarely used, or not at all.  For now it extracts only the words from the first page of a domain. It doesn’t go deeper than that, however the spider managed to index 20 million words from many domains. </p>
<p>The author wants to implement new options like:</p>
<p>    * Trend spotting (which of the words are gaining popularity &#8211; like “django” is becoming more popular, “python” is still strong, and which are losing it like “perl”)<br />
    * Help with SEO for mom-and-dad kinds of business sites (it could be useful from this stand point, the author says)<br />
    * Auto-categorization of your sites against a big list of categories (actually, at this time it has already been implemented, but the algorithm still needs to be perfected)</em></p></blockquote>
<p>The interface is confusing the first time you go there, but there is some interesting data you can pull from it.  For instance, you can have an <a href="http://rarestblog.com/2008/05/new-feature-fight-your-site/">SEO fight</a> between any two sites by typing in the address: http://therarestwords.com/vs/your-site.com/competitors-site.com.  This feature shows which rare words your site has that your competitor doesn&#8217;t and vice versa.  </p>
<p>For example, here&#8217;s <a href="http://therarestwords.com/vs/techcrunch.com/gigaom.com">TechCrunch Vs. GigaOm</a>.  This is only a snapshot of what is on each frontpage, but we are more likely to get search traffic right now for terms like &#8220;friendfeed,&#8221; &#8220;gamestop,&#8221; and &#8220;blogosphere.&#8221;  While they are kicking our butts on &#8220;qualcomm,&#8221; &#8220;powerset,&#8221; and &#8220;sarcasm.&#8221;  (At least that was the case before I put up this post.  I really can&#8217;t let Om beat us on sarcasm).  </p>
<p><a href='http://www.techcrunch.com/wp-content/rarestwords-tc-vs-gigaom-big.png' title='rarestwords-tc-vs-gigaom-big.png'><img src='http://cache0.techcrunch.com/wp-content/rarest-words-tc-vs-gigaom.png' alt='rarest-words-tc-vs-gigaom.png' /></a></p>
<p><a href='http://www.techcrunch.com/wp-content/rarestwords-msft.png' title='rarestwords-msft.png'><img src='http://cache0.techcrunch.com/wp-content/rarestwords-msft-small.png' alt='rarestwords-msft-small.png' /></a></p>
<div class="cbw snap_nopreview">
<div class="cbw_header"><script src="http://www.crunchbase.com/javascripts/widget.js" type="text/javascript"></script>
<div class="cbw_header_text"><a href="http://www.crunchbase.com/">CrunchBase Information</a></div>
</div>
<div class="cbw_content">
<div class="cbw_subheader"><a href="http://www.crunchbase.com/company/lingoz">LingoZ</a></div>
<div class="cbw_subcontent"><script src="http://www.crunchbase.com/cbw/company/lingoz.js" type="text/javascript"></script></div>
<div class="cbw_subheader"><a href="http://www.crunchbase.com/product/wiktionary">Wiktionary</a></div>
<div class="cbw_subcontent"><script src="http://www.crunchbase.com/cbw/product/wiktionary.js" type="text/javascript"></script></div>
<div class="cbw_footer">Information provided by <a href="http://www.crunchbase.com/">CrunchBase</a></div>
</div>
</div>
<p><strong><em>Crunch Network</em></strong>:  <a href="http://www.mobilecrunch.com/">MobileCrunch</a><em> </em>Mobile Gadgets and Applications, Delivered Daily.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.techcrunch.com/2008/05/23/therarestwords-intriguing-semantic-seo-project-from-russia/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>Lingoz: Wiktionary Done Right?</title>
		<link>http://www.techcrunch.com/2007/10/02/lingoz-wiktionary-done-right/</link>
		<comments>http://www.techcrunch.com/2007/10/02/lingoz-wiktionary-done-right/#comments</comments>
		<pubDate>Tue, 02 Oct 2007 19:10:00 +0000</pubDate>
		<dc:creator>Roi Carthy</dc:creator>
				<category><![CDATA[Company & Product Profiles]]></category>
		<category><![CDATA[Web 2.0 News & Ideas]]></category>
		<category><![CDATA[lingoz]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://www.techcrunch.com/2007/10/02/lingoz-wiktionary-done-right/</guid>
		<description><![CDATA[Can a user-defined dictionary be done better than Wikipedia&#8217;s Wiktionary? Babylon, a maker of popular for-pay translation/dictionary desktop software, certainly thinks so, and they are launching Lingoz to prove it.
Lingoz is a collaborative, online dictionary where users are encouraged to participate by contributing terms and definitions, as well as by voting, commenting and aggregating words [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://lingoz.com"><img src="http://cache0.techcrunch.com/wp-content/lingoz_logo.png" class="shot" style="float: left" alt="lingoz_logo.png" /></a>Can a user-defined dictionary be done better than Wikipedia&#8217;s <a href="http://wiktionary.com">Wiktionary</a>? Babylon, a maker of popular for-pay translation/dictionary desktop software, certainly thinks so, and they are launching <a href="http://www.crunchbase.com/company/lingoz">Lingoz</a> to prove it.</p>
<p>Lingoz is a collaborative, online dictionary where users are encouraged to participate by contributing terms and definitions, as well as by voting, commenting and aggregating words into helpful glossaries.</p>
<p>Considered a modest Israeli success story, <a href="http://www.babylon.com/">Babylon</a> has been around since 1997 and has sold 1.6 million licenses in over 160 countries. As the company&#8217;s first pure Web play, Lingoz is being kicked-off with a substantial base of 4.5M terms in 8 languages, leveraging the vast 9M definition database Babylon has amassed over its 10 years of operation.  An additional 42 languages will be rolled-out in the coming months.</p>
<p>Back to Wiktionary for a moment.  The editorial back-and-forth process that works so well for encyclopedic entries on Wikipedia seems less successful when applied to defining dictionary terms, a process more suited towards voting on multiple versions of a definition.</p>
<p>Cognizant of Wiktionary&#8217;s shortcomings, Lingoz is being launched with a sensible set of social/UGC features: Terms can be submitted or requested. Voting on content quality is performed with a simple thumbs-up/down. Users can also define brand-new glossaries themselves, or request ones to be created. Glossaries may prove quite sticky as there are virtually an infinite number of potential themes that can be built out (think Web 2.0 terms, 60&#8217;s Hollywood actresses, etc—although a good starting point might be an actual definition for Web 2.0, which does not yet exist on the site.</p>
<p>The main competition Lingoz faces is from <a href="http://www.answers.com/">Answers.com</a>—ironically, another Israeli company. Answers.com doesn&#8217;t embrace UGC yet.  If Lingoz can become the Wikipedia of online dictionaries, perhaps one day it will give Answers.com a run for its money.  That would especially be true if Lingoz could attract substantial Google traffic.  As Google&#8217;s default &#8220;definition&#8221; provider, Answers.com is especially vulnerable to any changes in referrals from Google.  (For instance, a recent Google search algorithm tweak reduced their traffic by 28%). How do you define <a href="http://www.lingoz.com/en/dictionary/opportunity">opportunity</a>?
<p><strong><em>Crunch Network</em></strong>:  <a href="http://www.crunchbase.com">CrunchBase</a><em> </em>the free database of technology companies, people, and investors</p>
]]></content:encoded>
			<wfw:commentRss>http://www.techcrunch.com/2007/10/02/lingoz-wiktionary-done-right/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
	</channel>
</rss>
