Image Recognition Problem Finally Solved: Let’s Pay People To Tag Photos
by Michael Arrington on March 29, 2008

Most people have thousands of digital photos sitting on their hard drive. And the vast majority of those photos aren’t tagged or searchable. Want to find the 300 pictures of your youngest son amongst 10,000 others? It’s not going to happen. Unless you’ve been diligently tagging and categorizing those photos over the years, and who does that?

The problem is obvious. The solution, not so much. A trail of failed startups have tried to tackle the problem with a fairly serious application of technology, including: Riya (now focused on ecommerce via Like.com), Ookles (never launched), and Polar Rose (in private beta for nearly a year), among others.

And now suddenly TagCow appears, which allows users to upload photos and have them tagged within a few minutes. The technology appears to be “magic,” meaning there’s no explanation of it.

If there’s a mountain in the photo, it’s tagged. A dog? yep. A yellow cup? Absolutely. It does people, too. Upload an image of a person and say who it is, and all other images you upload will be tagged with that person, too. The service also integrates with Flickr and will auto tag the photos you have on the service.

Thomas Hawk, the CEO of photo site Zooomr, tried the service and declared it “really, really cool,” although he wonders how it works.

The answer is, humans do it. I note that the TagCow site is careful not to say anything about the tagging process, and never use the word “automated” or anything else that would suggests computers are doing the work. Munjal Shah, the founder of Riya/Like, agreed, noting that it recognized a witch in Thomas’ photo - he says this just isn’t something a computer can do today.

I haven’t confirmed this yet. I’ve emailed the company for a description of how the service works but have yet to hear back. Until we do, I’m betting that humans are the taggers. Note that Google has effectively thrown in the towel and uses humans for this kind of work, too.

TagCow appears to be offering the service for free, so the cost side of the business may be a problem for them down the road. And the business is definitely a little sketchy. Worried about the privacy of your data? Just don’t click on their Privacy Policy or Terms of Use: “Privacy policy is TBD.” and “Legal stuff TBD.” Not exactly a way to build confidence.

Responses (Trackback URL)

Comments

Comments Pages: [1] 2 » Show All

 

Probably not automated. They probably have a couple people doing it, but they will probably eventually crack under load.

Interested to hear what they say, if anything.

 

I can imagine this happening… but because of their “Privacy Policy” and “Terms of Service” … nope. Lack of business model, too. Hmm…

 

I saw Hawk’s mention of it on FriendFeed and went and had a look. Much like you I to found the whole thing rather suspicious. And the TBO for privacy .. should be a warning flag .. run .. run very fast in the opposite direction.

 

Sweet, Jeffery Bennett’s “Image Search for the Blind” from SxSW finally got funding

 
 

Sounds like they are using Amazon’s Mechanical Turk

 

Besides the obvious interest, who cares whether it’s automated or done by humans? If it is done by people, then it’s basically the mechanical turk, a business model that has had a modicum of success elsewhere.

For the average snapper, tagging is something you have to do yourself. If you get somebody/something else to tag your photos how will it know when/where/why a photo was taken, or who is in the photo? It might recognise famous faces and places, but what about a generic flower, or my baby nephew?

Professional photographers might pay since more tags leads to more leads. I have no idea whether photographers currently do this themselves for sites like iStockPhoto, but if not, there is a potential business model in there.

 

Michael: I work in the image recognition and processing field (have been for a decade) and your answer is correct: the tagging is definitely not an automated process; it is manual. Total BS given the tag line “automatically tag thousands of photos”; perhaps “automatically” to them means “tagged by somebody else while you are not looking”.

 

Blowski - it’s only an important distinction if you see a difference between, say, a secretary taking dictation v. a word processor on a computer.

 

what I don’t get is how they can tell who ‘dad’ or ‘emma’ are in the pictures I’d upload for tagging. For generic stuff (car, mountain, people, sunset, etc) this seems pretty straightforward, and yeah, they’re likely using mechanical turk or something like it.

 

The *automagical* tagging technology ‘behind’ the http://www.Tagcow.com site is definitely not clear. Ok, so you think “humans do it.” Now, I am picturing this… there’s like this room of terribly sweaty, overworked ‘interns’ cranking out tags to match photos?! are you serious?

If that’s the case, then wow, that’s definitely a ridiculous business plan; not viable.

Don’t even get me started on the privacy policy “TBD” bit.

 

“tagged by somebody else while you are not looking” -#9

That’s probably exactly why they don’t have a privacy policy up yet.

 

@Michael: I would have compared it to a secretary taking dictation and voice-recognition software. You go with whichever gives you the best letter.

 

This is interesting. The complexities of solving this problem from a technical standpoint are computationally difficult to say the least.

I think the better question here is, “Who Cares”? If its done by a human, its going to be more accurate….

So if it works, and they’ll tag your photos for free, and it’s accurate — who cares?

 

This business model was crushed by Luis von Ahn years ago.

 

“I’ve emailed the company for a description of how the service…”

Looks like they want to keep it a secret. If the service is a success you will see numerous oDesk job posting for “image taggers”

 

Personally I’m a little uncomfortable with humans looking at all my photos.

Imagine if Gmail ads were added by humans reading your emails.

 

Why can’t it be a combination?
80% automatic 20% human validation. The better it gets the more automatic it gets. Unlike riya it’s a progresive approach that learns from training.

 

Its not the business model that is the problem. Its viable as they can get it done at low cost destinations and the work can be done with little or no educational qualifications. Its the privacy issues that come to the fore.

 
 

watch like they want to keep it a secret. If the service is a success you will see numerous

 

It sounds like they’re ripping off Luis Von Ahn’s research:

http://www.cs.cmu.edu/~biglou/

 
 

If there is a human factor to the matching process, you should have picked a better topic :) “Hey we got this figured out! Not!” could have worked.

Also what are the privacy implications of this type of service?

 

Sounds like a good idea in theory, but other than the obvious question marks around revenue model, operation, and everything else, there’s one more teensy weensy issue. How many people tag their photos by ‘mountain’, ‘yellow’ cup, ie. random description? I rarely tag my photos due to sheer laziness, but am I the only crazy person who does it by name of the people in the picture, event, date, location, ie. relevant details? How can any system, automated or monkey-operated do that?

 

Umm, this wouldn’t have anything to do with a certain date fast approaching would it?

You know, the date that makes people look like fools ;-)

 

Funny post, especially when you finish watch “Deja Vu” with Denzel Washington 10 minutes prior to reading the post!

 

Thanks Mike, but an endorsement from a person who copies wholeseale another site’s code (*hello Thomas Hawk* *hello Zoooommerrr*) is hardly to be trusted.

(unless of course they’re sending coke&ho’s your way, in which case I understand)

 

Looks like they upgraded the tagline with the word “Automatically”

‘Automatically tag thousands of photos and make your photo library yours again’

 
 

Manually? No way. This technology doesn’t seem to be new. I have been using oculr.com for a while.

 

Too true to be believed at the moment if they are doing it automated way…

 

@ 23: Adam:

Thanks for the link, amazing video on Human Computation here : http://video.google.com/videop.....0976635143

Its the google techtalk from Luis Von Ahn … I started to watch it for a few minutes but could not stop until the end ^^

 

It is practically impossible to get the tagging done by humans. I mean how many people can they employ? Hence my sense is it must be a combination of technology and human tagging. I mean the technology can tag 80% of the images and the humans can correct the rest 20%.

I believe that is a good way strategy to enter this market. Enter the market, create a presence, then keep investing in the technology to a point where it becomes 95% or so correct. I think the human intervention shall be required for quite sometime to come.

End of the day, as an end user I do not care how they do it. I just want to get my images tagged properly.

 

In corner A: letting TagCow hire random people to tag my *public* Flickr photos.

In corner B: AOL, MSN, and Yahoo hand over their logs to the U.S. federal government when requested, and not even as part of a criminal investigation, and without telling their users.

And I’m supposed to be concerned about TagCow? WTF?

 

#36, u have a point. i guess most ppl willnot be worried abt privacy. just look at how they put everything public in flickr,social n/w etc atleast they have some market.

 

We’ve seen loads of ridiculous ideas turn into serious business on the WWW, but I would not bet on this one. Google Image Labeler is using human labeling based on Luis Von Ahn research and project, which is based on making use of people fun time. Paying for labeling is a nasty costly business you can’t afford it. All the time and money it takes to keep up with people uploading their some junk photos.

 

This human-tagging process reminds me of Yahoo whose directory of URL is maintained manually by human. User suggests interesting links to Yahoo which they then add it to portfolio.

However, human-tagging may not be efficient if the system gain critical mass. It will have to hire more human to tag the content, and that some content may just be missed due to human’s oversight. Moreover, it might be scalable.

 

Also what are the privacy implications of this type of service?

At least in my case I wasn’t really worried about their privacy policy. 100% of my Flickr photos are already public photos. Anyone and everyone in the whole world already has access to these photos by my choice to make them public images. Certainly someone with *private* photos might not want to turn them over to tagcow unless they felt comfortable with their privacy policy.

Personally I was pretty impressed with the job that they did tagging my photos. They added thousands of highly descriptive tags. This means that more of my photos will likely show up in image searches both on Flickr as well as on Yahoo by extension as the photos are indexed for theses new tags.

Personally I think that tagcow uses a combination of AI and human resources. I noticed what felt like text recognition on a lot of my photos that had text in them. But they also tagged many of my photos using highly complex conceptual tags that I don’t believe a machine could come up with.

I believe that there possibly could be clustering of sorts going on as well but am not sure on this. For instance. If a photo is tagged witch, might you not automatically tag a photo with this tag also with “wicked.” Similarly, if a photo was tagged sunrise, might you also autotag the photo morning, etc.

Certainly human tagging might not be economically feasible. Certainly it might not also scale. But, at least for the time being, it was pretty cool to have someone/somehow spend hours and hours making my photos on Flickr more descriptive at no cost to me. And whatever the case, one of the cooler things I’ve ever seen done on a personal level with the Flickr API.

 
 

this is like mechanical turk over at amazon isnt it?

 

“Note that Google has effectively thrown in the towel and uses humans for this kind of work, too.”

I very much doubt that. If anything, they could easily use the information they get to train an algorithm. I’m sure the image labeler is a small piece that will help to feed some kind of automated recognition that could actually scale.

 

Google hires 1,000’s of (contract) workers across the globe to classify and QA their web results.

QED.

 

“…Due to unexpected load (during our beta), tagging of photos will be delayed…”
Now that clears things up :-)

 

will they integrate with picasa ?

 

Its probably being done through Amazon’s Mechanical Turk.

http://www.mturk.com

 

It’s “powered by” Munio Memory services which sells image insurance

“Memory Insurance protects all your photos, children’s art, documents and journals.”

So the free tagging service is most likely an attempt to attract customers for the paid service.

 

We will have to wait for the official word. Answers should have been before the launch of the site, without mentioning that privacy conditions are not clear either.

 

I’m getting:

503 Service Temporarily Unavailable

The server is temporarily unable to service your request due to maintenance downtime or capacity problems. Please try again later.

Are they experiencing the techcrunch-effect already? ;)

 

Comments Pages: [1] 2 » Show All

Leave a Reply

Create a Gravatar for your comments.
« Back to text comment