August 6, 2006

AOL Proudly Releases Massive Amounts of Private Data

Michael Arrington

467 comments »

Yet Another Update: AOL: “This was a screw up”

Further Update: Sometime after 7 pm the download link went down as well, but there is at least one mirror site. AOL is in damage control mode - the fact that they took the data down shows that someone there had the sense to realize how destructive this was, but it is also an admission of wrongdoing of sorts. Either way, the data is now out there for anyone that wants to use (or abuse) it.

Update: Sometime around 7 pm PST on Sunday, the AOL site referred to below was taken down. The direct link to the data is still live. A cached copy of the page is here.

AOL must have missed the uproar over the DOJ’s demand for “anonymized” search data last year that caused all sorts of pain for Microsoft and Google. That’s the only way to explain their release of data that includes 20 million web queries from 650,000 AOL users.

The data includes all searches from those users for a three month period this year, as well as whether they clicked on a result, what that result was and where it appeared on the result page. It’s a 439 MB compressed download, expanded to just over 2 gigs. The data is available here (this link is directly to the file) and the output is in ten text files, tab delineated.

The utter stupidity of this is staggering. AOL has released very private data about its users without their permission. While the AOL username has been changed to a random ID number, the abilitiy to analyze all searches by a single user will often lead people to easily determine who the user is, and what they are up to. The data includes personal names, addresses, social security numbers and everything else someone might type into a search box.

The most serious problem is the fact that many people often search on their own name, or those of their friends and family, to see what information is available about them on the net. Combine these ego searches with porn queries and you have a serious embarrassment. Combine them with “buy ecstasy” and you have evidence of a crime. Combine it with an address, social security number, etc., and you have an identity theft waiting to happen. The possibilities are endless.

Marketers are going nuts over the possibilities, users are calling for a boycott of AOL, and others are just enraged:

User 491577 searches for “florida cna pca lakeland tampa”, “emt school training florida”, “low calorie meals”, “infant seat”, and “fisher price roller blades”. Among user 39509’s hundreds of searches are: “ford 352″, “oklahoma disciplined pastors”, “oklahoma disciplined doctors”, “home loans”, and some other personally identifying and illegal stuff I’m going to leave out of here. Among user 545605’s searches are “shore hills park mays landing nj”, “frank william sindoni md”, “ceramic ashtrays”, “transfer money to china”, and “capital gains on sale of house”. Compared to some of the data, these examples are on the safe side. I’m leaving out the worst of it - searches for names of specific people, addresses, telephone numbers, illegal drugs, and more. There is no question that law enforcement, employers, or friends could figure out who some of these people are.

There is some really scary stuff in this data.

I am assuming that AOL will take this page and the data down soon, but as of the time of this post it has been downloaded 809 times already. People I’ve spoken with are already building a web interface to the data. If you are an AOL customer, I feel sorry for you.

Note that Microsoft has proposed releasing similar data to researchers, although with an important difference - the data is not associated with a user. Excite released data very similar to what AOL has done here, with user associations, in 1999.

AOL is hitting bottom when it comes to brand image. This story comes on the heels of the recorded phone call with customer service disaster as well as a just-in story about a woman who is unable to cancel her deceased father’s AOL account, nine months after his death.

  • Sphere It

Trackbacks/Pings (Trackback URL)


  1. Pacificdave Blog » Shame, Shame on AOL
  2. Seriously, fire AOL. RIGHT NOW! | Poor Michael’s Almanac
  3. AccMan Pro / AOLs monumental moment of madness
  4. GregSadetsky.com » Blog Archive » AOL data: out there, for posterity
  5. AOL Releases Search Data, Pity the Fools · Island of Doctor Death
  6. :: Political Musings :: » Privacy Concerns
  7. Blog The Internet » Blog Archive » Do You Want To Know What AOL Users Searched For?
  8. Idiot Crisis » Blog Archive » AOL gives it all away for free, including customer data…
  9. The TrustedID Blog » Blog Archive » MASSIVE AOL Data Breach of a Different Sort!
  10. Shawn Christopher » Not one week
  11. Web 2.5 : The Always-On-You Web
  12. Anonymous
  13. Spider Tactics » AOL publishes 3 mos. search data of 500,000 users
  14. TechCrunch Japanese アーカイブ » AOL Proudly Releases Massive Amounts of Private Data
  15. Beta Alfa 2.0 » AOL gör loggfiler för webbsökningar tillgängliga
  16. AOL Exposes 650,000 Users’ Search Activities - numbrX Security Beat
  17. Stupidity of AOL at Cadeo’s Chaotic Pondering
  18. AOL intentionally releases user data - Matthew Gifford
  19. Dan Appleman: Kibitzing and Commentary » Blog Archive » Stunning Privacy Breach by AOL
  20. AOL Gate: Search Query Data Scandal by Elliott Back
  21. gHacks tech news
  22. The Paradigm Shift
  23. Loud Opinions | Blog » Blog Archive » AOL Releases Search Logs from 500,000 Users
  24. o [cc] do [caiocesar] na [www] » grave!
  25. Cadeautje van AOL aan spammers | KennethVerburg.nl - Information Engineer in het Wild
  26. Will Video for Food » AOL “Proudly Releases Massive Amounts of Private Data”
  27. Living In The Past » Is there a Law suit coming to AOL? I think I smell one.
  28. Suchmaschinen-Tricks » AOL veröffentlicht Nutzerdaten
  29. Greg Yardley's Internet Blog
  30. Greg Yardley’s Internet Blog » You never had privacy anyway
  31. brosinski.com/stephan » Blog Archive » AOL search logs: Scary datamining for the masses
  32. AOL präsentiert: Die dümmsten Momente in der Geschichte des Internets … live! at pl0g.de
  33. Will Video for Food » America Online Spoof Video: “AOL Privacy Cam”
  34. Search Engine Optimization For Dummies » AOL releases privacy data - 20M search queries
  35. Morph3ous’s Weblog » Blog Archive » Techcrunch » Blog Archive » AOL Proudly Releases Massive Amounts of Private Data
  36. Starked SF » Blog Archive » Talk of the town: Monday, August 7
  37. IA Inside the Beltway
  38. bananas on toast » Techcrunch » Blog Archive » AOL Proudly Releases Massive Amounts of Private Data
  39. Ryan. Connect. » More AOHell
  40. Musings of the Great Eric » Blog Archive » AOL = Evil
  41. :Ben Metcalfe Blog » Blog Archive » AOL releases search data on 500k users… and then tries to take it back
  42. michaelzimmer.org » Archives » AOL Proudly Releases Massive Amounts of Private Data
  43. Jimmy Daniels » AOL Releases Searchs From 500,000 Users
  44. Know It Or Blow It » Blog Archive » AOL Search Data
  45. Blogger Skills » AOL Search Data Leaked!
  46. AOL Proudly Releases Massive Amounts of Private Data at innerangst.net
  47. LOHAD - random rumblings on marketing and more » Blog Archive » AOHell, Indeed
  48. Tech Recipes Blog
  49. ansemond.com
  50. AOL’s Search Data Has Eerie Content - CyberNet News: Hardware, Downloads, Gadgets...Technology Done Right!
  51. wangarific » Blog Archive » AOL Does What Google Smartly Avoided, Releases Search Data
  52. John Ottesen
  53. AOL releases 20,000,000 or so search queries by around 500,000 of its users at Aral Balkan
  54. techborg » Technology » AOL Goofup leades to Google Highest Keywords Leak?!
  55. Domain of Slack » Descending further into AOheLL
  56. a thaumaturgical compendium » Blog Archive » AOL Data
  57. TimSaler.com » Blogging on Politics, Technology, and Sports
  58. AOL Screws Up: Releases Customer Search Histories Including ‘How to Kill Your Wife’
  59. Seattle’s Rain City Real Estate Guide » Real estate search patterns and AOL users
  60. Brutusweb » Still with AOL
  61. unitstep.net
  62. Twilight in the Valley of the Nerds » AOL Exposes Personal Search Data, Shoots Itself in Process
  63. T. Longren » AOL Releases Private Data
  64. TechEffect
  65. Randy Jensen Online Blog | randyjensenonline.com/blog
  66. A Welsh View
  67. ha.ckers.org web application security lab - Archive » AOL Releases Public Information
  68. AOL proudly releases personal data - FireBlades.org
  69. I found Calacanis’ Social Security Number in the AOL Logs at The Blog Herald
  70. AOL releases data on 650k users at The TPS Report - by Kiyoshi Martinez
  71. little giselle’s pretty pink pavilion » Blog Archive » With the new AOL, privacy’s a dream
  72. twodotfive
  73. Techcrunch » Blog Archive » AOL: “This was a screw up”
  74. Smetty’s Soapbox » AOL: de grootste search en privacy blunder ooit
  75. Yeah, About That… » Blog Archive » Careful What You Search For
  76. Ubertor Real Estate Blog » Real Estate Carnival
  77. Nosebleed’s Blog: We Do What We’re Told » Blog Archive » AOL Screws Up
  78. Blog SEO
  79. Left Wing = Hate » Blog Archive » Back from Vacation and everything went to heck in a handbasket
  80. Caribbean Business » Blog Archive » “You’ve Got…” a huge privacy issue in your hand, AOL
  81. that would explain Bob… » Aol-MAZING ! AOL Jumps the Shark ! AGAIN !
  82. A Blog Node » Blog Archive » AOHell is Very, Very Sorry…
  83. nonsmokingarea.com » Blog Archive » AOL releases user-search-logs to public
  84. James’ Blog » Blog Archive » AOL releases sensitive data
  85. Gogelmogel
  86. BenEskew.com » AOL Releases Private Data :: 20 Million Search Queries!
  87. Teklow Enterprises » The AOL thing
  88. the 60 billion $$ man » AOL Releases Search Logs of 657,427 Users
  89. AOL Sucks. - Ramblings of the Mildly Insane
  90. Myfavstuff.com
  91. Insider blogging: the great AOL search caper - stocks blog
  92. danbruno.net » AOL oopsie.
  93. “Zero influence!” » Blog Archive » links for 2006-08-08
  94. A Tableau of Crimes & Misfortunes » AOHell screws its customers again with massive release of private data
  95. Be Careful What You Search For - YellowHouseHosting
  96. Mind Mob » Do ethics apply to great data?
  97. Im Not A Doctor - but i do know something » Aol is up the creek with out a paddle!
  98. + Digitalna oznanila » Blog Archive » aol razgalil svoje uporabnike
  99. DinkumInteractive.com » AOL is free now - so is their data…woops: Small Business Search Engine Marketing Philadelphia
  100. ArtieFishill Thoughts » AOL Purposely discloses users search habits
  101. Rohan Pinto’s blog @ http://localhost
  102. Biting a wax tadpole » Blog Archives » Huge AOL security leak
  103. Ramblings of a 21st Century Digital Boy » All Your Searches Are Belong to Us
  104. Stan’s List » Blog Archive » AOL Releases Search Logs of 657,427 Users
  105. TechCrunch en français » AOL: “Nous avons commis une grosse erreur”
  106. AOL compromises search privacy | Why We Worry
  107. Techcrunch » Blog Archive » AOL Data: First Web Interfaces Up
  108. Twisted Logic - » More AOHell..
  109. Online Keywords Blog » Blog Archive » Free keyword research data from AOL
  110. joshshill.com - news from the tech world! » AOL’s next bad move
  111. PlaneBuzz
  112. Alleluia AOL! -- Macalua.com
  113. Online Keywords Blog » Blog Archive » Web interfaces to AOL search data
  114. Sandis Viksna » Blog Archive » AOL goes nuts?!