December 5, 2006

Swivel Aims To Become The Internet Archive For Data

Michael Arrington

143 comments »

Swivel Co-founders Dmitry Dimov and Brian Mulloy start off by describing their company as “YouTube for Data.” That’s a good start for someone trying to understand it, because the site allows users to upload data - any data - and display it to other users visually. The number of page views your website generates. Or a stock price over time. Weather data. Commodity prices. The number of Bald Eagles in Washington state. Whatever. Uploaded data can be rated, commented and bookmared by other users, helping to sort the interesting (and accurate) wheat from the chaff. And graphs of data can be embedded into websites. So it is in fact a bit like a YouTube for Data.

But then the real fun begins. You and other users can then compare that data to other data sets to find possible correlation (or lack thereof). Compare gas prices to presidential approval ratings or UFO sightings to iPod sales. Track your page views against weather reports in Silicon Valley. See if something interesting occurs.

And better yet, Swivel will be automatically comparing your data to other data sets in the background, suggesting possible correlations to you that you may never have noticed.

Academic types are going to go nuts over this. I spent a summer in college running regression analysis models on economic data. Being able to simply upload data to Swivel and then begin to slice and dice the data would have saved a lot of time. And being able to compare our data to what others were doing in related fields could have yielded results that we would never have aimed for. Big companies, small companies, thinktanks and non-classified government organizations are going to be similarly dazzled.

Swivel is putting significant computing power behind the scenes to run the data analysis. “We use farms of powerful computers and algorithms at the Swivel data centers to transform a lonely grid of numbers and letters into hundreds - sometimes thousands - of graphs that can be explored and compared with any other public data in Swivel.”

Not all data will be public. The companies business model is to provide the service for free for public data, and charge a fee for data that is kept private. Private data can still be compared by the owner to public data sets.

Look for Swivel to launch later this week after a year of quiet development. The company is based in San Francisco and is part of Minor Ventures.

Exclusive screen shots below:




  • Sphere It

Trackbacks/Pings (Trackback URL)

  1. The Third Bit » Blog Archive » YouTube for Data
  2. Beta Alfa 2.0 » Ett YouTube för data
  3. Swivel « Technically Speaking
  4. Data Mining: Text Mining, Visualization and Social Media
  5. Share your data? « What’s IT all about?
  6. SassaFrassin.com »
  7. saraewood.com » Startups, the laws of inertia and data is fun!
  8. Statistical Modeling, Causal Inference, and Social Science
  9. Non-Profit Tech Blog » Youtube for DATA = nonprofit transparency and accountability?
  10. TechCrunch Japanese アーカイブ » Swivel、データのためのインターネットのアーカイブを目指す
  11. entreprise_et_blog
  12. IndianPad
  13. Anonymous
  14. hans.gerwitz» Blog Archive » Social visualization
  15. Nodalities
  16. Swivel "YouTube for Data" « Rohit Aggarwal
  17. Techy Castle
  18. » Links for 06-12-2006 » Velcro City Tourist Board » Blog Archive
  19. » Links for 06-12-2006 » Velcro City Tourist Board » Blog Archive
  20. » This is an Intervention! A Rehabilitation Program for Business Intelligence - Juice Analytics
  21. links for 2006-12-06 · Lawsy.net
  22. links for 2006-12-06 | blog.forret.com
  23. benstraw.com » links for 2006-12-06
  24. Swivel Preview now available for ‘YouTube for data’ » Make You Go Hmm
  25. world according to ack ~ adam karas » Swivel
  26. Prime News Blog » Blog Archive » Swivel To Become The Internet Archive For Data
  27. Not Bad For a Cubicle » Archives » Lies, damn lies, and statistics
  28. mathoda.com » Nicely played, Internet, nicely played
  29. Nuudelisoppa » Swivel Aims To Become The YouTube For Data
  30. Unicast » From the aggregator… December 2006 edition
  31. Swivel To Become The Internet Archive For Data « Universe_JDJ’s News Blog
  32. TechCrunch en français
  33. Swivel - Innovative Data Monopoly at stepheno.net
  34. Web 2.0 Sammelalbum - Web2Null - Swivel
  35. Swivel « The Petri Dish
  36. Netemic | Understand the Internet » Blog Archive » Swivel on the data web
  37. David A. Windham - Demiurgic Design - David Windham
  38. Mark Rittman » Swivel - Social Sharing of Data
  39. Bionic Teaching » Swivel- YouTube for Data (well as close as data’s likely to get anyway)
  40. Swivel Set My Data Free at punctuative!
  41. chriskelley.org » Blog Archive » Data exchange
  42. Fred’s Journal » Archive » Competition, It’s Really All About Execution
  43. J-blawg » Data churning and display
  44. Swivel Adopts CC Licensing - Creative Commons
  45. Swivel: The Toast Of The OECD
  46. Jarrett House North » Blog Archive » Communitize your data
  47. Jarretthousenorth.com » Blog Archive » Communitize your data

Comments

RSS feed for comments on this post.

  1. SearcH EngineS WeB

    This will also be the the Internet Archive for Data (rather than just YouTube)

    it will be a valuable resources for future generations -seeking snapshots of our cultural trends :-D

  2. Michael Arrington

    you’re right. I’m changing the post and title.

  3. Andrew Ritchie

    I think the idea is brilliant. If the execution is as good as I’m hoping, this could really lead to some phenomenal analysis. Stock forecasting, climate change modeling, even diet analysis…

    Brilliant.

  4. Marc Hedlund

    What a great idea. Love it.

  5. Brian Mulloy

    Mike, thanks for the great article. Our investors say, ‘YouTube for data.’ ;-) we’re just obsessed with making data exploration fun and insightful. But mostly fun.

    Execution, as Andrew Ritchie said, is what keeps us up late at night.

    We’re hoping folks who are data freaks help us steer Swivel into something valuable.

    -b

  6. Denver Wang

    How about the google doing the same business?

    http://www.ezecho.com

  7. Darren

    this is great, I think the way youtube implemented its embeding has opened a lot of peoples eyes.

  8. Uri L.

    Fantastic.

    For everyone who ever read the wonderful “Augustine’s Laws”, Swivel is the digital incarnation.

    ref for book:
    http://www.amazon.com/Augustin.....1563472406

  9. Raghu

    The moment I read this post the idea looked brilliant to me too as it has for the most of you. But as somebody also rightly pointed out above execution and its application is a important thing.

    Would like to hear some thoughts on the following possible ‘breakers’ that I see,

    1. Most data is typically tied to the ‘environment’ in which it is collected but uploads on this archive will be devoid of this environment. Without this critical piece of the puzzle how valid would the patterns that emerge be - how useful would they be ?

    2. Data, whatever it might will have a zillion ways of representing it - both conceptually (graphs, tables, figures, plain text) and in computer formats. In my belief trying to extract any form of patterns must of course nullify the difference in formats but not spread across conceptual data forms.

    3. Assuming that this idea really takes of in a big way and people might start basing their ‘decisions and choices’ on this huge internet archive and pattern generator. Is this necessarily a good thing to happen - am doubtful. Yes the collective intelligence of lots of people is probably behind my decision but my creativity is definitely lost !

  10. dave mcclure

    okay, i’ll start the over/under bidding on this one… i say Google buys it in less than 6 months. i can visualize Larry drooling over this one already :)

    Vote for Google acquisition on PollDaddy

    hey Mike: do you allow polls in your comments? let’s see…

    Vote the Over/Under on Swivel acquisition by Google

  11. Dave W.

    “Academic types are going to go nuts over this” and “Big companies, small companies, thinktanks and non-classified government organizations are going to be similarly dazzled”, are very big statements to make.

    As a college students I know that none of my teachers will accept Wikipedia as a valid academic source and I don’t think they would allow this either.

    It would seem that this data could easily be skewed by someone or some group with enough time and resources and some end in mind that promotes their own message.

  12. Rajeev

    Very interesting Read.

    http://www.tekno-world.blogspot.com

  13. Steve Macdonald

    This has potential for abuse by ideologues on the Left and the Right. From the Climate Change alarmists to the theory of Evolution deniers, there will be a very strong incentive to load this thing up with bogus data engineered to “prove” whatever pet ideologically-driven point the person wants to ram down the public throat.

    Or maybe it will naturally balance out for the most part, in the way Wikipedia seems to be doing for the most part (politically sensitve topics aside).

  14. Yaniv Golan

    It sounds like a brilliant idea and a huge challenge.

    This is one of these times when I find myself thinking “gosh, I hope they make it!”. Not to get over excited, but the need is clear, the potential is huge, and the screenshots look slick.

    Can’t wait :)

  15. Dmitry Dimov

    Mike, thank you for the excellent article. We’re getting real close now to taking the wraps off, and seeing all the great comments gets our cylinders firing. We do have a bunch of things to worry about, as Steve points out, but we also know that the folks like you who are passionate about data will help us get it right and bubble the good stuff to the top and weed out junk. We’ll see soon enough. Now back to working the final kinks out!

  16. Dr. Phil

    I dont understand what you guys are going ga-ga about..

    1) If I wanted to upload my company specific private data, I would be very cautious and hesitant about it — just like I am with google.

    2) Those academics who would be interested in slicing and dicing the data — they already use SAS software to get what they want to….

    The 3′rd comment is absolutely hilarious — you might as well use them for the genome project..

  17. Rajeev Kumar

    Looks like one day we are in for a comprehensive database for earth and its various data’s and correlations. A complete fit for being called a collaborative web 2.0 Application.

    Bravo
    http://www.tekno-world.blogspot.com

  18. George

    Wow. That is very cool. At first when I was reading along it sounded a lot like archive.org, but then once I read “the fun part” I started to think of the possibilities. The funny thing about this is that now all those conspiracy theorists have a great place to go to come up with new conspiracies.

    Imagine if you can make a correlation between UFO sightings and the number of bloggers in the area. I can see it now…

    “Bloggers are really aliens!”

    “Bloggers and aliens plan to take over the world!”

  19. Metrics2

    This is great. Sure there will be issues in execution, ensuring data quality, data privacy issues, and more. But this will certainly help in making everyine more dara aware (read Data IQ!).

    Yes, if it comes out as advertised, it’s going to be bigger than YouTube and certainly more valuable in improving productivity…

  20. Joel

    Two thoughts on this:

    1) I spent a great deal of time in college (and after college) studying and implementing data mining systems. There is a lot more to effective data mining than finding simple correlations between data sets. Yes, your website’s traffic could have spiked significantly every time the Denver Broncos covered the point spread on the road…but that doesn’t mean that the two data sets are correlated. This WILL be an amazing tool for amateur data analysis and should provide a lot of interesting results to small and medium sized busineses

    2) I would be stunned if this has nearly the popular appeal of youtube. Uploading videos is something simple and fun…any one with basic video editing skills and a camera can upload videos, and anyone can watch. Data analysis doesn’t seem to have quite the same adoption. I think a lot of people will be able to come up with funny/interesting data sets and correlations, but I don’t know if it will have quite the viral appeal.

  21. Ramon

    Dr. Phil,

    I think you’re being too restrictive on who this would appeal to. The emphasis is on the sharing and community aspect of it - not just academics and SMBs. Just imagine what happens when people people on MySpace or their personal blogs want to post embedded data on their page. I could see politically active people doing this a lot with (as stated above) from both idealogies, or sports people wanting to keep track of their own stats (like in high schools), hobbyists tracking things like bird sitings - more people use graphed data than just companies and academics.

    As far as abuse goes, that comes with the territory on public and community-driven sites. The comparison to youtube is very accurate - you can opinionate movies just like you opinionate data.

    I’m very interested in the slicing and dicing of data - the cross sections could range from interesting to hilarious.

  22. Hashim

    Swivel should be a feature of a web based spreadsheet app, not it’s own product, or its own company.

  23. Pedro Beltrao

    For me (a PhD student working in bioinformatics) this sounds potentially very interesting. It would be useful to share data for collaborative research, specially if they make it easy to access the data via APIs. I could even release alongside a publication the means to fully reproduce the calculations via this site. Others could quickly build on the publication with access to the data and analysis. I know this can all be done already locally but this would make it much easier.

  24. Miguel Paraz

    Are you sure this kind of app won’t become sentient and take over the world?

  25. Brian Mulloy

    Hi Hashim,

    We kind of agree with you. Spreadsheets are great for editing data. In fact, we mention on our Web site how inspired we were listening to a NerdTV podcast with Dan Bricklin. He boiled down his idea for the spreadsheet as a _word processor_ for data.’ The problem we felt was that after we created a spreadsheet and wanted to switch to sharing and reading mode it wasn’t much fun. Yet you look at the rows and columns and realize there are only so many ways one can pivot and fold this data into graphs. Why isn’t there a Web site that jams out all the combos and then let’s me cruise the data. In one sense that what Swivel is, the read mode of a spreadsheet. Swivel is _Web site_ for data.

  26. dave

    WOW! the implications for research are astounding…if the founders are reading this, i’d love to add one critical request (though it might already be in there) - user driven reviews of data integrity and reliability (e.g. ‘this data set has been marked ‘riddled with errors’ by 42 users)…likewise, will there be indications that the data set has come from an authority or expert resource (a la google coop)?

  27. ABCota

    Like Michael Crichton says: “Science by consensus is not science”. Same goes for data analysis. While this like it might be fun and interesting, I would be wary of anyone drawing any concrete conclusions from any data analysis done by a community of virtual amateurs. Hell we can’t even come up with definitive conclusions on climate even though, “lots of scientists agree”. Same will go for any conclusions attempted to be drawn from this…uh….tool? What sort of things do we expect to get? “We compared the home run ratio of Barry Bonds to the voting record of Barack Osama, and as you can see, clearly there were no WMD in Iraq.

  28. Mary-Ann Horley

    I think it sounds wonderful! If it does half of what I hope it will Swivel will be hugely useful for amateur sportspeople and hobbyists quite apart from people who are into politics, environmentalism etc who will be the main market.

    No, it’s not another YouTube but there definitely is interest in this sort of thing as the success of Freakonomics and THe Tipping Point show.

  29. Brian Mulloy

    Hi Dave,

    We’re reading it. Half asleep on our keyboards at this point, but still chasing bugs and checking out TechCrunch. It would be great to get your feedback on this. Here is out current thinking.

    We want to make it easy to organize user-driven reviews into two buckets. One is subjective. On the site we represent this with thumbs up or down. The other is objective (or tries to be) and we _will_ represent this with meters.

    Subjective stuff is applied to, say, a graph of baseball stats showing the Yankees with more World Series wins than any other team, authored by a guy named Patrick from South Boston. The Title: Yankees Suck! Now, millions of pinstripe fans need to give this graph a big thumbs down. While all the folks from Boston…and Detroit, Cleveland, Oakland…give it a thumbs up. That’s spin and needs to be expressed because…it’s fun.

    ‘Objective’ stuff will be a bit more involved and will include measures of data accuracy, thoroughness, and attribution. That same Yankees graph with all the subjective thumbs down is also very accurate and would have a high objecive measure.

    When we open the doors later this week the subjective stuff will be there, but for the objective parts we’re going to huddle with our founding community members in a few weeks (please join us, Dave) to figure out a way to do the accuracy stuff that engenders trust among data browsers and grows credibility data uploaders.

  30. Jonah from Burrito Blog

    Swivel could be the missing link for my latest project. Me and the Junior Burrito Analysts have been tallying price and weight data for our meals, and have been storing it in a shared Google Spreadsheet. As anyone who intimately involved with Excel will tell you, their graphing engine is pretty ugly and is very poor at sharing charts online. Google Spreadsheets is better at sharing, but those screen grabs of Swivel look great!

    Personally I don’t see Swivel having major implications in research or academics, and it certainly won’t touch serious analytics software. But for fans of Tufte, the ‘YouTube for data’ could be hours of fun.

  31. Alexandre Rafalovitch

    To Hashim:

    You are quite possibly correct.

    In which case, it would be a perfect sense to have a company that rapidly develops the graphing and explores the best functionality interface and then gets bought by Google, Salesforce or anybody else with massive data sets.

    And in a meanwhile, it will allow for quirky consumer uses that being as part of (say) Salesforce from the beginning it would not allow for.

  32. JugHead

    As an associate at a macroeconomic research firm in New York- I think this site could be awesome and very useful in the finance world. I don’t know if you know the subscription prices for data sources, but they are pretty high.

    One question though- will all the data be historical, or will it be constantly updated manually? Will there be features for large, popular data sets that are updated, such as index levels and government releases? I assume stock prices and indices will be filtered in automatically- true?

  33. J.C.

    The concept is really good. Only I wished I could see it in action.

  34. Jonathan Dugan

    Well done. Another step in the right direction. Visualization and analysis are complex problems to crack and let’s hope they offer a service that does it well!

    I like their model of “free if the data is public”. We’re planning something similar with Matson Systems, but not focused on visualization or analysis, but rather distributed organization and dataset building and maintenence for groups, communities and businesses.

  35. Brian Mulloy

    Thank you, Joel:

    “I would be stunned if this has nearly the popular appeal of youtube. Uploading videos is something simple and fun…any one with basic video editing skills and a camera can upload videos, and anyone can watch. Data analysis doesn’t seem to have quite the same adoption. I think a lot of people will be able to come up with funny/interesting data sets and correlations, but I don’t know if it will have quite the viral appeal.”

    I agree. Are you available to help me set expectations with our investors. :-)

  36. JJ Saenz

    I wish these guys the best of luck, and as a user of statistical/econometric software, I say the road is clear ahead: not only can you be the youtube of datasets: you can sell analytic capabilities such as high level regression or analysis capabilities (dynamic panel data with instrumental variables a la Arellano-Bover, anyone?).

    These kinds of capabilities exist in very expensive software that you might use once or twice, even in a specialized environment. The situation is begging for a pay-as-you-use approach. I’m surprised STATA or MATLAB are not usable on the web on a pay-as-you-use basis.

  37. Brian Mulloy

    Jonah, bring on the Quesadilla. Long live Tufte.

  38. Sara Wood

    Excellent work - I can’t wait to see this launch. I have talked (much) more about this here:

    http://saraewood.com/2006/12/05/swivel/

  39. Brian Mulloy

    JJ Saenz , thanks for the well wishes.

  40. Allan Benamer

    I’ve talked about this on my blog as well at http://www.nonprofittechblog.org.

    Swivel could probably help to crack the tough issues of transparency and accountability in the nonprofit sector.

  41. David Lee

    To Swivel Team,

    This is David from the EditGrid team. I believe there’s something we can explore together. We’ve an online spreadsheet which allows users to collaboratively edit their data.

  42. Matthew Leitz

    Great idea, and I think labeling yourself as the YouTube of data is smart (you have to make data seem fun somehow). As everyone keeps saying, I do not believe this will become the mecca of serious data, but it has the potential for becoming the best for your everyday Joe (which is what really matters). What is really great about the model is the seperation of private and public data. The public will be challenging, but will certainly build brand awareness. The private will be a very useful tool and I believe if priced right could support the project.

    As for subjective vs. objective data, I imagined a system such as that as I first read the article. Without that in place before the word spreads about the site you could end up with egg on your face. Having a potential user’s first impression of Swivel being a graph that clearly shows without a doubt that “Defense spending effects the quality of American Idol contestants” could hurt more than help.

    I would CLEARLY MARK each graph with not only if data is objective or subjective, but also with the level of validity in the statistics. Maybe have a “Validate” button by the data on the site and have the graphs say “This data has been Validated by XX users.” This might not help initial traffic because website owners would not go create a “More pageview on this site leads to higher user income” graph, but would help the long term validity.

  43. Brian Mulloy

    Matthew Leitz,

    Great thoughts. Your warning is a good one. It’d be great to get your take after you take a look around.

  44. Joe Ropkins

    Even Stephen Dubner of Freakonomics seems excited by this new service [http://www.freakonomics.com/blog/2006/12/05/a-youtube-for-data/]. ;)

    Good luck gentlemen. Nice idea.

  45. Jason

    Brilliant! Death to SPSS!

  46. Tim

    A preemptive caution that I would like everyone to repeat after me:

    Correlation is not causation.

  47. grumpY!

    most of today’s useful datasets are useful because they are huge. even if swivel would assent to paying the storage costs for multiterabyte datasets, how do you think i should import it?

    what data formats do they accept? the data out there is dark matter partly because of the uncountable formats it is stored in.

  48. Jon Coward

    The name is terrible. It’s part of an insult in English, isn’t it?

  49. Pramit Singh

    Actually, this seems to be the Myspace for Data. It goes further beyond Myspace and provides good tools and content for the user. The ideal Web 2.0 site.
    http://mediavidea.blogspot.com.....thing.html

  50. grumpY!

    also, how does the inference work? you say you will draw correlations between disparate datasets…how? lets say i updload data in foxbase format for the number of bald eagles i see outside my window.

    first of all, can you read foxbase? now i labelled my columns “no. of bald sightings”. what do you plan to infer from this automatically? i predict that like most column labels, this will be more or less opaque to you. you could try a NLP approach but this likewise seems intractable for a start up.

    hmmm, sorry, but even a cursory analysis of the problem you want to solve doesn’t seem to inspire hope, although some answers to these trivial issues would shed some light

  51. anonymouse

    this is groundbreaking technology. I believe all investment banks will stop using excel and proprietary systems and upload all their data to swivel. google will buy them for at least $2 billion. I have a phd in statistics.

  52. anonymouse

    ok, on a more serious note, let’s take a break from all the slobbering over this site. it’s a clever idea, but sadly appears to be a classic example of something that sounds wonderful on paper but in practice goes unused. academic types will *not* go crazy over this. regression analysis is trivial in excel, and seriou researcher knows how to squeeze all kinds of power from excel or matlab or R or what have you. I highly doubt “swivel” will have the same level of power or sophistication, and if it does, I highly doubt it will be as usable as something installed locally.

    so who might use it? the average joe. problem? their business model doesn’t make any money of average joes. “swivel” is a clever solution to a problem that doesn’t really exist. but of course that won’t stop wannabe VCs from showing off how many buzzwords they know.

  53. david8

    Swivel sounds like great fun for the average user to play around. I’d play around with it myself.

    But there’s no reason why serious data users or data providers would EVER bother with swivel. Major data providers like the U.S. Bureau of Labor Statistics, Census Bureau and the BEA aren’t going to suddenly start placing all their datasets on Swivel. They already have their own websites which allow for the control of and downloading of datasets, many of which are very large. Why pay someone like Swivel to store their data? Furthermore, why would academics or other rearchers who have access to all kinds of on-site disk space bother with swivel either — especially when serious data may involve confidentiality agreements?

    SPSS and SAS aren’t dead by a longshot. Most datasets involve considerable work (spatial rearrangement and theoretical considerations) to pound them into some kind of usable shape. Data is NEVER arrayed as needed. There are always issues with how to aggregate the data, scaling, units being used, missing data, miscoded data and a million other things. Swivel isn’t going to do that. Shoot, you have tinker with data to even make a useful spreadsheet — never mind trying to make some kind of useful statistical output.

    To think that a person needs to nothing about statistics yet at the press of a keyboard key will have access to all kinds of MEANINGFUL stats is pie in the sky. If only it were that easy. Everyone could skip taking those pesky probability and stats courses in college.

    So folks can all have fun with some simple correlations (me too). You can always produce a number, even if it that number no reasonable interpretation. And yes, correlation is definitely not causality.

    There will never be a substitute for thinking (aside from artifical intelligence considerations).

  54. david8

    Regarding #44:

    Stephen Dubner is a writer/reporter. He’s not a statistician, or economist or any other kind of data analyst. His blog merely repeats the tech crunch post.

    By all means, though, read his book about his family while growing up. It was good. His parents were totally off their rockers — not in a goods way either. That has nothing to do with his qualifications to discuss data analysis, however.

    http://www.amazon.com/Turbulen.....mp;s=books

  55. Thomas

    Wow, another cool idea that some people will be able to make some dangerous and sometimes incorrect observations easier. I wouldn’t be surprised to see the next “Freakanomics”, “Tipping Point”, or “Longest Tail” type book be written based on all information used from information and correlations found within this site and service. Bravo!

  56. Dmitry Dimov

    david8,

    Thank you for the thoughtful and relevant points. Literally translating the awesome power of statistical packages such as R or SAS would be a futile exercise. Want we want to do is use the Web for what it’s so good at: communication. Just for grins, cruise around the Census Bureau site or DOE pages and see how easy it would be to gain some very simple insights from the thousands of spreadsheets prepared by trained analysts and statisticians (many thanks to them by the way, that data will keep Swivel going for a long time to come). We know there must be an easier way to let all this dormant data tell a story, and that’s why we built Swivel. Many posters also mentioned academics going ga-ga over Swivel — I sure hope that will be the case, but we’ve also talked to plenty of people who are not academics but are nevertheless data freaks, or just curious types who want a simpler way to gain insights from all the data surrounding us. So, if you know what questions to ask and want to do hard-core data analysis, R (which is free), SAS, and matlab are your weapons of choice. If, on the other hand, you’re thinking, “I’m not sure what I’m looking for but I’ll know it when I see it,” and you’re willing to cruise around and explore data, and you want to tell others what you found, we want you to come to Swivel. In any case, the proof is in the pudding, so off I go to give Swivel a few gentle pushes to get it out the door so you can see for yourself. I appreciate the thought you put into your post and hope to have you as a user at some point.

  57. Allen Sligar

    I want to conratulate the Swivel people just down the street (we’re in Sacramento) on thier launch for a few reasons:

    1. Congrats on cross pollinating a web 2.0 look with data sets and making it relavent to the current enviornment.

    2. Congrats on your functionality, thats no mean feat, making the data sets robust, appealing and relavent, and then allowing for analysis in a way that allows comparability of disparate data points…..very cool

    3. Contrats on getting close to launch, its hard, we’re 1 month away also

    4. Congrats on a good showing, that proves how cool, relavent, powerful and important, community generated data sets can be.

    I congratulate you on this because I know how hard it is to bring something like this to market, we’re doing something similar in the video games space….

    There is a huge opprotunity for business in aggregating data sets, globally aggregated data sets in and of themselves, can become excellent DSS from which one can extrapolate relavent market and trending data as well as predictively model consumer preferances……wow especially where one has comparative data sets…nice job guys.

    Allen

  58. Allen Sligar

    I also wanted to add:

    I agree with david8 on the points related to the challenges with data. Those are not easily overcome, the software, coding and etc. will only ever get you so far, good analysis is required for data thats going to be used in an actionable manner.

    However, if you think of SAS and SPSS as very fine tools that require some experiance and knowledge to use to get a very ganular and detialed look at your data, much like a laser.

    I think there is also room for data aggregation services, I have called this in meetings with my partners UGD (user generated data). This is more akin to a hammer, not quite as accurate, requiring little in the way of expertise, but it just might get the job done…..

  59. anthropocentric

    Finally a novel and very interesting idea. Good luck Swivel!

  60. r e g a l

    I can’t wait to analyze my sex life.

  61. BV

  62. BV

    http://www.venganza.org/piratesarecool4.jpg

  63. Brian Mulloy

    There it is. #62. The Flying Spaghetti Monster graph. at last.

  64. Reed

    “…rated, commented and bookmared by other users…”

    What is “bookmared?”

  65. Simon Shelston

    Well done guys! Looking forward to it!

    - You’re old BEA buddy, Simon

  66. BV

    Come on Mr. Mulloy, when you’re trying to make what is essentially a social networking site (”YouTube for data”) and say such things as “But then the real fun begins. You and other users can then compare… UFO sightings to iPod sales,” you’re going to get some good-natured teasing. You’re inviting logical fallacy and spurious claims.

    No offense, but when it comes to this sort of thing, you’ve either got to have controls in place to assure the uploaded data comes from some sort of reputable source, or you’re going to get a bunch of fake crap with silly conclusions drawn to “support” one’s case, potentially to a dangerous end as alluded to in post #13.

    In my estimation, data networking just isn’t going to be popular. YouTube works because of people’s vanity: they want to see themselves on TV and they want to see that “views” number grow, knowing people are watching them.

    The data with which my colleagues and I work is on the order of 1-2GB per dataset. No web-app right now could handle 10-20 sets of such data. We’ve got our own methods in place, as does as every scientific consortium that shares data.

    It’s a neat and fun idea, but it’s probably not going to make any Freakonomics-type discoveries due to the “fun” nature of the site.

    That said, I’m totally prepared to be proven wrong: I really hope that this projects adds to the general knowledge and–if this doesn’t work–future projects will be built from it.

    Also, you know you love that someone used the FSM graph: it ties a cultural phenomenon popular amongst your target user base to your commercial product. If that juxtaposition can take hold, your service will be much, much more successful. I’m just doing you a favor. :)

  67. Tom Paper

    I would encourage people to check out Data360. We launched a collaborative data tracking website 2 months ago. http://www.data360.org Best regards, Tom

  68. Reed

    Not sure how we can check that out, Tom.

    “Sorry, this Organisation is deleted or deactivated!”

  69. visnup

    i prefer to think of it as the youtube of all that is good and plentiful.

    offhand, isn’t the internet as a whole one big irreputable source? i personally thrive on it. i quote wikipedia even if it’s wrong and am happy in my ignorance. if i’m really into a subject, i can dig deeper and find out the truth on my own: no data is definitive on any subject no matter the source or binding and everything is a dull, unexciting grey. (props to my main man, global warming)

    and vanity still applies to swivel i think, just in a different, nerdier way that isn’t as common in most people. i give respek to milton friedman as much as paris hilton, but definitely in different ways.

  70. Marshall Poe

    What Tom Paper said. Data360 is basically the same concept (though I’d call it “Wikipedia for Data,” as YouTube is the home of Lonely15 sillyness). The site has been up for a couple months. The server is down right now, but I’ve used it. It’s easy to use, full of collaborative features, and already has a bunch of useful data. The URL is (I’ve got it bookmarked) http://www.data360.org. Check back when this server business clears up.

  71. Brian Mulloy

    BV. All your comments are spot on and balanced. We’ll see how this shakes this out.

    -b

  72. Brian Mulloy

    Simon!

  73. Tom Paper

    Marshall and Reed, our site is up and running. Go ahead and check out http://www.data360.org Best regards, Tom

  74. Nemrut

    This would be a very interesting complement to Google Earth —

    Google Dearth…?

    I bet these guys are licking their chops right now..

  75. mark

    I see a more mundane utility for people doing work in field biology. It’s hard because everyone works on different things, with data at different granularities, stored in different formats. I’ve got sets of data from 10km transects looking at vascular plants, and 100m transects looking at insects, and 9m square plots looking at pollinators. All in the same area, all done by different biologists.

    What happens today is that the data disappears on peoples’ computers. A paper gets published, but the data isn’t shared or shareable. Providing a platform for getting datasets into a single shareable and accessible format is actually a big thing for field biologists. Add some tagging and searching, and people can beging to pull out more meaningful information at a system (ecologically speaking) level than they currently do.

    I’ve been collecting and storing and trying to figure out how to deal with disparate biological field data for a while now. Just getting the basics in place has been challenging enough. I hope this service grows and improves.

  76. Silly Girl

    Color me unimpressed. As a geoscientist who is used to dealing with datasets large and small, the biggest problem has always been getting the data in the right format to do interesting things with it. There are many different data formats for different types of data. Also, as far as scientific data is concerned, it has to be recorded and preserved until controlled, known conditions. This will be about as useful to researchers as blogs and pictures are to journalists - which is useful, but not up to par for professional work.

  77. David Mackey

    This is a great idea and hopefully will yield some great insights. I’m glad to see that they have a monetization plan.

  78. Charlie

    I’d say this idea looks great on paper but extremely hard on implementation, especially on the usability front. The site is still not open to the public and we don’t know how easy for users to create meaningful charts and graphs. It’d have great potential if they can solve the usability issues.

  79. James