Big Data Camp Kicks Off Strata NY + Hadoop World
Several members of the Syncsort team attended the Big Data Camp event sunday afternoon in NYC wearing our hadoop hacker & Mainframe Maverick T-shirts. It was a great way to kick off Strata NY + Hadoop World! (Here’s more info about what we are doing at the show).
We co-sponsored the event along with Microsoft and Intel, given it’s just down the road from our HQ.
It was extremely well attended given it was a Sunday afternoon in NYC where there are no end of distractions. Dave Nielson, as ever, did a great job of organising and DJ’ing the unconference.
Dave began the session with a great talk – especially considering his lack of sleep, having just flown in from a hackathon leaving at 4am. As usual, Dave had some very insightful observations about Big Data and the cloud’s impact on jobs – he quoted the frog in hot water analogy – it’s widely suggested that if you put a frog in cold water, then slowly raise the temperature it will simply die from boiling as it’s unable to react to the gradual change around around it (though there are lots of suggestions this is just a “sounds good” rather than real analogy – hopefully nobody will try boiling any more frogs to confirm – read the existing research. I especially liked the comment about how if your above the cloud you’ll be telling it what to do, but if your under the cloud it will be telling you what to do and that is unlikely to pay a minimum wage.
Then we got into a number of lightning round presentations.
Joel Horwitz from Alpine Data Labs did a great “real world discussion” on why customers are more brutal thank you think – with some good examples from an unnamed security company.
Chang She of Datapad.io covered the impact of bad tools on your data analytics workflow – I loved his slide (pic attached) You’re not a data scientist “in data science 80% of the time is spent to prepare data, 20% of time is spent complaining about the need to prepare data. I was curious enough to sign up for the datapad.io beta program.
Jamie Fizgerald did a session on Rage Against the Black Box: Lessons Learned on the Challenge + Rewards of Radical Transparency”.
Our own Jorge Lopez then did a session on “Suits, Hoodies, Elephants, Mainframes”. I tweeted a few comments to give credit to Merv Adrian who gave Jorge and I the suits and hoodies slide we used in the webinar with Cloudera on this topic, re-used in this session and in the process discovered that the original idea for Merv’s slide came from Frank Buytendijk.
Finally Paul Dix from Errplane discussed a new open source time series database which was pretty interesting – it’s amazing how many companies are appearing in the hadoop ecosystem.
After the lightening rounds the unconference kicked off with an un-panel including Tendü Yoğurtçu from Syncsort and then the sessions.
Again great involvement with lots of great discussions – I especially liked the session on what challenges faced customers deploying Hadoop in the real world and who’s responsibility it was too fill the gap. Also I attended some of the 101 Hadoop sessions Dave was running – he made an attempt at repeating a slide I showed him, getting it close but not completely right – I actually did a blog post using the quote and used it at the beginning of the BDW Hug about a month ago.
Read my blog post here, though my version calls it kissing in the school yard, rather than the much less politically correct version going around 🙂
Also, if you are at Strata NY, don’t miss the presentation where my colleague Jorge Lopez teams with Cloudera’s Matt Brandwein on Wednesday to present, “How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron & Big Data.” Click here for full details.