Corner Office

‘Big Data’ was probably one of the most used and abused terms in the IT industry throughout 2012. Conferences, publications, vendors, and analysts alike talked tirelessly about the opportunities and challenges created by Big Data. However, what is really amazing is the speed at which organizations are trying to learn and assimilate radically new architectures to process their data. For instance, recent estimates from IDC and GigaOM predict the Big Data market will be $26B or more by 2016. It seems like 2012 was all about experimentation and setting expectations. Now, in 2013, it’s time to walk the talk! So, what can organizations expect as they embark on their Big Data journey? Well, for starters, it’s important to recognize some of the key challenges they will face.

The Big Data skills gap. Hadoop has emerged as the de-facto platform to process Big Data. However, as my colleague Steve Totman pointed out in a recent ITWorks blog, technical skills in Hadoop, MapReduce and all things Big Data are becoming more and more expensive and difficult to find. Therefore, it is critical for organizations to find tools that can leverage skills that already exist within their organizations –writing Java, designing data processing flows with a GUI – to take advantage of this highly scalable framework.

Weed out the noise. Earlier last year, Gartner pointed out the relationship between noise and Big Data. I couldn’t agree more. As companies start to collect, store and process Big Data, they need to be very careful to filter out the bad data. As noise grows, the value of Big Data goes down exponentially. Therefore, organizations will need the right tools to not only connect to all relevant sources of data, but also pre-process and cleanse before they load it to their data processing frameworks, which will most likely be a Hadoop environment.

It’s the economy, stupid. Ok, ok, I knew that line would catch some attention, but it’s actually true. What we’re seeing is a shift from big, heavy architectures that demand exponential costs just to keep up – a.k.a. scale to meet the demands for more data – to seemingly low cost, highly scalable approaches like Hadoop. While Hadoop can scale much more cost-effectively by adding commodity hardware (nodes), organizations will hit a wall at some point. Think about maintenance, cooling, power, and even real-state costs. As Hadoop implementations grow, the need for tools that can maximize the performance of each node will become a critical factor or success.

2013 can be the year that Big Data technology gets traction and this will only ramp up Hadoop adoption. Organizations looking to reap the benefits of Big Data, have to be smart about their Big Data strategy, not only as it pertains to Hadoop – yes Hadoop is not the holy grail for everything – but as they move through a path that should eventually lead them to answer the big questions, or isn’t that what Big Data is all about?

{ 0 comments }

The Big Data Top 10

December 19, 2012

2012 — “The Year of Big Data” — is coming to an end, but we know that Big Data itself is here to stay. So, whether you have been living on a desert island for the past twelve months (lucky you) and don’t know a thing about “Big Data” (not so lucky) — or, more likely, you have have been too busy keeping up with Big Data to actually read about it — we’re here to help.

The team here at Syncsort has compiled our “Top 10 ‘Don’t-Miss’ Big Data Resources of 2012.”

Check them out — and if you have some favorites of your own that are missing — continue the holiday spirit of giving and post them here!

Top 10 “Don’t-Miss” Big Data Resources of 2012:

  1. Wikibon.org Blog: A Comprehensive List of Big Data Statistics.  Not convinced that Big Data is real?  Check out more than 30 impressive stats on Big Data.
  2. TDWI’s Big Data Integration E-Book.  A quick, eight-page guide to help wrap your head around how Big Data will change the way you think about Data Integration.
  3. IT Business Edge: Eight Ways Any IT Division Can Use Hadoop.  Hadoop isn’t just for ultra-cutting-edge and cool use cases.  This blog covers the most practical ways almost any company can put Hadoop to work.
  4. InformationWeek: “How to Find Strategic Advantage From Big Data.” What you need to know when prioritizing Big Data for your organization.
  5. CIO.com: “How comScore is Using Hadoop to Tame Its Big Data Flow.”  Read how the Internet analytics giant (and Syncsort customer!) uses Hadoop to manage 32 billion new rows of data a day.
  6. FT.com: Decoding Big Data Series.  5-part series of articles offers great insight into what Big Data is and how companies across diverse industries are diving in.
  7. White Paper: “The Impact of Big Data on ETL”.  BI analyst Mike Ferguson takes on — you guessed it — the impact of Big  Data on ETL.
  8. Harvard Business Review Big Data Insight Center. Didn’t get your VP a holiday gift?  Send them this — a one-stop shop for what senior executives need to know about Big Data.
  9. On-Demand Webcast: “Best Practices for Big Data Integration.” On this popular TDWI webcast, BI guru Claudia Imhoff provides a no-nonsense view about how you need to integrate your data for successful Big Data BI.
  10. Monash Research Blog: Some trends that will continue in 2013. Clear and concise, industry influencer Curt Monash offers his year-end predictions for the trends that will carry us into the new year.

{ 2 comments }

Walter Curti Joins Syncsort

August 23, 2012

I’m delighted to report that Walter Curti has joined Syncsort as Vice President of Data Protection Engineering.  Walter is a long-time player in the data protection space and a true visionary.

I first met Walter back at Cheyenne Software where I worked in the mid 90s.  He was in charge of the Windows division at Cheyenne where we worked on ground-breaking products like ARCserve data backup software and InocuLAN anti-virus software. Those were the early days of what we used to call client-server computing, and the innovations were coming fast and furious. Cheyenne was breaking new ground in protecting open files, in integrating backup and anti-virus with applications, and with unique ways to make tape backup faster (including “Tape RAID” – whatever happened to that?). This was back in the day when a backup that measured in Gigabytes was a lot! These innovations seem routine today, but back then integrating backup with a database was cutting edge.

Another event I remember very well was when Cheyenne developed the first Bare Metal Restore application for Windows NT. That project was driven by Walter and I recall him telling me how Microsoft said it was an impossible task. There was no way to restore a server without going through the full Windows NT install and then copying back the data. You couldn’t restore the previous system state. Well Walter didn’t let Microsoft put him off, and sure enough he figured out how to do it and Cheyenne was the first vendor with a BMR product for NT.  

This refusal to accept defeat is characteristic of Walter’s approach to technology and innovation. He is also a terrific team leader and motivator who knows how to listen to his people. You can’t say that of everyone in this business!

Walter and I took different roads after CA acquired Cheyenne and I didn’t work with him again until our paths cross briefly several years ago. I’m very excited to be working with him again now at Syncsort, and I’m confident that our users and partners will soon see the benefits his vision, enthusiasm and insight will bring to our data protection offerings.

{ 0 comments }

What a busy, exciting June it has been at Syncsort! It started with CIO magazine featuring comScore, and their use of our DMExpress data integration software, in an article on taming Big Data. Mike Brown, comScore’s superstar CTO, says that Syncsort’s software made their Hadoop migration a piece of cake and is playing a central role in helping the company save 75 terabytes of data storage a month. Now that translates into BIG savings!

Next up was Hadoop Summit  in San Jose where we formally announced our partnership with Hortonworks as well as DMExpress certification for the new Hortonworks Data Platform. If you haven’t already had the chance to check them out, I highly recommend the event recaps that my colleagues Tendu Yogurtcu and Keith Kohl checked in with right here on the Syncsort blog. The cherry on top of Hadoop Summit was Syncsort’s Vice President of Marketing and Product Management Mitch Seigle appearing on “The Cube” with Wikibon’s Jeff Kelly and SiliconANGLE’s John Furrier. It is well worth watching the entire segment, but we’ll soon be making available shorter clips of some of the top highlights.

Finally, we announced this week that Syncsort has joined the Greenplum Catalyst Developer Program and that DMExpress is now certified for high-performance loading of Greenplum Database. There was a lot of excitement and great coverage of the news. In fact, ZDNet blogger Andrew Brust took notice of Syncsort’s big month and Big Data 1-2 punch.

While it is always nice to head into the summer months of July and August with momentum, we aren’t finished yet. We have a lot of exciting things going on and it will be well worth your while to stay tuned.

{ 0 comments }