Our fifth and final installment from Mitch Seigle’s time on “The Cube” at Hadoop Summit centers around the mantra that pragmaticism is key when embarking on a Big Data project.
With the buzz around Big Data and new tools and approaches like Hadoop popping up seemingly every day to better manage it, organizations must remember to define Big Data based on the goals they want to achieve. They also must remain focused on the value Big Data will deliver to the organization in terms of new opportunities, and base technology decisions around that as a means to that end goal.
Enterprises should remember not to get too wrapped up in the hype surrounding Hadoop but rather focus on how Hadoop and Big Data can help drive the bottom line.
For the complete series on Mitch’s insights from Hadoop Summit, remember to also check out videos 1, 2, 3, and 4.
Our fourth installment from Mitch Seigle’s time on “The Cube” at Hadoop Summit focuses on getting started with Hadoop. Three initial steps to consider include:
Experiment – We strongly recommend an experimentation phase, where significant testing is performed in a Hadoop environment, before it is put into production
Prioritize Data – Identify the high value problems and prioritize Hadoop projects aimed at solving those first
Don’t Re-architect…Yet – Test first before any strategy discussion around rebuilding data integration or data warehousing processes takes place
Syncsort DMExpress can help simplify many of the Hadoop processes such as loading data into the Hadoop framework, improving MapReduce performance, and helping to alleviate the skills gap associated with Hadoop (given DMExpress’ self-tuning capabilities).
If you haven’t already, I strongly encourage you to also check out videos 1, 2, and 3 from the series for more great insights from Mitch.
Our third installment from Mitch Seigle’s time spent on “The Cube” at Hadoop Summit centers around Syncsort’s history in helping customers with their most data intensive challenges. From the mainframe era, through ETL, to today’s Hadoop environments, the one constant is Syncsort’s ability to deliver as data requirements change, and enable optimal data performance in a reliable, scalable, and cost-effective way. As our customers evolve in the era of Big Data, so does Syncsort.
Video segment two from Mitch Seigle’s time on “The Cube” at the Hadoop Summit 2012 focuses on the hot topic of Big Data skills. Organizations are grappling with finding and retaining the right skill sets to not only handle long-existing and complex data management technology such as SQL, but also emerging Big Data technology such as Hadoop. Mitch speaks to the fact that technology exists today that can address both of these important areas.
Syncsort DMExpress provides an environment in which people can become productive at programming to the higher-level tasks that they want to achieve in a matter of only days. As it relates to Hadoop, this means that users don’t need to learn how to write MapReduce and maintain Java and Pig scripts. Instead, they can leverage existing ETL/data integration skills and focus on driving value from the data.