Introducing Ironcluster™ – Bringing Hadoop ETL to Amazon Elastic MapReduce in the Cloud

IronclusterToday, Syncsort is announcing our ETL Hadoop offering for the cloud on Amazon Web Services (AWS).  As organizations and individuals seek to learn more about Hadoop, try out new use cases, scale a cluster up and down easily, quickly & affordably, they are increasingly looking at cloud-based infrastructures.

Syncsort Ironcluster: Hadoop ETL for Amazon Elastic MapReduce – Release1, is the first and only ETL tool on AWS for Elastic MapReduce (EMR), Amazon’s Hadoop cloud-based environment, available in the Amazon Marketplace!  In fact, there are a lot of firsts here:

  • First Data Integration-as-a-Service Engine for Amazon Elastic MapReduce (Amazon EMR)
  • Syncsort’s first cloud-based offering for Hadoop
  • As mentioned, the first and only ETL(Extract – Transform – Load) tool available for EMR
  • The first, and only, ETL product that is deeply integrated with MapReduce
  • A free-use version is available (more below)

There are many documented use cases for Hadoop, but a very common one is ETL.  Even when users don’t know they’re doing ETL, that’s what they’re doing.  WRT Hadoop, I’ve heard it called data refinement, data preparation, data management, etc.  But at the end of the day, they’re aggregating web logs to understand patterns, joining data to merge disparate data sources, sorting data, filtering and reformatting it, and so on.  That’s ETL!

When we started this project, our goal was to make it easy and attractive for users to get started using Ironcluster on EMR.  For instance,

  • It’s available in the Amazon Marketplace
  • There’s a free usage version available.  You still need to pay for your EC2 & EMR usage, but Ironcluster is available free of charge for up to 10 nodes.
  • The pricing is very attractive; there are 4 usage levels available depending on the number of nodes you have and the level of support you need

Usage Level

Maximum Nodes

Ironcluster Price/Hour

Support

1

10

$0 – Free!

Online through our community

2

50

$10/hour

Community, email, phone

3

100

$20/hour

Community, email, phone

4

Unlimited

$30/hour

Community, email, phone

 

  • We provide examples and templates, what we call Use Case Accelerators, with documentation and even videos for users to get started quickly.  We have a Ironcluster resources page available to navigate the resources available
  • Nothing to download.  Everything is hosted in the cloud, including the graphical interface to develop & maintain the ETL jobs

So why is this “Release 1”?  This is obviously not the first release of our Hadoop product.  We released that back in May & June, but this is our first offering on Amazon EMR.  We’ve got many new enhancements and features planned to make the users’ experience even better.

If you happen to be at AWS re:Invent this week in Las Vegas, stop by booth #825 in re:Invent Central for a demo and to learn more from our technical experts.

Get started today and let us know what you think!

Keith Kohl

Authored by Keith Kohl

Vice President, Product Management
6 comments
  1. […] said its new Ironcluster data integration engine will bring ETL (extract transform and load) capabilities to Amazon Elastic MapReduce (EMR). The […]

  2. […] said its new Iron­clus­ter data inte­gra­tion engine will bring ETL (extract trans­form and load) capa­bil­i­ties to Ama­zon Elas­tic MapRe­duce […]

  3. […] said its new Ironcluster data integration engine will bring ETL (extract transform and load) capabilities to Amazon Elastic MapReduce (EMR). The […]

  4. Keith,
    thanks for bridging the gap between cloud-based ETL and in-house ETL. I look forward to exploring this offering.

    Kevin Janes
    Enterprise Data Architect
    Unum

  5. Commonly I would not understand submit in blogs and forums, however i want to point out that this particular write-up extremely forced us to look at and also do so! Your current composing style has become stunned myself. Appreciate it, quite nice post.

  6. Hey there I am so grateful I found your site, I really found you by error, while
    I was looking on Bing for something else, Anyways I am here now and would
    just like to say thanks for a fantastic post and a all round entertaining blog (I also
    love the theme/design), I don’t have time to read through it
    all at the minute but I have saved it and also added your RSS feeds, so
    when I have time I will be back to read much more, Please do keep up the superb b.

Leave a Comment

*