ETL tools overview

ETL tools have been around for decades and the market has a number of established commercial players like IBM’s DataStage, Microsoft’s SQL Server Integration Services, and Oracle’s Data Integrator.

Technology developments over the past 5-10 years has led to a new crop of market entrants, both commercial and open source. 

Comparison of ETL tools

Here’s a comparison of the best ETL tools on the market today (Suggested additions and edits are welcome).

There are really three options when it comes to ETL tools: a) you can purchase a commercial tool, b) you can use an open-source tool, or c) you can write your own scripts. We'll address each of these options in the sections below.

Commercial ETL Tools 

The best ETL tool for you will depend on a variety of factors. If you’re looking to dig deeper into the options available to you, here are a few great resources to get you started. These articles focus almost exclusively on commercial ETL tools servicing the large enterprise market. There is less information available on the newer commercial ETL tools that serve the SMB market.

Article Source Year Published
ETL Tools - Top 10 ETL Tools Reviews databaseetl.com 2014
Gartner Peer Insights: Data Integration Tools Gartner Peer Insights 2016
Adeptia’s comparison matrix of ETL tools Talend Experts 2016
Quora – Talend vs. Alteryx Gartner Peer Insights 2016
A Survey of ETL Tools International Journal of Computer Techniques 2015

Open-source ETL tools

The most popular open source ETL tools are:

The best ETL tool for you will depend on a variety of factors. If you’re looking to dig deeper into the options available to you, here are a few great resources to get you started. These articles focus exclusively on open source ETL tools.

ETL scripts

If you choose the path to build it yourself, there are plenty of ETL scripts out there to give you a starting point. Here are some of the most popular tutorials on the topic:

Python ETL

ETL scripts can be written in Python, SQL, or most other programming languages, but Python remains a popular choice. In addition to being the language of choice of several popular open source ETL projects (i.e., Pygrametl, Petl, Bubbles), it’s also a go-to for engineers and data scientists looking to DIY their ETL process. 

If you’re interested in this path, here are some resources to get you started: