ETL Tools Overview
ETL tools have been around for decades. The market has a number of established commercial players, including IBM InfoSphere DataStage, Microsoft SQL Server Integration Services, and Oracle Data Integrator.
Technology developments over the past five to 10 years have given birth to a new crop of market entrants, both commercial and open source.
Comparison of ETL tools
Here’s a comparison of the ETL tools on the market today (suggested additions and edits are welcome).
You have three general options when it comes to ETL tools:
- You can purchase a commercial tool
- You can use an open source tool
- You can write your own scripts
Commercial ETL tools
The best ETL tool for you will depend on a variety of factors. If you’re looking to dig deeper into the options available to you, here are a few resources to get you started. These articles focus almost exclusively on commercial ETL tools servicing the large enterprise market. There is less information available on the newer commercial ETL tools that serve the SMB market.
|ETL Tools – Top 10 ETL Tools Reviews||databaseetl.com||2014|
|Gartner Peer Insights: Data Integration Tools||Gartner Peer Insights||2016|
|Adeptia’s comparison matrix of ETL tools||Talend Experts||2016|
|Quora – Talend vs. Alteryx||Gartner Peer Insights||2016|
|A Survey of ETL Tools||International Journal of Computer Techniques||2015|
Open source ETL tools
The most popular open source ETL tools are:
- Pentaho Data Integration (Kettle)
- Talend Open Studio
The best ETL tool for you will depend on a variety of factors. If you’re looking to dig deeper into the options available to you, here are a few resources to get you started. These articles focus exclusively on open source ETL tools.
|Business Intelligence: Which is the best open source ETL tool to start working?||Quora||2012|
|Jaspersoft vs. Pentaho – Which One To Use & Is There Any Need To Purchase The Commercial Edition||IT Central Station||2013|
|Open Source ETL Tools Comparison||Talend Experts||2016|
If you choose the path to build it yourself, there are plenty of ETL scripts out there to give you a starting point. Here are some of the most popular tutorials on the topic:
- SQL ETL Script – A simple data ETL method – nothing’ but SQL
- Python ETL Script – Using a simple Python script for end-to-end data transformation and ETL (Part 1)
ETL scripts can be written in Python, SQL, or most other programming languages, but Python remains a popular choice. In addition to being the language of choice of several popular open source ETL projects (i.e., Pygrametl, Petl, Bubbles), it’s also a go-to for engineers and data scientists looking to DIY their ETL process.
If you’re interested in this path, here are some resources to get you started: