Reddit reviews Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration

We found 2 Reddit comments about Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration. Here are the top ones, ranked by their Reddit score.

u/bbowler86 · 2 points · r/datascience

Yeah, to be honest, the only things you get with the Enterprise version are some visualization stuff, which is meh at best, an Enterprise Scheduler that doesn't even have job dependencies, and support. Before we started using it, we had a consultant come in from the Normandy Group and do an evaluation between PDI and Informatica based on our needs. His conclusion was that 95% of everything we needed to do we could do with PDI, and we didn't have to pay for it. It hasn't let me down except for some export-to-Excel stuff, but you really shouldn't be doing reporting with an ETL tool anyway. There are of course performance tradeoffs between using any ETL tool and straight SQL/scripting, but the amount of time you save and being able to reproduce your work with a tool like Pentaho makes it worth it.

If you are serious about it, I would suggest this book. And I mean read it. Bad code makes bad code regardless of whether you script this with Python or Pentaho. It is a bit of a learning curve, but worth it in my opinion.

u/flipstables · 1 point · r/datascience

Highly recommend this book:

http://www.amazon.com/Pentaho-Kettle-Solutions-Building-Integration/dp/0470635177

I believe Pentaho has some free books out there, but this was already on the shelf at a company so I picked it up.