Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Pentaho Data Integration Quick Start Guide
  • Toc
  • feedback
Pentaho Data Integration Quick Start Guide

Pentaho Data Integration Quick Start Guide

By : Carina Roldán
3 (1)
close
Pentaho Data Integration Quick Start Guide

Pentaho Data Integration Quick Start Guide

3 (1)
By: Carina Roldán

Overview of this book

Pentaho Data Integration(PDI) is an intuitive and graphical environment packed with drag and drop design and powerful Extract-Transform-Load (ETL) capabilities. Given its power and flexibility, initial attempts to use the Pentaho Data Integration tool can be difficult or confusing. This book is the ideal solution. This book reduces your learning curve with PDI. It provides the guidance needed to make you productive, covering the main features of Pentaho Data Integration. It demonstrates the interactive features of the graphical designer, and takes you through the main ETL capabilities that the tool offers. By the end of the book, you will be able to use PDI for extracting, transforming, and loading the types of data you encounter on a daily basis.
Table of Contents (8 chapters)
close

Understanding the purpose of PDI jobs


There are two main artifacts in PDI: transformations and jobs. While transformations deal mainly with data, jobs deal with tasks or processes. With a job, you organize a list of tasks by indicating the order of execution and deciding whether the execution should depend on certain conditions. Consider the following sample PDI job:

Sample PDI job

One of the good things about PDI is that by simply taking a look at the graphical representation of a job or a transformation in Spoon, it is quite easy to deduce its general purpose. In this example, we can see that the sample job checks some database connections. If everything is okay, it executes a daily process. Then, it compress some files in a ZIP format, putting the files in a remote server with FTP. If there is a problem with the database connection, the job sends an email and aborts.

Let's list some more examples of PDI jobs, as follows:

  • Download a list of files from a server. Validate the data that is stored...
bookmark search playlist download font-size

Change the font size

margin-width

Change margin width

day-mode

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Delete Bookmark

Modal Close icon
Are you sure you want to delete it?
Cancel
Yes, Delete