
Tableau Prep Cookbook

In this recipe, we'll connect to a PDF file that contains both text and a table of data. Tableau Prep can automatically detect tables in PDF files and extract their data for you.
To follow along with the recipe, download the Sample Files 2.2 folder from the book's GitHub repository.
To get started, ensure you have the sample PDF file ready on your computer, and open Tableau Prep Builder:
Figure 2.14 – Sample PDF file with a table embedded in it
Figure 2.15 – Select PDF file from the Connect pane
Figure 2.16 – PDF tables are automatically extracted
Figure 2.17 – Tableau Prep can detect multiple tables in a single PDF file
In this recipe, you have learned how to connect to PDF files and extract data for processing in Tableau Prep.
Tableau Prep converts each table in a PDF document into a separate data table when it ingests the file into a new flow. As such, Tableau Prep removes the complexity of parsing PDF documents and allows you to treat a PDF file like any other data connection.
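To build intuition for what "one data table per PDF table" means, here is a minimal Python sketch of the idea. It is not Tableau Prep's actual detection logic, which this recipe does not describe. The function names `detect_tables` and `to_records`, the rule that columns are separated by two or more spaces, and the sample page text are all assumptions made purely for illustration.

```python
import re


def detect_tables(lines, min_rows=2):
    """Group consecutive lines that split into the same number (>= 2) of columns.

    Hypothetical heuristic: a "column break" is a run of two or more spaces.
    """
    tables, current = [], []
    for line in lines:
        cells = re.split(r"\s{2,}", line.strip())
        is_row = len(cells) >= 2
        # Extend the current table while the column count stays the same.
        if is_row and (not current or len(cells) == len(current[0])):
            current.append(cells)
            continue
        # Otherwise close the current table (if it is long enough) and restart.
        if len(current) >= min_rows:
            tables.append(current)
        current = [cells] if is_row else []
    if len(current) >= min_rows:
        tables.append(current)
    return tables


def to_records(table):
    """Treat the first row as the header and return one dict per data row."""
    header, *rows = table
    return [dict(zip(header, row)) for row in rows]


# Made-up text standing in for the text layer of a PDF page.
page_text = [
    "Quarterly sales report",
    "",
    "Region  Sales",
    "East  100",
    "West  250",
    "",
    "Notes on the figures above.",
    "Product  Units  Price",
    "Widget  3  9.99",
]

tables = detect_tables(page_text)
for number, table in enumerate(tables, start=1):
    print(f"Table {number}:", to_records(table))
```

Each detected table becomes its own list of records, much as each table found in a PDF appears as its own data table in Tableau Prep. The surrounding prose lines are ignored because they do not split into multiple columns.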