Skip to content

Extracting data from a sample file

Note

This assumes you are in a role that supports file uploads

Upload

1 Upload a sample PDF by clicking on the Uploads Menu under Personal on the left Navigation

Upload

2 Either drag and drop a file onto the Select Files area or click select files and choose your PDF.

3 Once the file is selected click the upload button.

Selected Upload

4 The file is uploaded to the server and should now be listed in the File listing .

Create Template

Since we need to create a template to parse this document select the document using the left-hand checkbox and choose Actions-Save File as Template

Save file as Template

5 Enter the Template name as Sample

Template Name Dialog

Click Templates under Personal in the left navigation menu. The Sample template should be listed so click on its name.

Once the template is loaded you can hover over any detected data fields.

Template field hover

As an example lets create a template to extract the Invoice Number and total due amount.

Create parse fields

We will create two parse fields for each data item, the item will be parsed by finding the Label first and then the related data field to its right.

First hover over the Invoice Number label and select it by clicking.

Template field select

Also select the Invoice Number itself. In the toolbar above the template click Convert to Field

Template selection convert to field

You notice two purple boxes are created, this denotes a parse field.

Now we can use a shortcut to relate the two fields together and also set that the Invoice number is found by looking for the label first.

Note

This allows for formatting changes between invoices where items may shift down or up depending on content.

Select both fields by clicking the first field and then holding down the shift key and clicking the second field Click Link Fields in the toolbar.

Template link fields

Selecting one field only and viewing the Parse Field properties on the right shows that the names of the Fields have been detected automatically, the related box has been set and the locate method set.

Template link field properties

Repeat this for the Total Due label and value.

Click Save to save the changes.

Export

Go back to Uploads and select the sample PDF.

Choose Export form the toolbar above.

Sample Export

Select the Sample template from the Select Extact Template dropdown. The Export format defaults to CSV.

Results

Click Extract on toolbar and you should see the Invoice Number and Total due values in Comma Separated Format.

Sample Export Data