Extract step

The Extract step is essential in each and every data mapping configuration. It extracts data from the data source, based on their location (a row and column in CSV or tabular data, an XPath in XML, or a region of the page in PDF and Text) or on a JavaScript. The data is placed in the record set that is the result of the extraction workflow.

Fields always belong to an Extract step, but they don't necessarily all contain extracted data. To learn how to add fields without extracted data to an Extract step, see Fields.

Adding an Extract step

To add an Extract step, first select the step on the Steps pane after which to insert the Extract step. Then:

  • In the Data Viewer, select some data, right-click that data and choose Add Extraction, or drag & drop the data in the Data Model. For more detailed information and instructions, see: Extracting data.

  • Alternatively, right-click the Steps pane and select Add a Step > Add Extraction. Make the required settings on the Step properties pane.

If an Extract step is added within a Repeat step, the extracted data are added to a detail table by default; see Extracting transactional data and Detail tables.

Configuring an Extract step

The names, order, data type and default value of the fields extracted in an Extract step are properties of that Extract step. These and other properties can be edited via the Step properties pane. For an explanation of all the options, see Extract step properties.

Data cannot be extracted more than once in any record, unless the Extract steps are mutually exclusive. This is the case when they are located in different branches of a Condition step or Multiple Conditions step.

Inside a Detail table, multiple Extract steps may extract the same data but each of them will create a new child record in the Detail table.

If you tick the Append values to current record option when several steps are extracting the same field, the step will error out.