Extract step

The Extract step is essential in each and every data mapping configuration. It extracts data from the data source, based on their location (a row and column in CSV or tabular data, an XPath in XML, or a region of the page in PDF and Text) or on a JavaScript. The data is placed in the record set that is the result of the extraction workflow.

Fields always belong to an Extract step, but they don't necessarily all contain extracted data. To learn how to add fields without extracted data to an Extract step, see Fields.

Adding an Extract step

To add an Extract step, first select the step on the Steps pane after which to insert the Extract step. Then:

  • In the Data Viewer, select some data, right-click that data and choose Add Extraction, or drag & drop the data in the Data Model. For more detailed information and instructions, see: Extracting data.

  • Alternatively, right-click the Steps pane and select Add a Step > Add Extraction. Make the required settings on the Step properties pane.

If an Extract step is added within a Repeat step, the extracted data are added to a detail table by default; see Extracting transactional data and Detail tables.

Configuring an Extract step

The names, order, data type and default value of the fields extracted in an Extract step are properties of that Extract step. These and other properties can be edited via the Step properties pane. For an explanation of all the options, see Extract step properties.

Fields cannot be used twice in one extraction workflow.
Different Extract steps can only write extracted data to the same field in the Data Model, if:

  • The field name is the same. (See: Renaming and ordering fields.)
  • The Extract steps are mutually exclusive. This is the case when they are located in different branches of a Condition step or Multiple Conditions step.
  • The option Append values to current record is checked in the Step properties pane under Extraction Definition.
 
  • Last Topic Update: 09, October, 2017 07:31 AM
  • Last Published: 23, May, 2019 01:55 PM