|
Extracting dataBefore you startData source settingsData source settings must be made beforehand, not only to make sure that the data is properly read but also to have it organized in a record structure that meets the purpose of the data mapping configuration (see Data source settings). It is important to set the boundaries before starting to extract data, especially transactional data (see Extracting transactional data). Boundaries determine which data blocks - lines, pages, nodes - form a record in the source data. Data that are located in different records cannot be put into the same record in the record set that is the result of the extraction workflow. Preprocessor stepThe Preprocessor step allows the application to perform actions on the data file itself before it is handed over to the Data Mapping workflow. In addition, properties can be defined in this step. These properties may be used throughout the extraction workflow. For more information, see Preprocessor step. Adding an extractionIn an extraction workflow, Extract steps are the pieces that take care of the actual data extractions.
Special conditionsThe Extract step may need to be combined with another type of step to get the desired result.
Fields cannot be used twice in one extraction workflow.
Extracting data into multiple fieldsWhen you select multiple fields in a CSV or tabular data file and extract them simultaneously, they are put into different fields in the Data Model automatically.
Adding fields to an existing Extract stepFor optimization purposes, it is better to add fields to an existing Extract step than to have a succession of extraction steps. To add fields to an existing Extract step:
When data are dropped on the Data Model, they are by default added to the last added Extract step. Editing fieldsAfter extracting some data, you may want to:
All this can be done via the Step properties pane (see Extract step properties), because the fields in the Data Model are seen as properties of an Extract step. See also: Fields. Testing the extraction workflowThe extraction workflow is always performed on the current record in the data source. When an error is encountered, the extraction workflow stops, and the field on which the error occurred and all subsequent fields will be greyed out. Click the Messages tab (next to the Step properties pane) to see any error messages. To test the extraction workflow on all records, you can:
If any errors are encountered in one or more records, an error message will be displayed. Errors encountered while performing the extraction workflow on the current record will also be visible on the Messages tab. |
|