|
Using the wizard for PDF/VT or AFP files
The pages in PDF/VT and AFP files can be grouped on several levels. Additional information can be attached to each level in the structure. The structure and additional information are stored in the file's metadata. To extract information from the metadata in the extraction workflow itself, you have to create a JavaScript extraction (see Using scripts in the DataMapper and extractMeta()).
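As a rough illustration of such a JavaScript extraction, the sketch below reads one metadata value and falls back to a default when it is empty. It rests on several assumptions that should be checked against the extractMeta() documentation: that extractMeta() takes a level name followed by a property name, and that the last expression in the script becomes the field's value. The level name "Document", the property name "InvoiceNumber", the sample value and the helper metaOrDefault() are hypothetical and only serve the example. The stand-in function exists so that the script can also be tried outside the DataMapper.

```javascript
// Hypothetical sample metadata, used only when the script runs outside the DataMapper.
var sampleMeta = { Document: { InvoiceNumber: "INV-0001" } };

// Use the DataMapper's extractMeta() when it exists; otherwise use a stand-in
// that looks the value up in sampleMeta (assumption: level name, then property name).
var readMeta = (typeof extractMeta === "function")
  ? extractMeta
  : function (levelName, propertyName) {
      var level = sampleMeta[levelName] || {};
      return level[propertyName] !== undefined ? level[propertyName] : "";
    };

// Hypothetical helper: return the metadata value as a string,
// or the fallback when the value is missing or empty.
function metaOrDefault(levelName, propertyName, fallback) {
  var value = readMeta(levelName, propertyName);
  if (value === null || value === undefined || value === "") {
    return fallback;
  }
  return String(value);
}

// Assumption: the last evaluated expression is what the field receives.
var invoiceNumber = metaOrDefault("Document", "InvoiceNumber", "");
invoiceNumber;
```

Wrapping the call in a small helper keeps the field from ending up with a null or undefined value when a level or property is absent from a particular file.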
If the PDF doesn't contain any metadata, each page is treated as a new record (in other words, a boundary is set at the start of each new page), which is exactly what happens when you open the file without a wizard.
You can open a PDF/VT or AFP file with a wizard using the Welcome screen or the File menu.
After selecting the file, select the following options in the Metadata page:
Click Finish to close the dialog and open the actual Data Mapping configuration.
Extracting data from a PDF that comes from a Windows printer queue (a PDF that was converted to PostScript and then converted back to PDF by an Input task in Workflow) might not work (see the Connect Knowledge Base).
The rule of thumb is: if you can copy and paste text from the PDF in Acrobat, data mapping will work too; if you cannot, the DataMapper will not be able to extract the data either. |
|