|
PDF2XL OCR Online Help
Table of Contents User Interface Dialogs Settings Format Settings Page
Format Settings Page
This page is one of the pages in the Settings dialog.

This page allows you to change various format settings for the PDF2XL OCR application, both for viewing and for converting.
Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as numeric.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as numeric.
Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as currency.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as currency.
Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as date.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as date.
Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as time.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as time.
Any character in this list will automatically be converted to the minus (negative) sign in fields and columns marked as numeric.
If this box is checked, PDF2XL OCR will move any negative sign on the right side of numeric fields to the left, so Excel will be able to regard the data as a number.
If the box is cleared, the numeric fields and columns will not be changed.
If this box is checked, PDF2XL OCR will replace a parentheses around numeric fields with a minus sign on the left.
If the box is cleared, the numeric fields and columns containing parantheses will not be changed.
If this box is checked, PDF2XL OCR will retain the relative indentation of all the columns or fields marked as text.
This will effect all the lines of text in case of fields or cells containing more then one line of text if the Keep line wrap option is set for text fields and column
Checking this box will ensure PDF2XL OCR will keep line wrapping for all fields and cells marked as text when converting.
Note that line wrapping will not be kept for CSV or Clipboard conversions, due to the fact the Excel and similar applications will not receive the data correctly in such a case.
When converting data using the font's attributes - specifically, color - white colored text can disappear if the background color of the resulting document is white; as the default background color of Word and Excel is white, this can cause some issues.
This setting can fix this problem by letting the user select the output color for white and nearly-white colored text; the user can keep the original color by clearing the check box.
Note that this problem only occurs if the user set the Keep text attributes option for either Excel or Word and Powerpoint in the Output Settings page of the Settings dialog.
Clearing this option will remove any superscript data from the conversion.
This is mostly useful when there are footnote or annotation marking next to numbers, and you wish to ignore them when converting so Excel won't consider them a part of the number.
This option allows the user to select what happens when the Conversion Format is changed for a column or field while in Scanned Document Mode.
Selecting 'Perform OCR' will automatically run OCR on the selected columns or fields.
Selecting 'Leave as is' will keep the OCR results.
Selecting 'Ask me' will display the OCR Conversion Format dialog (displayed below), in which you can select which of the previous actions to perform:

This option is set to 'Ask me' by default.
|