PDF to Excel

PDF to Excel OCR

PDF to Excel Ent.

PDF to Excel CLI

Update

PDF2XL

PDF2XL OCR

PDF2XL Ent.

PDF2XL CLI

Checkout

Upgrade

Corporate

Update

Contact us

Press Room

Privacy policy

Legal notice

PDF2XL OCR Online Help

Table of Contents User Interface Dialogs Settings Format Settings Page

Format Settings Page

Conversion Output OCR Advanced Number character set Currency character set Date character set Time character set Minus sign character set Move right-side negative sign to left Replace parentheses with negative sign Keep indentation Keep line wrap General Fixes: White text General Fixes: Convert Superscript OCR: When conversion format changes

This page is one of the pages in the Settings dialog.

This page allows you to change various format settings for the PDF2XL OCR application, both for viewing and for converting.

Number character set

Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as numeric.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as numeric.

Currency character set

Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as currency.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as currency.

Date character set

Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as date.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as date.

Time character set

Checking this option will ensure that only the characters in the box will be converted in columns and fields marked as time.
This character set will also be used to limit the OCR engine when performing OCR on columns and fields marked as time.

Minus sign character set

Any character in this list will automatically be converted to the minus (negative) sign in fields and columns marked as numeric.

Move right-side negative sign to left

If this box is checked, PDF2XL OCR will move any negative sign on the right side of numeric fields to the left, so Excel will be able to regard the data as a number.
If the box is cleared, the numeric fields and columns will not be changed.

Replace parentheses with negative sign

If this box is checked, PDF2XL OCR will replace a parentheses around numeric fields with a minus sign on the left.
If the box is cleared, the numeric fields and columns containing parantheses will not be changed.

Keep indentation

If this box is checked, PDF2XL OCR will retain the relative indentation of all the columns or fields marked as text.
This will effect all the lines of text in case of fields or cells containing more then one line of text if the Keep line wrap option is set for text fields and column

Keep line wrap

Checking this box will ensure PDF2XL OCR will keep line wrapping for all fields and cells marked as text when converting.
Note that line wrapping will not be kept for CSV or Clipboard conversions, due to the fact the Excel and similar applications will not receive the data correctly in such a case.

General Fixes: White text

When converting data using the font's attributes - specifically, color - white colored text can disappear if the background color of the resulting document is white; as the default background color of Word and Excel is white, this can cause some issues.
This setting can fix this problem by letting the user select the output color for white and nearly-white colored text; the user can keep the original color by clearing the check box.
Note that this problem only occurs if the user set the Keep text attributes option for either Excel or Word and Powerpoint in the Output Settings page of the Settings dialog.

General Fixes: Convert Superscript

Clearing this option will remove any superscript data from the conversion.
This is mostly useful when there are footnote or annotation marking next to numbers, and you wish to ignore them when converting so Excel won't consider them a part of the number.

When conversion format changes

This option allows the user to select what happens when the Conversion Format is changed for a column or field while in Scanned Document Mode.
Selecting 'Perform OCR' will automatically run OCR on the selected columns or fields.
Selecting 'Leave as is' will keep the OCR results.
Selecting 'Ask me' will display the OCR Conversion Format dialog (displayed below), in which you can select which of the previous actions to perform:

This option is set to 'Ask me' by default.


Additional Site Links:

Important PDF and Excel sites:

2009 Cogniview Ltd. All rights reserved