PDF2XL Enterprise Online Help

Scanned Document Validation Mode

Scanned Document Validation Mode is used to validate the words that the OCR process retrieved from a scanned PDF document or an image file.
Pressing the Validate Data button lets the user examine each word (or each suspect word) in the defined layout area, check that the OCR module recognized it correctly, and fix it if necessary.

In Validation Mode, the Document Window changes to allow editing of the OCR data: the Conversion Preview area becomes smaller, and the Word Preview and Editing area is added. This area has two parts, the Word Preview area and the Word Editing area.

To perform the validation, the user should:

  1. For each word:
    1. Compare the image in the Word Preview area with the word text in the Word Editing area.
    2. If they differ, use the Word Editing area to correct the recognition error.
  2. Continue to the next word by pressing the <Tab> or the <Enter> key on the keyboard.
    The user can also continue to the next word by pressing the Next Word button on the Validation toolbar.
  3. If a word that contained an error was skipped, press <Shift>+<Tab> or <Shift>+<Enter> to return to the previous word.
  4. When the last word in the table is reached, PDF2XL Enterprise automatically exits Validation Mode. The user can also exit Validation Mode at any time by pressing the Finish Validation button (or the [x] close button in the top-left corner of the Word Preview and Editing area).
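
The navigation described in the steps above can be pictured with a small model. The following Python sketch is only an illustration and is not part of PDF2XL Enterprise: the `Word` and `ValidationSession` names are hypothetical, and it assumes that moving on from a word with <Tab> or <Enter> marks that word as validated.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Word:
    """One OCR-recognized word (hypothetical structure, for illustration only)."""
    text: str
    validated: bool = False


class ValidationSession:
    """Toy model of stepping through words in Validation Mode."""

    def __init__(self, words: List[Word]) -> None:
        self.words = words
        self.index = 0
        # With nothing to validate, the session is finished immediately.
        self.finished = not words

    def confirm(self, corrected: Optional[str] = None) -> None:
        """Models <Tab>/<Enter>: optionally fix the word, then move on.

        Assumption: moving on marks the current word as validated.
        """
        if self.finished:
            return
        word = self.words[self.index]
        if corrected is not None:
            word.text = corrected
        word.validated = True
        if self.index == len(self.words) - 1:
            # Past the last word: Validation Mode exits automatically.
            self.finished = True
        else:
            self.index += 1

    def back(self) -> None:
        """Models <Shift>+<Tab> / <Shift>+<Enter>: return to the previous word."""
        if not self.finished and self.index > 0:
            self.index -= 1

    def finish(self) -> None:
        """Models the Finish Validation button: exit at any time."""
        self.finished = True


session = ValidationSession([Word("lnvoice"), Word("Total")])
session.confirm("Invoice")  # fix the first word and move to the second
session.back()              # step back to re-check the first word
session.confirm()           # accept it unchanged
session.confirm()           # accept the last word; the session ends
```

In this model, confirming the last word ends the session, mirroring the automatic exit described in step 4.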

Remarks:

  1. Once you have validated or fixed a word, your work is saved and is kept even after PDF2XL Enterprise has been closed.
  2. Using the OCR Settings Page, you can choose to go over all the words or just the suspect words by selecting the Validate only suspect words option.
  3. Using the OCR Settings Page, you can make the validation process skip words that were already validated by selecting the Ignore validated words option.
  4. The currently validated word is highlighted (its background color turns yellow) in both the Document View Area and the Conversion Preview Area.

    Note that if the application is in Text Selection mode, the Conversion Preview Area displays the whole text of the page, with the currently validated word highlighted in the same way.
  5. Clicking a word in the Document View Area makes it the currently validated word.
  6. You can move directly between cells in the current table (when the currently validated word is inside a cell) using <Alt>+Arrow.
    For example, to validate the first word of the cell right below the current one, press <Alt>+<Down Arrow>.
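
To make the two OCR Settings options and the <Alt>+Arrow movement concrete, here is a minimal Python sketch. It is a hypothetical model, not PDF2XL Enterprise code: the `OcrWord` type, the `suspect` flag, the function names, and the choice to stay in place at the table edge are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class OcrWord:
    """Hypothetical word record; 'suspect' marks a low-confidence OCR result."""
    text: str
    suspect: bool = False
    validated: bool = False


def words_to_visit(words: List[OcrWord],
                   only_suspect: bool = False,
                   ignore_validated: bool = False) -> List[OcrWord]:
    """Select the words a validation pass would stop at.

    only_suspect     models "Validate only suspect words".
    ignore_validated models "Ignore validated words".
    """
    return [
        w for w in words
        if (w.suspect or not only_suspect)
        and not (ignore_validated and w.validated)
    ]


# Row/column offsets for each arrow key.
ARROWS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def move_cell(row: int, col: int, arrow: str,
              n_rows: int, n_cols: int) -> Tuple[int, int]:
    """Model <Alt>+Arrow: move to the neighboring cell in the given direction.

    Assumption: at the edge of the table the position does not change.
    """
    d_row, d_col = ARROWS[arrow]
    new_row, new_col = row + d_row, col + d_col
    if 0 <= new_row < n_rows and 0 <= new_col < n_cols:
        return new_row, new_col
    return row, col


words = [
    OcrWord("Total"),
    OcrWord("1O0.00", suspect=True),
    OcrWord("lnvoice", suspect=True, validated=True),
]
pending = words_to_visit(words, only_suspect=True, ignore_validated=True)
below = move_cell(0, 1, "down", n_rows=3, n_cols=4)
```

With both options enabled, only suspect words that have not yet been validated remain in the pass, and `move_cell(0, 1, "down", ...)` corresponds to pressing <Alt>+<Down Arrow> from the cell in the first row, second column.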

© 2009 Cogniview Ltd. All rights reserved.