|
PDF2XL Enterprise Online Help
Table of Contents Concepts OCR - Optical Character Recognition Scanned PDF
Scanned PDF Document
PDF documents come in two flavours: regular, and scanned.
Regular PDF documents are usually created by using Acrobat, or by printing a document into PDF using a specialized printer driver.
Scanned PDF documents are documents created from an image of a scanned page, wrapped as a PDF file for viewing purposes only.
Scanned PDF documents can be distinguished from regular PDF documents by trying to selected words using the Text Selection mode of PDF2XL Enterprise while in the Regular Document mode. If the words are selected as you click-and-drag, this is a regular document. If they are not, this is probably a scanned PDF file.
When a PDF document is loaded into PDF2XL Enterprise, it is set to be in Regular Document Mode. When the user marks a table or a field, PDF2XL Enterprise tries to find the text matching it. If this is a regular PDF document, this operation is successful.
If the document is a scanned PDF document, no data is available to the application, and PDF2XL Enterprise will suggest trying the Scanned PDF Document Mode:

Authors of the scanned PDF documents should try to make sure that the documents are scanned in black-and-white for best results.
OKing the question will result in setting the Scanned PDF Document Mode.
|