PDF to Excel

PDF to Excel OCR

PDF to Excel Ent.

PDF to Excel CLI

Update

PDF2XL

PDF2XL OCR

PDF2XL Ent.

PDF2XL CLI

Checkout

Upgrade

Corporate

Update

Contact us

Press Room

Privacy policy

Legal notice

PDF2XL Enterprise Online Help

Table of Contents Concepts OCR - Optical Character Recognition Scanned PDF

Scanned PDF Document

PDF documents come in two flavours: regular, and scanned.

Regular PDF documents are usually created by using Acrobat, or by printing a document into PDF using a specialized printer driver.
Scanned PDF documents are documents created from an image of a scanned page, wrapped as a PDF file for viewing purposes only.

Scanned PDF documents can be distinguished from regular PDF documents by trying to selected words using the Text Selection mode of PDF2XL Enterprise while in the Regular Document mode. If the words are selected as you click-and-drag, this is a regular document. If they are not, this is probably a scanned PDF file.

When a PDF document is loaded into PDF2XL Enterprise, it is set to be in Regular Document Mode. When the user marks a table or a field, PDF2XL Enterprise tries to find the text matching it. If this is a regular PDF document, this operation is successful.
If the document is a scanned PDF document, no data is available to the application, and PDF2XL Enterprise will suggest trying the Scanned PDF Document Mode:

Authors of the scanned PDF documents should try to make sure that the documents are scanned in black-and-white for best results.

OKing the question will result in setting the Scanned PDF Document Mode.


Additional Site Links:

Important PDF and Excel sites:

2009 Cogniview Ltd. All rights reserved