You may find that when you are comparing a scanned PDF, some of the changes identified by the comparison appear illogical or unexpected. If this happens, it is because Optical Character Recognition (OCR) has been performed on your PDF.

A regular PDF contains text that can be selected, copied and edited. A scanned PDF contains only images of content: there is no actual text, just images embedded in the PDF file. To run a comparison on a scanned PDF, the images must first be converted into editable text. This conversion process, OCR, is imperfect.

Workshare automatically runs OCR when you select a scanned PDF for comparison and uses the converted version of the document for the comparison. This means that the document Workshare actually compares may not be exactly the same as the document you selected. When a scanned PDF is selected as the original document, Workshare converts it to a text-based PDF and then runs the comparison using this converted original PDF. You cannot see the converted original PDF. Consequently, the comparison results may not match what you can see in the original and modified documents.

While the conversion attempts to be as accurate as possible, some content may be converted incorrectly, for example when the scanned PDF is a document that has been photocopied multiple times or includes handwritten notes. The comparison may then indicate that text has been changed when you can see that it has not.

Imagine the original document was a scanned rental agreement where the rent had been filled in by hand as £50.00, and the modified document was a regular PDF with the rent as £50. The OCR process converts the scanned PDF and mistakenly converts the handwritten amount to 450.00. The comparison shows a change when you can see there is none. Clicking the change will show that 450 has been deleted and £50 added.

How to distinguish scanned PDF from a regular PDF?

One way of knowing whether your PDF is a scanned, image-based PDF is to try to select some text. In a regular PDF, you can select and copy text. In a scanned PDF, you cannot select text; you can only select an area of the image.

How side-by-side comparison helps you deal with scanned PDFs
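To make the rental agreement example concrete, here is a minimal Python sketch of how a single OCR misread can surface as a reported change. It is an illustration only and does not describe how Workshare performs its comparison: the `word_changes` helper, the sample sentences, and the use of Python's standard `difflib` module are all assumptions made for the example.

```python
import difflib


def word_changes(original_text, modified_text):
    """Return a list of (deleted_words, added_words) pairs from a word-level diff."""
    a = original_text.split()
    b = modified_text.split()
    matcher = difflib.SequenceMatcher(None, a, b, autojunk=False)
    changes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            changes.append((a[i1:i2], b[j1:j2]))
    return changes


# Hypothetical OCR output for the scanned original: the handwritten
# pound sign has been misread as the digit 4.
ocr_original = "The monthly rent is 450.00"

# Text of the modified document, a regular (text-based) PDF.
modified = "The monthly rent is £50"

# The diff runs against the OCR output, not against what a person
# sees on the scanned page, so the misread shows up as a change.
print(word_changes(ocr_original, modified))
```

The point of the sketch is that the comparison only ever sees the converted text, so any recognition error in that hidden intermediate text is reported as if it were an edit made by someone.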