Digitization involves the recognition of specific content elements in the input document and converting them to a format that supports those elements in a better or more useful fashion. For example, a JPEG image might contain text but the JPEG raster content does not store the text as text. It just appears like text to our eyes, as opposed to a paragraph of text in a web page, which would be wrapped as text in a paragraph tag. When the image is embedded in a PDF or a web page, the text is not going to be available as text. Wouldn't it be great if the text was selectable, just as text on a web page or a MS Word document?
Using the Document Convert component of Document Studio .NET, you can recover text from raster images so that they can be made selectable and searchable in a converted PDF document. NOTE: This requires the use of an optional unmanaged OCR component.
NOTE: This is a trial version of the product. If you are happy with it you can purchase a registered version on our website.