PDF Analysis before OCR

Optical character recognition (OCR) allows the user to recognize the non-native PDF text and separate it from the real DWG / DXF text.

An important question before applying the OCR method is: "Which text in the PDF drawing is native text, and how is the non-native text represented?" Non-native text can be represented as polylines, hatches, or raster images.

Start the Print2CAD analysis and look at the separated PDF files. You can also use these files to determine the direction of the non-native text.

If all non-native text runs in one direction, the fully automatic text recognition can be applied. If the text runs in different directions, only the extended text recognition can be applied.
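
The direction rule can be written down as a small check. The following Python sketch is only an illustration and is not part of Print2CAD: the function names, the angle list, and the 1-degree tolerance are all assumptions, and the angles are assumed to have been read off the separated PDF files by hand.

```python
def angular_difference(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)


def choose_recognition_mode(angles_deg, tolerance_deg=1.0):
    """Return "automatic" if all text directions agree, else "extended".

    angles_deg    -- hypothetical list of text directions in degrees,
                     one entry per piece of non-native text
    tolerance_deg -- assumed tolerance for treating two directions as equal
    """
    if not angles_deg:
        raise ValueError("no non-native text directions supplied")
    reference = angles_deg[0]
    same_direction = all(
        angular_difference(angle, reference) <= tolerance_deg
        for angle in angles_deg
    )
    return "automatic" if same_direction else "extended"


if __name__ == "__main__":
    # All labels horizontal: fully automatic recognition is applicable.
    print(choose_recognition_mode([0.0, 0.5, 359.6]))
    # Horizontal and vertical labels mixed: extended recognition is needed.
    print(choose_recognition_mode([0.0, 90.0]))
```

Comparing every angle against the first one is sufficient here because the question is only whether a single common direction exists, not what the individual directions are.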