Developers Corner

Deep OCR – Tips and Tricks

Have you already experienced the performance boost by using Deep OCR compared to the classical rule‑based approaches? In this article, we’ll show you practical tips and tricks to further improve your Deep OCR results.

Deep OCR outperforms the classical approaches in many ways. The most obvious point is the better recognition rate. But also, the independence of the font type, the polarity and the orientation of the text are strong arguments for Deep OCR. In this article we will show you some tips and tricks how to further improve your Deep OCR results.

First, you should take your time to think about the mode which is suitable for your application. In Deep OCR two modes are available: ‘detection’ and ‘recognition’. The mode ‘detection’ finds connected characters as words in the image and gives you the bounding box of the word. The mode ‘recognition’ translates cropped images in machine-readable text.

Deep OCR detection example
Detection and Recognition in Deep OCR

In a lot of applications, both modes ‘detection’ and ‘recognition’ are needed. In this case you can use the mode 'auto'. Have in mind that HALCON provides a powerful toolbox to write your own algorithm to detect the text you are interested in. This could be very useful, if you are only interested in specific text lines in an image full of text, or if you would like to decrease your runtime.

In the following we give you some hints how to improve your Deep OCR performance in case of the modes ‘detection’ and ‘recognition’. But before evaluating the results, you should look at your image data and think about preprocessing methods such as enhancing the contrast.

Detection

If you want to improve your ‘detection’ result, you should take a look into the score maps. You can find the score maps in the DeepOcrResult handle. Especially, the character score map and the link score map can give you useful insights about the ‘detection’ results. This information could be used to adapt the important parameters ’detection_min_word_score’ and ‘detection_min_character_score’.

If your text is already horizontally aligned, set the parameter ‘detection_orientation’ = 0 and you will not have any issues with imperfectly aligned rectangles.

In case of large images with small text use the powerful parameter ‘detection_tiling’. Setting this parameter to 'true' splits the images internally into tiles. The tilling is done fully automatically, and you don't have to care about the tiled images, but you can simply enjoy the detected text. However, bear in mind that the runtime will increase, depending on the size of the processed image.

Recognition

If you are surprised by the results you get from the recognition mode, we recommend opening the DeepOcrResult handle and take a look into the preprocessed image which is used for the recognition. In many cases this image gives you already an explanation, why the ‘recognition’ fails. The most common issue is the length of the text line, which can be handled by setting the parameter 'recognition_image_width'. A detailed example can be found in the standard example ‘deep_ocr_workflow.hdev’.

Finally, it is always worth to reconsider whether Deep OCR is the right choice. Even if Deep OCR outperforms the rule-based approaches, there are still use cases (speed restrictions, specific font types, hardware limitations or pixel-precise segmentation) which justify the choice of classical approaches.

Further articles

Visualize Object Model 3D
Improve your surface-based matching with two helpful features
Do you sometimes have objects, which have rather small symmetry-breaking elements (such as small boreholes on an object)? Does your surface-based 3D matching not find the correct orientation?
Read more
Developers Corner
Deep OCR Interface
Deep OCR recognition training – the next level
HALCON’s Deep OCR is very powerful and can detect and recognize text in various industrial scenes. However, what if you have a special font, or want to read foreign characters? With HALCON 22.05 it is possible to train the recognition model to read s…
Read more
Developers Corner
Fitted primitives distance threshold
Metrology Model – Quality of Fit
"For most applications, the standard parameter values are sufficient." This sentence is often read in the HALCON Solution Guide. But what if the results do not meet your expectations? The Metrology Model allows lightning-fast measurement of geometric…
Read more
Developers Corner
Review of Acquisition Modes
This article gives an overview of HALCON’s image acquisition modes, explaining how continuous, triggered, and synchronous acquisition work, and clarifying common misconceptions for practical applications.
Read more
Developers Corner
Easy text and code reading with MERLIC standard tools
If you want to build an MVApp that reads for example QR codes or bar codes you can do so with just a few clicks. You can even combine the tools to get all available information printed on a product in different formats without the need for programmin…
Read more
Developers Corner
About MVTec's Heatmap
Imagine you intend to deliver a HALCON deep-learning-based classification application. And you are about to evaluate a trained model. You are therefore looking for feedback about this model, i.e. about its performances, biases, and other possible def…
Read more
Developers Corner
Gabor filter: What is it for?
Gabor filters, which are well known in the realm of time series analyses, can also be used in HALCON for 2D image analysis. They are particularly useful for detecting textures, patterns, and orientations in complex images.
Read more
Developers Corner
Introduction to new sub-pixel feature of bar code reader
Do you have small resolution bar codes to read but don't get any good results? Then please try our new feature – the subpixel bar code reader.
Read more
Developers Corner
Introduction to XYZ-Mappings (part 2)
This technical article continues our introduction to XYZ-mappings. In the last article, we answered the question "What are XYZ-mappings?" and gave a short preview towards "Why is using XYZ-mappings beneficial for many 3D applications?". Today, we wil…
Read more
Developers Corner
Introduction to XYZ-Mappings (part 1)
This technical article explores the benefits of XYZ-mappings in HALCON, showing how they increase speed, flexibility, and ease of use for many 3D applications.
Read more
Developers Corner
Deep learning: Why is the dataset key for a success result?
Deep learning success starts with the dataset: Learn why acquiring high-quality, well-labeled training data is crucial for reliable classification, detection, segmentation, and anomaly detection in your machine vision applications.
Read more
Developers Corner
How to prepare 3D height images for further processing with MERLIC’s standard tools
Learn how to prepare 3D height images in MERLIC for further processing: convert non-byte images to byte images to enable alignment, embossed text reading, and defect detection with standard easyTouch tools.
Read more
Developers Corner
Deflectrometry demo HALCON
Inspection of specular surfaces with deflectometry in HALCON
Inspect flat and curved reflective surfaces quickly and reliably with HALCON deflectometry: detect scratches, dents, and other defects with synchronized image acquisition and flexible image processing.
Read more
Developers Corner
Deep Learning classifier HALCON
Training a deep learning classifier with HALCON on the embedded board Jetson TX2
Learn how to train a deep learning classifier with HALCON on both a PC and an embedded Jetson TX2 board, from image acquisition to model training and inference, for efficient machine vision applications.
Read more
Developers Corner
Add touch input to the HSmartWindowControlWPF
Learn how to easily add touch input, including pinch-to-zoom, to the HSmartWindowControlWPF in HALCON, leveraging WPF’s built-in multi-touch events for intuitive image control.
Read more
Developers Corner
Increasing Speed in Deflectometry Set-ups
Discover three approaches to increase speed in deflectometry setups, from simple software-based synchronization to hardware triggers and FPGA-based real-time control, enabling faster and more precise inspection of reflective surfaces.
Read more
Developers Corner
HDevelop matching assistant speedup greediness
Speeding up shape-based matching with "Greediness"
Learn how the 'Greediness' parameter in shape-based matching balances speed and detection completeness, enabling faster searches while maintaining robust results in HALCON.
Read more
Developers Corner
Best practice for classification and OCR
Discover best practices for setting up classification and OCR in HALCON using the HDevelop OCR Training File Browser – quickly review, correct, and optimize your training data to improve segmentation and classification results.
Read more
Developers Corner
Rejection all classes 1.5
How to use rejection classes in MVTec HALCON
Learn how to handle outlier samples in MVTec HALCON by using rejection classes in MLP classifiers – automatically generate samples outside the training classes to improve classification reliability.
Read more
Developers Corner
Using regularization Weight prior 0.1 large
Using Regularization in MLP Classification
Learn how to use regularization in HALCON to prevent overfitting in MLP classifiers, smooth decision boundaries, and achieve better generalization for new and unseen data.
Read more
Developers Corner
MVTec Software