August 08, 2022 | Developers Corner
As you already know, any training starts with a dataset. So how do you create a suitable Deep OCR dataset?
In general, the training images should cover a representative set of the variations that can occur during inference. It is also recommended to use a balanced dataset, i.e., one with roughly the same number of occurrences for each character. And of course, excellent results require good labeling: a good ground-truth bounding box should look like the output of the Deep OCR detection model (see screenshot). Once you are satisfied with the dataset, export it as an hdict file and proceed to the next step, the training.
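A quick way to check whether a labeled dataset is balanced is to count how often each character appears across all word labels. The sketch below is a generic illustration in Python (it is not part of the HALCON workflow; the function names and the example labels are made up for demonstration):

```python
from collections import Counter

def character_histogram(labels):
    """Count how often each character occurs across all word labels."""
    counts = Counter()
    for word in labels:
        counts.update(word)
    return counts

def imbalance_ratio(counts):
    """Ratio of most to least frequent character (1.0 = perfectly balanced)."""
    values = counts.values()
    return max(values) / min(values)

# Hypothetical labels from a dataset
labels = ["LOT1A", "LOT2B", "BATCH1"]
hist = character_histogram(labels)
```

If the ratio is large, consider collecting or augmenting images containing the rare characters before training.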
The standard example deep_ocr_recognition_training_workflow.hdev guides you through the training workflow. All you need to do is import the exported dataset into the training script and adjust the training parameters. An important parameter is the image width of the recognition model: it must be increased if the dataset contains words with many characters. Please note that the more the ImageWidth setting differs from its default (120), the more training data you will need, because the pretrained model was trained with the default width. It is therefore advisable to keep the image width close to the default (120) during training. The number of epochs also needs to be adjusted based on the training error and the task complexity.
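One way to decide whether the default image width suffices is to estimate the pixel width the longest word in your dataset needs. The heuristic below is an assumption for illustration only (the per-character width of 12 px and the rounding to a multiple of 32 are made-up values, not HALCON defaults); it merely encodes the advice above: stay at the default whenever it fits.

```python
import math

def suggested_image_width(max_chars, px_per_char=12, default=120, multiple=32):
    """Rough heuristic: width needed so the longest word still fits,
    rounded up to a multiple; keep the default whenever it suffices.
    px_per_char and multiple are illustrative assumptions."""
    needed = max_chars * px_per_char
    if needed <= default:
        return default
    return math.ceil(needed / multiple) * multiple
```

For example, words of up to 8 characters fit within the default width, while 15-character words would suggest a larger setting, and hence a larger training set.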
After training the recognition model, the standard example shows you how to evaluate your finetuned model and compare it to the pretrained one.
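A simple metric for such a comparison is word-level accuracy: the fraction of words recognized exactly. The sketch below is a minimal stand-in for the evaluation in the standard example, with hypothetical prediction lists:

```python
def word_accuracy(predictions, ground_truth):
    """Fraction of words recognized exactly (case-sensitive)."""
    assert len(predictions) == len(ground_truth)
    correct = sum(p == g for p, g in zip(predictions, ground_truth))
    return correct / len(ground_truth)

# Hypothetical results: the pretrained model confuses 0/O and 7/1,
# while the finetuned model reads all words correctly.
gt         = ["LOT1A", "EXP2024", "B-77"]
pretrained = ["LOT1A", "EXP2O24", "B-71"]
finetuned  = ["LOT1A", "EXP2024", "B-77"]
```

Comparing the two scores on the same validation split shows directly how much the finetuning improved recognition.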
The last step is to integrate the finetuned model into your inference step and get ready to be impressed by the results. As always, more information can be found in the documentation (Solutions Guide I, Chapter 19.2).