August 08, 2022 | Developers Corner

Deep OCR recognition training – the next level

HALCON’s Deep OCR is very powerful and can detect and recognize text in various industrial scenes. However, what if you have a special font, or want to read foreign characters? With HALCON 22.05 it is possible to train the recognition model to read special alphabet or font you want to read and boost the performance of Deep OCR to fit exactly your application.

As you already know, for any training you need a dataset first. How can you create a suitable Deep OCR dataset?

From Deep Learning Tool 22.06, you can use the new functionality of Deep OCR Labeling. There are two possibilities:

  • You can create the Deep OCR dataset from scratch: Simply initialize a Deep OCR Project in the Deep Learning Tool (screenshot) and load the images that you would like to label. For the labeling you need to draw a rectangle around the word in the reading orientation and a text will be suggested that you could correct or confirm (screenshot).
  • Alternatively, you can use the standard example deep_ocr_prelabel_dataset.hdev. It creates a dataset based on the images to be labeled and the text suggested by the pretrained Deep OCR. You can import the created dataset (hdict file) into Deep Learning Tool 22.06 and correct the suggested labels.

In general, the Training images should comprise a representative set of the possibilities that could occur during the inference. In addition, it is recommended to use a balanced dataset, which means having roughly the same number of occurrences for all characters. And of course, excellent results require a good labeling. A good ground truth bounding box needs to look like an output of the Deep OCR detection model (screenshot). If satisfied with the dataset, you need to export the dataset as an hdict file and proceed to the next step which is the training.

The standard example deep_ocr_recognition_training_workflow.hdev guides you through the training workflow.  All you need to do is to import the extracted dataset in the training script and adjust the training parameters. An important parameter is the image width of the recognition model. It must be increased if the dataset contains images of words with a lot of characters in it. Please note that the more the setting of ImageWidth differs from its default (120) the more training data you will need, because the pretrained model was trained on the default width Therefore, it is advisable to keep the image width close to the default (120) during training. The number of epochs needs as well to be adjusted based on the training error and the task complexity.

After training the recognition model, the standard example shows you how to evaluate your finetuned model and to compare it to the pretrained model.

The last step is to integrate the finetuned model in your inference step and get ready to be impressed by the results. As always, more information can be found in the documentation (Solutions Guide I Chapter 19.2). 

Published on: August 08, 2022

Further Success Stories

MVTec HALCON product icon
3D vision enables robots to “see” in the carpentry industry
As in many industrial areas, automation is advancing in furniture production as well. One example of this is MAB Möbel AG from Switzerland: with the support of HOMAG Bohrsysteme GmbH and MVTec Software GmbH, this furniture manufacturer automated the…
Show full Story
HALCON
Machinery
Robotics
3D Vision
OCR
MVTec HALCON product icon
Warehouse where cheese wheels age during the ripening process
Revolutionizing Cheese Production with AI and Machine Vision
The food industry is experiencing a transformative shift in quality control, due in part to advances in artificial intelligence (AI). When combined with rule-based machine vision, AI is enabling automation of processes that were previously impossible…
Show full Story
HALCON
Food & Beverage
Robotics
Anomaly Detection
Deep Learning
MVTec HALCON product icon
Dynamic OCR inspection with HALCON ensures maximum product quality
In the pharmaceutical industry, strict compliance regulations require medical product packaging to be correctly labeled before delivery. To meet these requirements, the Chinese company Shenzhen Jiangrun Xuneng Technology has developed a powerful mach…
Show full Story
HALCON
Medical Supplies & Pharmaceutical
Bar Code & 2D Code Reading
OCR
MVTec HALCON product icon
Automatisierte Transport- und Inspektionsstation für pharmazeutische Ampullen mit Machine-Vision-System von Aspen.
AI-powered machine vision automates quality inspection in the pharmaceutical industry
The pharmaceutical company Aspen is improving the quality inspection of filled ampoules with the MVTec HALCON machine vision software and the consulting and technical support services provided by MVTec. With deep learning methods, the company has sig…
Show full Story
HALCON
Medical Supplies & Pharmaceutical
Blob Analysis
Deep Learning
Matching
MVTec HALCON product icon
Workflow der Produktion zylindrischer Lithium-Ionen-Batterien mit verteilten Sensorsystemen und zentraler Deep-Learning-Auswertung mit MVTec HALCON.
MVTec HALCON’s deep learning helps Panasonic Energy to propel automotive battery production
Panasonic Energy manufactures automotive batteries, for which demand is increasing worldwide, at multiple sites in Japan and North America. The company recently implemented MVTec HALCON’s deep learning technologies on a large scale at its Kansas plan…
Show full Story
HALCON
Battery Production
Automotive
Deep Learning
Inspection
MVTec HALCON product icon
Advancing machine vision education at Reutlingen University
At Reutlingen University, Professor Arnd Buschhaus has been integrating MVTec HALCON into the teaching curriculum since 2018. HALCON is used extensively in courses related to robotics and machine vision, providing students with practical experience i…
Show full Story
Education, Research & Science
3D Vision
Blob Analysis
Classification
MVTec HALCON product icon
Robotic welding cell inspecting car body weld seams using machine vision.
Machine vision optimizes quality inspection in automotive production
Consistently high quality is an absolute must in automotive production. To ensure that welded connections in body shells meet these standards, DGH has developed an application that automatically inspects them and identifies anomalies. The MVTec HALCO…
Show full Story
HALCON
Automotive
Anomaly Detection
Deep Learning
Inspection
MVTec HALCON product icon
High-speed battery inspection using machine vision.
Machine Vision automates and optimizes battery inspection
A variety of defects can occur during battery production. Averna has developed a method for the automated inspection of batteries that reliably detects all conceivable defects. Here, the machine vision software MVTec HALCON plays a key role.
Show full Story
HALCON
Battery Production
Blob Analysis
Inspection
MVTec HALCON product icon
Automated robot cell handling vaccine vials with machine vision.
Machine Vision automates vaccine production
A global company needed to quickly ramp up vaccine production. Goldfuß engineering GmbH and SIMON IBV GmbH developed a robot-based solution for loading and unloading trolleys with the vaccine vials. The machine vision software HALCON ensures the seam…
Show full Story
HALCON
Robotics
Medical Supplies & Pharmaceutical
3D Vision
MVTec HALCON product icon
Micromechanical part measured with machine vision for high-precision inspection.
Machine vision improves the measuring process of precision and micro components
The precise inspection of very small precision components, for example in watchmaking, is a challenge. Petitpierre, the Swiss specialist for high-precision technology, has developed sophisticated applications with which micromechanical components can…
Show full Story
HALCON
Matching
Measuring
MVTec HALCON product icon
Industrial robots performing laser welding on battery cells guided by machine vision.
Machine vision accelerates battery production
High speed, precision, and reliability: These are the determining factors for battery production. To meet the rapidly growing demand for batteries, companies are developing increasingly efficient production processes. Laserax has developed a new syst…
Show full Story
HALCON
Battery Production
3D Calibration
Calibration
Matching
MVTec HALCON product icon
Robotic manufacturing cell assembling satellite panels with optical solar reflectors using machine vision.
Seeing robots process large work area with high precision
The automation specialist AKEOPLUS has developed an innovative robotic cell for the production of satellite panels at the global space manufacturer Thales Alenia Space. The special feature: robots place the components with high absolute accuracy to a…
Show full Story
HALCON
Robotics
3D Vision
Matching
MVTec HALCON product icon
Students and researchers using a robot with machine vision in a university lab to demonstrate the impact of MVTec HALCON on education and research.
Technical University of Applied Sciences Würzburg-Schweinfurt
At Technical University of Applied Sciences Würzburg-Schweinfurt (THWS), Professor Christian Zirkelbach is teaching robotics and machine vision at Faculty of Applied Natural Sciences and Humanities. He has integrated advanced machine vision tools int…
Show full Story
HALCON
Robotics
Education, Research & Science
3D Vision
Deep Learning
MVTec HALCON product icon
Aluminum bottles on a production line identified by OCR using machine vision for automated quality control.
Empty bottle? Not with Deep OCR
To identify products quickly and reliably, even in difficult circumstances, machine vision has proven its worth. The company Visione Artificiale also relies on machine vision for its system to trace aluminum bottles during production. Within this sys…
Show full Story
HALCON
Food & Beverage
OCR
MVTec HALCON product icon
Operating personnel using the PackCheck machine vision solution for quality control in food packaging production.
Machine vision for every taste
The Dutch company PackCheck BV ensures packaging quality in the food industry with a modular solution. Part of the solution is the machine vision software HALCON from MVTec Software GmbH. This enables the company to increase the efficiency of packagi…
Show full Story
HALCON
Food & Beverage
Bar Code & 2D Code Reading
Blob Analysis
Matching
OCR
MVTec HALCON product icon
Hand-eye calibration enabling precise alignment between camera and robot for accurate gripping of accessory bags.
Perfectly positioned plastic bags – with classic machine vision and deep learning
Consistently automated pick-and-place activities require the reliable gripping of differently shaped and translucent objects. TEKVISA has implemented a sophisticated application that enables reliable gripping even with complex surfaces with the help…
Show full Story
HALCON
Robotics
Deep Learning
MVTec HALCON product icon
Fully automated packaging inspection system using machine vision for inline quality control in food packaging.
Deep learning detects defects in the food industry
Automation specialist INNDEO has developed a sophisticated automation solution for quality control in the packaging industry. Thanks to machine vision and deep learning technologies, high speeds and defect detection rates are no longer a problem.
Show full Story
HALCON
Food & Beverage
Deep Learning
OCR
MVTec HALCON product icon
Bottle caps inspected by machine vision to verify print quality in beverage production.
Beverage industry: HALCON reliably checks bottle cap imprints
Machine vision offers enormous advantages for quality control applications. High accuracy and speed as well as high flexibility and availability are important for practical use. K&S Anlagenbau GmbH has implemented such an application for the beverage…
Show full Story
HALCON
Food & Beverage
Blob Analysis
Calibration
Matching
MVTec HALCON product icon
Robots and machine vision software interact perfectly
The Spanish robotics specialist Tekniker has developed a solution that allows chaotically lying parts to be automatically gripped and placed in an orderly manner. The integrated machine vision software MVTec HALCON uses 3D vision technologies to ensu…
Show full Story
HALCON
Robotics
Automotive
3D Vision
MVTec MERLIC product icon
Inspection setup for electrical connections using deep learning–based machine vision to detect defects.
An eye on everything with deep learning
Bosch Car Multimédia in Portugal specializes in the development and manufacture of electronic components for vehicle safety and dynamics, driver assistance, automated driving and car multimedia. The company uses the machine vision software MERLIC fro…
Show full Story
MERLIC
Automotive
Electric Components
Deep Learning
MVTec Software