do_ocr_word_cnn (Operator)

Name

do_ocr_word_cnn - Classify a related group of characters with a CNN-based OCR classifier.

Signature

do_ocr_word_cnn(Character, Image : : OCRHandle, Expression, NumAlternatives, NumCorrections : Class, Confidence, Word, Score)

Description

do_ocr_word_cnn works like do_ocr_multi_class_cnn insofar as it computes the best class for each of the characters given by the regions Character and the gray values Image with the OCR classifier OCRHandle, and returns the classes in Class and the corresponding confidences (probabilities) of the classes in Confidence.

In contrast to do_ocr_multi_class_cnn, do_ocr_word_cnn treats the group of characters as an entity which yields a Word by concatenating the class names for each character region. This makes it possible to restrict the allowed classification results on a textual level by specifying an Expression describing the expected word.

The Expression may restrict the word to belong to a predefined lexicon created using create_lexicon or import_lexicon by specifying the name of the lexicon in angle brackets, as in '<mylexicon>'. If the Expression is of any other form, it is interpreted as a regular expression with the same syntax as specified for tuple_regexp_match. Note that you will usually want to use an expression of the form '^...$' when using variable quantifiers like '*', to ensure that the entire word is matched by the expression. Also note that, in contrast to tuple_regexp_match, do_ocr_word_cnn does not support passing extra options in an expression tuple.
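The effect of the '^...$' anchors can be illustrated outside HALCON. The following is a minimal sketch that uses Python's re module as a stand-in; its syntax overlaps with, but is not guaranteed to be identical to, the syntax of tuple_regexp_match. The sample word and patterns are made up for illustration.

```python
import re

word = "AB1234X"        # hypothetical concatenated classification result
pattern = "[0-9]*"      # variable quantifier without anchors
anchored = "^[0-9]*$"   # same pattern, anchored to the entire word

# Unanchored: '*' also matches the empty string, so a match is always found
# and the expression restricts nothing.
unanchored_matches = re.search(pattern, word) is not None

# Anchored: every character of the word must be a digit.
anchored_matches = re.search(anchored, word) is not None
digits_only_matches = re.search(anchored, "1234") is not None
```

Without the anchors the expression accepts any word, so no correction would ever be triggered by it.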

If the word derived from the best class for each character does not match the Expression, do_ocr_word_cnn attempts to correct it by considering the NumAlternatives best classes for each character. The alternatives used are identical to those returned by do_ocr_single_class_cnn for a single character. The operator tests all possible corrections in which the classification result is changed for at most NumCorrections character regions. Note that NumAlternatives and NumCorrections affect the complexity of the algorithm, so that in some cases internal restrictions are made. See the section 'Complexity' below for further information.
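The correction search described above can be sketched as a brute-force enumeration. This is an illustrative Python sketch, not the operator's actual implementation: the function name correct_word, the input layout (one list of candidate classes per character, best first), and the order in which candidates are tried are all assumptions, and Python's re module stands in for the HALCON regular expression syntax.

```python
from itertools import combinations, product
import re

def correct_word(alternatives, expression, num_corrections):
    """Return (word, number of changed characters) or (None, None).

    alternatives[i] lists the candidate classes of character i, best first
    (hypothetical input layout).
    """
    best = [alts[0] for alts in alternatives]
    n = len(alternatives)
    # Try 0, 1, 2, ... changed positions so that fewer corrections win.
    for k in range(min(num_corrections, n) + 1):
        for positions in combinations(range(n), k):
            # Each chosen position switches to one of its non-best classes.
            choices = [alternatives[p][1:] for p in positions]
            for replacement in product(*choices):
                candidate = list(best)
                for p, cls in zip(positions, replacement):
                    candidate[p] = cls
                word = "".join(candidate)
                if re.search(expression, word):
                    return word, k
    return None, None

# Hypothetical alternatives for three character regions.
alts = [["O", "0", "Q"], ["1", "I", "l"], ["2", "Z", "z"]]
result = correct_word(alts, "^[0-9]+$", 1)
```

In this example the uncorrected word "O12" fails the expression, and changing the first character to its second-best class yields "012" with one correction.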

If the Expression is a lexicon and the above procedure did not yield a result, the most similar word in the lexicon is returned as long as it requires fewer than NumCorrections edit operations for the correction (see suggest_lexicon).
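The lexicon fallback can be pictured as a nearest-word lookup. The sketch below assumes that "edit operations" means the Levenshtein distance (insertions, deletions, substitutions); the exact metric used by suggest_lexicon is not stated here, and the helper names edit_distance and suggest are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance, computed row by row with dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # deletion
                current[j - 1] + 1,            # insertion
                previous[j - 1] + (ca != cb),  # substitution or match
            ))
        previous = current
    return previous[-1]

def suggest(word, lexicon, num_corrections):
    """Closest lexicon word if it needs fewer than num_corrections edits."""
    closest = min(lexicon, key=lambda entry: edit_distance(word, entry))
    if edit_distance(word, closest) < num_corrections:
        return closest
    return None

# Hypothetical lexicon and misread word (digit 0 instead of letter O).
suggestion = suggest("HALC0N", ["HALCON", "FALCON"], 2)
```

Note the strict comparison: with NumCorrections set to 1, a word that needs one edit would not be returned in this sketch.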

The resulting word is graded by a Score between 0.0 (no correction found) and 1.0 (original word correct). The Score is lowered by a penalty according to the number of corrected characters and another (minor) penalty depending on how many classes with higher confidences have been ignored in order to match the Expression. Here, num_corr denotes the actual number of applied corrections and num_alt the total number of discarded alternatives.

Note that this is a combinatorial score which does not reflect the original Confidence of the best Class.

A string consisting of the character '\032' (alternatively displayed as '\0x1A') in Class signifies that the region has been classified as the rejection class.
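When post-processing the results, rejected regions can be found by comparing against this control character. A minimal Python sketch, assuming the classes have already been copied into a plain list of strings (the helper name rejected_indices and the sample data are hypothetical):

```python
# Octal 032 and hexadecimal 1A denote the same character (decimal 26).
REJECTION_CLASS = "\x1a"

def rejected_indices(classes):
    """Indices of the regions that were assigned to the rejection class."""
    return [i for i, cls in enumerate(classes) if cls == REJECTION_CLASS]

classes = ["A", "\032", "7"]   # hypothetical Class output
rejected = rejected_indices(classes)
```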

Execution Information

Multithreading type: reentrant (runs in parallel with non-exclusive operators).
Multithreading scope: global (may be called from any thread).
Processed without parallelization.

Parameters

Character (input_object) region(-array) → object

Characters to be recognized.

Image (input_object) singlechannelimage → object (byte / uint2)

Gray values of the characters.

OCRHandle (input_control) ocr_cnn → (handle)

Handle of the OCR classifier.

Expression (input_control) string → (string)

Expression describing the allowed word structure.

NumAlternatives (input_control) integer → (integer)

Number of classes per character considered for the internal word correction.

Default value: 3

Suggested values: 3, 4, 5

Typical range of values: 1 ≤ NumAlternatives

NumCorrections (input_control) integer → (integer)

Maximum number of corrected characters.

Default value: 2

Suggested values: 1, 2, 3, 4, 5

Typical range of values: 0 ≤ NumCorrections

Class (output_control) string(-array) → (string)

Result of classifying the characters with the CNN.

Number of elements: Class == Character

Confidence (output_control) real(-array) → (real)

Confidence of the class of the characters.

Number of elements: Confidence == Character

Word (output_control) string → (string)

Word text after classification and correction.

Score (output_control) real → (real)

Measure of similarity between corrected word and uncorrected classification results.

Complexity

The complexity of checking all possible corrections is of magnitude O((a*n)^min(c,n)), where a is the number of alternatives, n is the number of character regions, and c is the number of allowed corrections. However, to guard against excessive run times in case of large n, c is internally clipped to 5, 3, or 1 if a*n >= 30, 60, or 90, respectively.
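The clipping rule and the growth of the search space can be made concrete with a small calculation. This Python sketch is one reading of the thresholds stated above (the strictest limit that applies wins); the function names are hypothetical, and the candidate count assumes that each corrected region switches to one of its a - 1 non-best alternatives.

```python
from math import comb

def effective_corrections(a, n, c):
    """Number of corrections after clipping, by the thresholds on a*n."""
    load = a * n
    if load >= 90:
        limit = 1
    elif load >= 60:
        limit = 3
    elif load >= 30:
        limit = 5
    else:
        return c          # below all thresholds: no clipping
    return min(c, limit)

def candidate_count(a, n, c):
    """Words tested when at most c of n regions may change their class."""
    return sum(comb(n, k) * (a - 1) ** k for k in range(min(c, n) + 1))

# Example: 3 alternatives, 20 regions, 4 requested corrections.
clipped = effective_corrections(3, 20, 4)
```

For a = 3 and n = 4, allowing one correction gives 9 candidate words, while allowing two already gives 33, which illustrates why c is limited for long words.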

Result

If the parameters are valid, the operator do_ocr_word_cnn returns the value 2 (H_MSG_TRUE). If necessary, an exception is raised.

Possible Predecessors

read_ocr_class_cnn

Alternatives

do_ocr_multi_class_cnn

Module

OCR/OCV
