local_thresholdlocal_thresholdLocalThresholdLocalThresholdlocal_threshold (Operator)

Name

local_thresholdlocal_thresholdLocalThresholdLocalThresholdlocal_threshold — Segment an image using local thresholding.

Signature

local_threshold(Image : Region : Method, LightDark, GenParamName, GenParamValue : )

Description

local_thresholdlocal_thresholdLocalThresholdLocalThresholdlocal_threshold segments a single-channel image ImageImageImageimageimage using the thresholding method given in MethodMethodMethodmethodmethod and returns the segmented region in RegionRegionRegionregionregion. Currently the operator offers only the Method 'adapted_std_deviation'"adapted_std_deviation""adapted_std_deviation""adapted_std_deviation""adapted_std_deviation". This algorithm is a text binarization technique and provides good results for document images.

Adaptive Thresholding

By selecting MethodMethodMethodmethodmethod = 'adapted_std_deviation'"adapted_std_deviation""adapted_std_deviation""adapted_std_deviation""adapted_std_deviation", a locally adaptive thresholding based on local mean and standard deviation according to Sauvola (see the paper in References) is invoked. The algorithm is able to segment document images even if they are degraded, e.g., due to inhomogeneous illumination or noise. It enables text binarization on an inhomogeneous background by taking into account the local contrast.

For a segmentation of the dark foreground (see parameter LightDarkLightDarkLightDarklightDarklight_dark), for a pixel at position (r,c), a local threshold T(r,c) is calculated within a window of size 'mask_size'"mask_size""mask_size""mask_size""mask_size" x 'mask_size'"mask_size""mask_size""mask_size""mask_size" (see the generic parameter 'mask_size'"mask_size""mask_size""mask_size""mask_size") as follows: where is the local mean value within the window and denotes the corresponding standard deviation. The parameter R (see 'range'"range""range""range""range") is the assumed maximum value of the standard deviation (R = 128 for byte images) and k (see 'scale'"scale""scale""scale""scale") a parameter that controls how much the threshold value T(r,c) differs from the mean value . If there is high contrast in the neighborhood of a point (r,c) the standard deviation has a value close to R which yields a threshold value T(r,c) close to the local mean . If the contrast is low, the local threshold is below the local mean value. For dark text on light background containing also darker regions, this lower threshold enables the segmentation of the text even in darker areas.

The parameter LightDarkLightDarkLightDarklightDarklight_dark controls, whether light or dark structures are segmented.

If LightDarkLightDarkLightDarklightDarklight_dark = 'dark'"dark""dark""dark""dark", dark structures on a light background are segmented. Every pixel p(r,c) whose gray value is smaller than the calculated local threshold T(r,c) is selected.
If LightDarkLightDarkLightDarklightDarklight_dark = 'light'"light""light""light""light", light structures on a dark background are segmented. The result is essentially the same as if the image would have been inverted and then, LightDarkLightDarkLightDarklightDarklight_dark was set to 'dark'"dark""dark""dark""dark".

By setting GenParamNameGenParamNameGenParamNamegenParamNamegen_param_name to one of the following values, additional parameters specific for the 'adapted_std_deviation'"adapted_std_deviation""adapted_std_deviation""adapted_std_deviation""adapted_std_deviation" method can be set with GenParamValueGenParamValueGenParamValuegenParamValuegen_param_value:

'mask_size'"mask_size""mask_size""mask_size""mask_size":

specifies the mask size, i.e., the size of the neighborhood in which the local threshold is calculated. The smaller the window size the thinner the segmented strokes. 'mask_size'"mask_size""mask_size""mask_size""mask_size" must be set to a value that is larger than the stroke width of the characters or structures to be segmented. If 'mask_size'"mask_size""mask_size""mask_size""mask_size" is even, the next larger odd value is used.

Suggested values: 15, 21, 31.

Default: 15.

'scale'"scale""scale""scale""scale":

sets the parameter k (), that controls how much the threshold value differs from the local mean value. Use smaller values for 'scale'"scale""scale""scale""scale" to also segment structures with a lower contrast to their background. Use larger values to suppress clutter.

Suggested values: 0.2, 0.3, 0.5.

Default: 0.2.

'range'"range""range""range""range":

sets the maximum assumed value of standard deviation R . This parameter should be adapted based on the expected gray value range. As a rule of thumb, the value for 'range'"range""range""range""range" can be set to , where MinGray and MaxGray are the minimum and maximum gray values in the image, which can be determined with min_max_graymin_max_grayMinMaxGrayMinMaxGraymin_max_gray.

Suggested values: 128, 32767.5.

Default: 128 (for byte images), 32767.5 (for uint2 images).

Attention

Note that filter operators may return unexpected results if an image with a reduced domain is used as input. Please refer to the chapter Filters.

Execution Information

Multithreading type: reentrant (runs in parallel with non-exclusive operators).
Multithreading scope: global (may be called from any thread).
Automatically parallelized on tuple level.
Automatically parallelized on internal data level.

Parameters

ImageImageImageimageimage (input_object) singlechannelimage(-array) → object (byte / uint2)

Input Image.

RegionRegionRegionregionregion (output_object) region(-array) → object

Segmented output region.

MethodMethodMethodmethodmethod (input_control) string → (string)

Segmentation method.

Default: 'adapted_std_deviation' "adapted_std_deviation" "adapted_std_deviation" "adapted_std_deviation" "adapted_std_deviation"

List of values: 'adapted_std_deviation'"adapted_std_deviation""adapted_std_deviation""adapted_std_deviation""adapted_std_deviation"

LightDarkLightDarkLightDarklightDarklight_dark (input_control) string → (string)

Extract foreground or background?

Default: 'dark' "dark" "dark" "dark" "dark"

List of values: 'dark'"dark""dark""dark""dark", 'light'"light""light""light""light"

GenParamNameGenParamNameGenParamNamegenParamNamegen_param_name (input_control) attribute.name(-array) → (string)

List of generic parameter names.

Default: []

List of values: 'mask_size'"mask_size""mask_size""mask_size""mask_size", 'range'"range""range""range""range", 'scale'"scale""scale""scale""scale"

GenParamValueGenParamValueGenParamValuegenParamValuegen_param_value (input_control) attribute.value(-array) → (integer / real)

List of generic parameter values.

Default: []

Suggested values: 0.2, 15, 30, 128.0

Possible Successors

connectionconnectionConnectionConnectionconnection, select_shapeselect_shapeSelectShapeSelectShapeselect_shape, select_grayselect_graySelectGraySelectGrayselect_gray

Alternatives

auto_thresholdauto_thresholdAutoThresholdAutoThresholdauto_threshold, binary_thresholdbinary_thresholdBinaryThresholdBinaryThresholdbinary_threshold, char_thresholdchar_thresholdCharThresholdCharThresholdchar_threshold

References

J. Sauvola, M. Pietikäinen, “Adaptive document image binarization", Pattern Recognition, 33, 225-236 (2000)

Module

Foundation

Operators