SET_UNICHARSET_PROPE(1)   SET_UNICHARSET_PROPE(1)

set_unicharset_properties - set properties about the unichars

set_unicharset_properties --U input_unicharsetfile --script_dir /path/to/langdata --O output_unicharsetfile

set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.

--script_dir /path/to/langdata

(Input) Specify the location of directory for universal script unicharsets and font xheights (type:string default:)

--U unicharsetfile

(Input) Specify the location of the unicharset to load as input.

--O unicharsetfile

(Output) Specify the location of the unicharset to be written with updated properties.

set_unicharset_properties(1) was first made available for tesseract version 3.03.

Main web site: https://github.com/tesseract-ocr Information on training: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html

tesseract(1)

Copyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0

The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).

01/18/2024