ocr4linux.com / Documentation Home / Step 2: layout analysis and recognition
Key | Parameters | Default | Description |
---|---|---|---|
-rl |
--recognitionLanguage |
||
list of recognition languages |
English |
Sets the recognition language which should to be one of the predefined ABBYY FineReader Engine languages. |
|
-rdt |
--recognitionDocumentType |
||
AutoDetect |
AutoDetect |
Selects one of the three recognized document type: autodetect, single-column, plain text. |
|
-db |
--detectBarcodes |
||
no |
|
Detects barcodes and creates barcode blocks during the page layout analysis. If this key is not available, barcodes may be detected as blocks of some other type (e.g. pictures). |
|
-it |
--dontClearBackgroundNoise |
||
no |
|
Disallows ABBYY FineReader Engine to remove the background noise from a temporary image used for recognition. |
|
--printType |
|||
Normal Typewriter Matrix OCR_A OCR_B MICR_E13B |
Normal |
Sets the text type of recognized document.
|
|
-fm |
--fastMode |
||
no |
|
Provides 2-2,5 times faster recognition speed at the cost of moderate increased error rate (1,5-2 times more errors). For text printed with good quality ABBYY FineReader Engine makes 1-2 errors per page on average. The increase of errors can be easily eliminated by using various scenarios, e.g. the full text indexing with “fuzzy” search, preliminary recognition, etc. |
|
-tol |
--oneLinePerTextCell |
||
no |
|
Presumes that the text in the block to which the current RecognizerParams object corresponds contains no more than one string. |
|
-tnm |
--noMergedCellInTable |
||
no |
|
Recognizes tables with no merged cells. The table layout will be analyzed more readily. |
|
-ido |
--detectImageOrientetion |
||
no |
|
Detects page orientation during layout analysis, and if it differs from the normal, rotate the image. |
|
-sdp |
--splitDualPages |
||
no |
|
Splits dual pages |
Note: Full keys are marked by italic.
ABBYYOCR CLI for Linux - Documentation Home, Image processing keys, Input file keys, Output file keys, Miscellaneous,
Export to: HTML format, PDF format, RTF format, DBF format, XML format, TXT format