General Feature Overview
A new version of the ABBYY CLI OCR application for Linux based on the latest OCR technology is available.
How to use the CLI application? - Samples
abbyyocr11 -if sample.jpg -f HTML -hkl -of sample.html -f RTF -rmp -of sample.rtf
-
The sample.jpg file will be recognised
-
The results will be exported to
-
The original lines in the recognised text will be retained during export to
HTML format (-hkl).
-
The source page layout will not be retained when exporting recognised text to RTF format (-rmp).
abbyyocr11 -ii -fm -if sample.jpg -tet UTF8 -of sample.txt
-
The sample.jpg file will be recognised in fast mode (-fm).
-
The colours of the prepared image will be inverted during conversion to the internal format (-ii).
-
The results will be exported to an Unicode UTF8 type text file (-tet UTF8).
Features + Functionality
ABBYY FineReader Engine CLI for Linux offers easy and instant access to ABBYY’s high quality OCR technology on the Linux platform. Processing can be easily controlled and automated via terminal/command line calls.
The following image and document formats can be opened and processed:
-
PDF
-
BMP
-
PCX
-
DCX
-
JPEG
-
JPEG2000
-
TIFF
-
PNG
b) Processing and Recognition Features:
The image processing and recognition are controlled through a set of parameters:
-
Image processing
Skew correction, image format, compression settings, image resolution, cleaning images, colour inversion, splitting of dual pages
-
Recognition Keys
Fast/balanced mode, format recognition (e.g. Italic), recognition languages that should be used, Recognition of mixed font types, such as normal text, typewriter, dot-matrix, OCR-A, OCR-B and MICR (E13b)
-
Barcode Keys
17 most popular
1D barcodes,
2D: PDF417, Aztec, DataMatrix, QRCode
positioned at any angle on a document
-
General: Miscellaneous Keys
Processing Profiles for different scenarios, e.g. archiving, text extraction or editing; multi-core-CPU processing
c) Export Options:
FineReader Engine CLI for Linux offers sophisticated output options and formats:
-
Synthesis Keys
Settings how the recognition result export should be exported, e.g. fonts, paragraphs, text color, hyperlinks…
The recognition results can be exported to these formats:
-
-
text only
-
text on image
-
image on text
-
image only
-
protected PDFs
-
-
-
-
-
-
-
-
XML
…more details can be found here: http://http://www.abbyy-developers.eu/en:tech:features:xml
Further details can be found in the documentation.
OCR Languages
ABBYY FineReader Engine for Linux recognizes over 190 OCR languages – Version 11 now also supports Arabic OCR 1)
Read more...
Barcode Types
-
1D: Check Code 39, Check Interleaved 25, Code 128, Code 39, EAN 13, EAN 8, Interleaved 25, CODABAR (without checksum), UCC Code 128, Code 2 of 5 (Industrial, IATA, Matrix), Code 93, UPC-A, UPC-E and Postnet.
-
2D: PDF 417, Aztec, DataMatrix, QRCode
Licence Add ons