ABBYY FineReader Engine 9.0 CLI for Linux
Export to HTML

www.ocr4linux.com / Documentation Home / Step 3: HTML keys

 

 

Key Parameters Default Description

-ha

--htmlAuthor

name

empty string

Specifies the name of the author of the HTML document.

-hkl

--htmlKeepLines

no

 

Retains the original lines in recognised text during export.

-hp

--htmlSeparatePages

no

 

If several pages are exported to HTML format, the <HR> tag is inserted between pages which makes browser draw a horizontal rule.

-hsp

--htmlSkipPictures

no

 

Pictures and references to them will not be saved to the resultant HTML file.

-huu

--htmlUseUnicode

no

 

Uses Unicode when writing recognised text into the HTML file.

-hrc

--htmlReleaseTextAndBackgroundcolor

no

 

Original colors of text and background will not be retained when exporting recognised text to HTML format.

-hk

--htmlKeywords

keywords

empty string

Specifies the keywords of the HTML document.

-hsm

--htmlSynthesisMode

PlainText
The recognised text will be divided only into paragraphs by  <p> tags.

FormattedStream
Paragraphs and fonts of the original document will be retained in the output HTML file. The <p> tag is used.

PageLayout
The full source page layout will be retained by using table.

PageLayout

Specifies the mode of synthesis of HTML code from the recognised text.

-hfm

--htmlVersion

Format32
Simple format using the HTML 3.2 standard.

Format40
Full format using the HTML 4.0 standard.

TwoFormats32_40
Both formats (Simple and Full) are saved to the same file.

Format40

Specifies the HTML version used for export. Export may be done to HTML 3.2 files that can be viewed by old browsers, HTML 4.0 files that can be viewed by new browsers or to an HTML file of a compromise format that is suitable for different types of browser.

-hcpt

--htmlCodePageType

ANSI
ANSI code page.

OEM
OEM code page.

Macintosh
Macintosh code page.

ISO
ISO code page.

ANSI

The code page of the export file will be determined automatically. The value of this property is valid only when exporting non-Unicode text and the -hcp key doesn't specify any code page.

-hcp

--htmlCodePage

predefined Code Page

Null

Specifies the code page which will be used for exporting recognised text. The value of this property is valid when exporting non-Unicode text. If this key doesn't specify any code page (Null), the code page is selected automatically by using the -hcpt key value.

-hpf

--htmlPictureFormat

Automatic
Jpegcolor
JpegGray
PngBlackWhite
Pngcolor
PngGray
(see descriptions in ExportPictureFormatEnum)

Automatic

Specifies the image format to be used during export to HTML; images are saved to separate files.

-hpr

--htmlPictureResolution

picture resolution in dpi

72

Specifies the picture resolution in dpi that is used for exporting pictures into HTML format.

-hq

--htmlQuality

JPEG quality in percent

50

Specifies the value of the JPEG quality, percents, for color pictures saved to HTML format in percent.

-hs

--htmlSubject

subject

empty string

Specifies the subject of the HTML document.

-ht

--htmlTitle

title

empty string

Specifies the title of the HTML document.

Export keys for both HTML and RTF formats:

Key Parameters Default Description

-rtn

--retainLayout

no

 

Retains the full page layout.

-sp

--skipPictures

no

 

Pictures will not be saved to the resulting file.

Note: Full keys are marked by italic.

See also

ABBYYOCR CLI for Linux - Documentation Home