Step 3: Export to PDF/A


PDF/A is new in CLI OCR Version 9.0, so the export format is not available in V8.

PDF/A Standard Keys – PDF/A MRC Keys

--pdfaExportMode

  • Specifies the mode of exporting recognised text into PDF format.
Key Parameters Default
-paem TextOnly
The recognised text will be saved as text, and the pictures will be saved as pictures. The original document layout will not be retained.
TextOnImage
The entire page image will be saved as a picture. Text areas will be saved as text over the picture.
ImageOnText
The entire page image will be saved as a picture. The recognised text is 'written' under the picture. This option is useful if you export text to document archives: the full page layout will be retained and the full-text search will be available.
ImageOnly
The entire page image will be saved as a picture.
TextOnly
–pdfaExportMode

--pdfaPictureFormat

  • Specifies the image format to be used during export to a PDF/A file with embedded pictures.
Key Parameters Default
-pafpf Automatic
JpegColor
JpegGray
ZipColor
ZipGray
CCITT4
Automatic
–pdfaPictureFormat

--pdfaPictureResolution

  • Specifies the picture resolution that will be used for exporting pictures into PDF/A format.
Key Parameters Default
-pafpr picture resolution in dpi -1, the original
resolution must be preserved.
–pdfaPictureResolution

--pdfaQuality

  • Specifies the JPEG quality for colour pictures saved to PDF/A format.
Key Parameters Default
-pafq JPEG quality in percent 50
–pdfaQuality

--pdfaAuthor

  • Stores the author's name of the PDF/A document. You may set this property to the name of the user.
Key Parameters Default
-pafa name empty string
–pdfaAuthor

--pdfaCreator

  • Stores the author's name of the PDF/A document.
    You may set this property to the name of the user.
Key Parameters Default
-pafc name empty string
–pdfaCreator

--pdfaKeywords

  • Specifies the keywords of the PDF/A document.
Key Parameters Default
-pafk keywords empty string
–pdfaKeywords

--pdfaProducer

  • Specifies the producer's name of the PDF/A document.
Key Parameters Default
-pafp name empty string
–pdfaProducer

--pdfaTitle

  • Specifies the title of the PDF/A document.
Key Parameters Default
-paft title empty string

--pdfaSubject

  • Specifies the subject of the PDF/A document.
Key Parameters Default
-pafs subject empty string
–pdfaSubject

-pdfaPaperHeight

  • Specifies paper height in twips (1/1440 of inch)
Key Parameters Default
-paph paper height in twips height of an A4 page
-pdfaPaperHeight

--pdfaPaperWidth

  • Specifies paper width in twips (1/1440 of inch).
Key Parameters Default
-papw paper width in twips width of an A4 page
–pdfaPaperWidth

--pdfaReleasePageSizeByLayoutSize

  • The values of the -paph and -papw keys define the page size.
Key Parameters Default
-pars no
–pdfaReleasePageSizeByLayoutSize
  • Specifies that the hyperlinks are exported as text.
Key Parameters Default
-parl no

--pdfaWritePDFA1A

  • Specifies that the recognized text is exported to PDF/A-1a format.
Key Parameters Default
-paa1 no
–pdfaWritePDFA1A

--pdfaWriteTaggedPDF

  • Specifies that the recognized text is exported to tagged PDF.
Key Parameters Default
-pakt no
–pdfaWriteTaggedPDF