List of the predefined languages

Here is the list of internal names of the predefined languages that are supported in ABBYY FineReader Engine Linux (= CLI is based on the version 11 of the SDK).

There are 2 types of languages

  • languages that have full built-in dictionary support
  • “simple” languages with a specific language definition, for example allowed/used character sets.

Standard Languages

Internal name Recognition language Can be used for OCR Full dictionary support available

Abkhaz

Abkhaz

+

Adyghe

Adyghe

+  

Afrikaans

Afrikaans

+  

Agul

Agul

+  

Albanian

Albanian

+  

Altaic

Altaic

+  

Arabic

Arabic (Saudi Arabia)

+ +

ArmenianEastern

Armenian (Eastern)

+ +

ArmenianGrabar

Armenian (Grabar)

+ +

ArmenianWestern

Armenian (Western)

+ +

Awar

Avar

+  

Aymara

Aymara

+  

AzeriCyrillic

Azerbaijani (Cyrillic)

+  

AzeriLatin

Azerbaijani (Latin)

+ +

Bashkir

Bashkir

+ +

Basic

Basic programming language

+  

Basque

Basque

+  

Belarusian

Belarussian

+  

Bemba

Bemba

+  

Blackfoot

Blackfoot

+  

Breton

Breton

+  

Bugotu

Bugotu

+  

Bulgarian

Bulgarian

+ +

Buryat

Buryat

+  

C++

C/C++ programming language

+  

Catalan

Catalan

+ +

Chamorro

Chamorro

+  

Chechen

Chechen

+  

Chemistry

Simple chemical formulas

+  

ChinesePRC

Chinese Simplified

+  

ChinesePRC+English*

Chinese Simplified and English

+  

ChineseTaiwan

Chinese Traditional

+  

ChineseTaiwan+English*

Chinese Traditional and English

+  

Chukcha

Chukcha

+  

Chuvash

Chuvash

+  

"CMC7">CMC7

For MICR CMC-7 text type

+  

Cobol

Cobol programming language

+  

Corsican

Corsican

+  

CrimeanTatar

Crimean Tatar

+  

Croatian

Croatian

+ +

Crow

Crow

+  

Czech

Czech

+ +

Danish

Danish

+ +

Dargwa

Dargwa

+  

Digits

Numbers

+  

Dungan

Dungan

+  

Dutch

Dutch (Netherlands) + +

DutchBelgian

Dutch (Belgium) +  

E13B

For MICR (E-13B) text type

+  

English

English

+ +

EskimoCyrillic

Eskimo (Cyrillic)

+  

EskimoLatin

Eskimo (Latin)

+  

Esperanto

Esperanto

+  

Estonian

Estonian

+ +

Even

Even

+  

Evenki

Evenki

+  

Faeroese

Faeroese

+  

Fijian

Fijian

+  

Finnish

Finnish

+ +

Fortran

Fortran programming language

+  

French

French

+ +

Frisian

Frisian

+  

Friulian

Friulian

+  

GaelicScottish

Scottish Gaelic

+  

Gagauz

Gagauz

+  

Galician

Galician

+  

Ganda

Ganda

+  

German

German

+ +

GermanNewSpelling

German (new spelling)

+ +

GermanLuxembourg

German (Luxembourg)

+  

Greek

Greek

+ +

Guarani

Guarani

+  

Hani

Hani

+  

Hausa

Hausa

+  

Hawaiian

Hawaiian

+  

Hebrew

Hebrew

+ +

Hungarian

Hungarian

+ +

Icelandic

Icelandic

+  

Ido

Ido

+  

Indonesian

Indonesian

+ +

Ingush

Ingush

+  

Interlingua

Interlingua

+  

Irish

Irish

+  

Italian

Italian

+ +

Japanese

Japanese

+ +

Japanese+English*

Japanese and English

+ +

Java

Java programming language

+  

Kabardian

Kabardian

+  

Kalmyk

Kalmyk

+  

KarachayBalkar

Karachay-Balkar

+  

Karakalpak

Karakalpak

+  

Kasub

Kasub

+  

Kawa

Kawa

+  

Kazakh

Kazakh

+  

Khakas

Khakas

+  

Khanty

Khanty

+  

Kikuyu

Kikuyu

+  

Kirgiz

Kirghiz

+  

Kongo

Kongo

+  

Korean

Korean

+ +

Korean+English*

Korean and English

+ +

KoreanHangul

Korean (Hangul)

+ +

Koryak

Koryak

+  

Kpelle

Kpelle

+  

Kumyk

Kumyk

+  

Kurdish

Kurdish

+  

Lak

Lak

+  

Lappish

Sami (Lappish)

+  

Latin

Latin

+ +

Latvian

Latvian

+ +

LatvianGothic

Latvian language written in Gothic script

+  

Lezgin

Lezgin

+  

Lithuanian

Lithuanian

+ +

Luba

Luba

+  

Macedonian

Macedonian

+  

Malagasy

Malagasy

+  

Malay

Malay

+  

Malinke

Malinke

+  

Maltese

Maltese

+  

Mansi

Mansi

+  

Maori

Maori

+  

Mari

Mari

+  

Maya

Maya

+  

Miao

Miao

+  

Minankabaw

Minangkabau

+  

Mixed*

Russian and English

+ +

Mohawk

Mohawk

+  

Mongol

Mongol

+  

Mordvin

Mordvin

+  

Nahuatl

Nahuatl

+  

Nenets

Nenets

+  

Nivkh

Nivkh

+  

Nogay

Nogay

+  

Norwegian

NorwegianNynorsk and NorwegianBokmal

+ +

NorwegianBokmal

Norwegian (Bokmal)

+ +

NorwegianNynorsk

Norwegian (Nynorsk)

+ +

Nyanja

Nyanja

+  

Occidental

Occidental

+  

OcrA

For OCR-A text type

+  

OcrB

For OCR-B text type

+  

Ojibway

Ojibway

+  

Papiamento

Papiamento

+  

Pascal

Pascal programming language

+  

PidginEnglish

Tok Pisin

+  

Polish

Polish

+ +

PortugueseBrazilian

Portuguese (Brazil)

+ +

PortugueseStandard

Portuguese (Portugal)

+ +

Provencal

Provencal

+  

Quechua

Quechua

+  

RhaetoRomanic

Rhaeto-Romanic

+  

Romanian

Romanian

+ +

RomanianMoldavia

Romanian (Moldavia)

+  

Romany

Romany

+  

Ruanda

Ruanda

+  

Rundi

Rundi

+  

RussianOldSpelling

Russian (old spelling)

+ +

Russian

Russian

+ +

RussianWithAccent

Russian (with accents marking stress position)

+ +

Samoan

Samoan

+  

Selkup

Selkup

+  

SerbianCyrillic

Serbian (Cyrillic)

+  

SerbianLatin

Serbian (Latin)

+  

Shona

Shona

+  

Sioux

Sioux (Dakota)

+  

Slovak

Slovak

+ +

Slovenian

Slovenian

+ +

Somali

Somali

+  

Sorbian

Sorbian

+  

Sotho

Sotho

+  

Spanish

Spanish

+ +

Sunda

Sunda

+  

Swahili

Swahili

+  

Swazi

Swazi

+  

Swedish

Swedish

+ +

Tabassaran

Tabassaran

+  

Tagalog

Tagalog

+  

Tahitian

Tahitian

+

Tajik

Tajik

+  

Tatar

Tatar

+ +

Thai

Thai

+ +

Tinpo

Jingpo

+  

Tongan

Tongan

+  

Tswana

Tswana

+  

Tun

Tun

+  

Turkish

Turkish

+ +

Turkmen

Turkmen

+  

TurkmenLatin

Turkmen (Latin)

+  

Tuvin

Tuvan

+  

Udmurt

Udmurt

+  

UighurCyrillic

Uighur (Cyrillic)

+  

UighurLatin

Uighur (Latin)

+  

Ukrainian

Ukrainian

+ +

UzbekCyrillic

Uzbek (Cyrillic)

+  

UzbekLatin

Uzbek (Latin)

+  

Vietnamese

Vietnamese

+ +

Visayan

Cebuano

+  

Welsh

Welsh

+  

Wolof

Wolof

+  

Xhosa

Xhosa

+  

Yakut

Yakut

+  

Yiddish

Yiddish

+  

Zapotec

Zapotec

+  

Zulu

Zulu

+  

* These are compound recognition languages. The compound predefined languages are to be removed in future versions.

Note: Support for historic fonts and the related "old" languages are not included per default in ABBYYs CLI OCR.

 

Language Keys when the CJK module is licensed

  • ChinesePRC
  • ChineseTaiwan
  • Japanese
  • Korean
  • KoreanHangul