Table of Contents
List of the predefined languages
Here is the list of internal names of the predefined languages that are supported in ABBYY FineReader Engine Linux (= CLI is based on the version 11 of the SDK).
There are 2 types of languages
-
languages that have full built-in dictionary support
-
“simple” languages with a specific language definition, for example allowed/used character sets.
Standard Languages
Internal name | Recognition language | Can be used for OCR | Full dictionary support available |
---|---|---|---|
Abkhaz |
Abkhaz |
+ | |
Adyghe |
Adyghe |
+ | |
Afrikaans |
Afrikaans |
+ | |
Agul |
Agul |
+ | |
Albanian |
Albanian |
+ | |
Altaic |
Altaic |
+ | |
Arabic |
Arabic (Saudi Arabia) |
+ | + |
ArmenianEastern |
Armenian (Eastern) |
+ | + |
ArmenianGrabar |
Armenian (Grabar) |
+ | + |
ArmenianWestern |
Armenian (Western) |
+ | + |
Awar |
Avar |
+ | |
Aymara |
Aymara |
+ | |
AzeriCyrillic |
Azerbaijani (Cyrillic) |
+ | |
AzeriLatin |
Azerbaijani (Latin) |
+ | + |
Bashkir |
Bashkir |
+ | + |
Basic |
Basic programming language |
+ | |
Basque |
Basque |
+ | |
Belarusian |
Belarussian |
+ | |
Bemba |
Bemba |
+ | |
Blackfoot |
Blackfoot |
+ | |
Breton |
Breton |
+ | |
Bugotu |
Bugotu |
+ | |
Bulgarian |
Bulgarian |
+ | + |
Buryat |
Buryat |
+ | |
C++ |
C/C++ programming language |
+ | |
Catalan |
Catalan |
+ | + |
Chamorro |
Chamorro |
+ | |
Chechen |
Chechen |
+ | |
Chemistry |
Simple chemical formulas |
+ | |
ChinesePRC |
Chinese Simplified |
+ | |
ChinesePRC+English* |
Chinese Simplified and English |
+ | |
ChineseTaiwan |
Chinese Traditional |
+ | |
ChineseTaiwan+English* |
Chinese Traditional and English |
+ | |
Chukcha |
Chukcha |
+ | |
Chuvash |
Chuvash |
+ | |
"CMC7">CMC7 |
For MICR CMC-7 text type |
+ | |
Cobol |
Cobol programming language |
+ | |
Corsican |
Corsican |
+ | |
CrimeanTatar |
Crimean Tatar |
+ | |
Croatian |
Croatian |
+ | + |
Crow |
Crow |
+ | |
Czech |
Czech |
+ | + |
Danish |
Danish |
+ | + |
Dargwa |
Dargwa |
+ | |
Digits |
Numbers |
+ | |
Dungan |
Dungan |
+ | |
Dutch |
Dutch (Netherlands) | + | + |
DutchBelgian |
Dutch (Belgium) | + | |
E13B |
For MICR (E-13B) text type |
+ | |
English |
English |
+ | + |
EskimoCyrillic |
Eskimo (Cyrillic) |
+ | |
EskimoLatin |
Eskimo (Latin) |
+ | |
Esperanto |
Esperanto |
+ | |
Estonian |
Estonian |
+ | + |
Even |
Even |
+ | |
Evenki |
Evenki |
+ | |
Faeroese |
Faeroese |
+ | |
Fijian |
Fijian |
+ | |
Finnish |
Finnish |
+ | + |
Fortran |
Fortran programming language |
+ | |
French |
French |
+ | + |
Frisian |
Frisian |
+ | |
Friulian |
Friulian |
+ | |
GaelicScottish |
Scottish Gaelic |
+ | |
Gagauz |
Gagauz |
+ | |
Galician |
Galician |
+ | |
Ganda |
Ganda |
+ | |
German |
German |
+ | + |
GermanNewSpelling |
German (new spelling) |
+ | + |
GermanLuxembourg |
German (Luxembourg) |
+ | |
Greek |
Greek |
+ | + |
Guarani |
Guarani |
+ | |
Hani |
Hani |
+ | |
Hausa |
Hausa |
+ | |
Hawaiian |
Hawaiian |
+ | |
Hebrew |
Hebrew |
+ | + |
Hungarian |
Hungarian |
+ | + |
Icelandic |
Icelandic |
+ | |
Ido |
Ido |
+ | |
Indonesian |
Indonesian |
+ | + |
Ingush |
Ingush |
+ | |
Interlingua |
Interlingua |
+ | |
Irish |
Irish |
+ | |
Italian |
Italian |
+ | + |
Japanese |
Japanese |
+ | + |
Japanese+English* |
Japanese and English |
+ | + |
Java |
Java programming language |
+ | |
Kabardian |
Kabardian |
+ | |
Kalmyk |
Kalmyk |
+ | |
KarachayBalkar |
Karachay-Balkar |
+ | |
Karakalpak |
Karakalpak |
+ | |
Kasub |
Kasub |
+ | |
Kawa |
Kawa |
+ | |
Kazakh |
Kazakh |
+ | |
Khakas |
Khakas |
+ | |
Khanty |
Khanty |
+ | |
Kikuyu |
Kikuyu |
+ | |
Kirgiz |
Kirghiz |
+ | |
Kongo |
Kongo |
+ | |
Korean |
Korean |
+ | + |
Korean+English* |
Korean and English |
+ | + |
KoreanHangul |
Korean (Hangul) |
+ | + |
Koryak |
Koryak |
+ | |
Kpelle |
Kpelle |
+ | |
Kumyk |
Kumyk |
+ | |
Kurdish |
Kurdish |
+ | |
Lak |
Lak |
+ | |
Lappish |
Sami (Lappish) |
+ | |
Latin |
Latin |
+ | + |
Latvian |
Latvian |
+ | + |
LatvianGothic |
Latvian language written in Gothic script |
+ | |
Lezgin |
Lezgin |
+ | |
Lithuanian |
Lithuanian |
+ | + |
Luba |
Luba |
+ | |
Macedonian |
Macedonian |
+ | |
Malagasy |
Malagasy |
+ | |
Malay |
Malay |
+ | |
Malinke |
Malinke |
+ | |
Maltese |
Maltese |
+ | |
Mansi |
Mansi |
+ | |
Maori |
Maori |
+ | |
Mari |
Mari |
+ | |
Maya |
Maya |
+ | |
Miao |
Miao |
+ | |
Minankabaw |
Minangkabau |
+ | |
Mixed* |
Russian and English |
+ | + |
Mohawk |
Mohawk |
+ | |
Mongol |
Mongol |
+ | |
Mordvin |
Mordvin |
+ | |
Nahuatl |
Nahuatl |
+ | |
Nenets |
Nenets |
+ | |
Nivkh |
Nivkh |
+ | |
Nogay |
Nogay |
+ | |
Norwegian |
NorwegianNynorsk and NorwegianBokmal |
+ | + |
NorwegianBokmal |
Norwegian (Bokmal) |
+ | + |
NorwegianNynorsk |
Norwegian (Nynorsk) |
+ | + |
Nyanja |
Nyanja |
+ | |
Occidental |
Occidental |
+ | |
OcrA |
For OCR-A text type |
+ | |
OcrB |
For OCR-B text type |
+ | |
Ojibway |
Ojibway |
+ | |
Papiamento |
Papiamento |
+ | |
Pascal |
Pascal programming language |
+ | |
PidginEnglish |
Tok Pisin |
+ | |
Polish |
Polish |
+ | + |
PortugueseBrazilian |
Portuguese (Brazil) |
+ | + |
PortugueseStandard |
Portuguese (Portugal) |
+ | + |
Provencal |
Provencal |
+ | |
Quechua |
Quechua |
+ | |
RhaetoRomanic |
Rhaeto-Romanic |
+ | |
Romanian |
Romanian |
+ | + |
RomanianMoldavia |
Romanian (Moldavia) |
+ | |
Romany |
Romany |
+ | |
Ruanda |
Ruanda |
+ | |
Rundi |
Rundi |
+ | |
RussianOldSpelling |
Russian (old spelling) |
+ | + |
Russian |
Russian |
+ | + |
RussianWithAccent |
Russian (with accents marking stress position) |
+ | + |
Samoan |
Samoan |
+ | |
Selkup |
Selkup |
+ | |
SerbianCyrillic |
Serbian (Cyrillic) |
+ | |
SerbianLatin |
Serbian (Latin) |
+ | |
Shona |
Shona |
+ | |
Sioux |
Sioux (Dakota) |
+ | |
Slovak |
Slovak |
+ | + |
Slovenian |
Slovenian |
+ | + |
Somali |
Somali |
+ | |
Sorbian |
Sorbian |
+ | |
Sotho |
Sotho |
+ | |
Spanish |
Spanish |
+ | + |
Sunda |
Sunda |
+ | |
Swahili |
Swahili |
+ | |
Swazi |
Swazi |
+ | |
Swedish |
Swedish |
+ | + |
Tabassaran |
Tabassaran |
+ | |
Tagalog |
Tagalog |
+ | |
Tahitian |
Tahitian |
+ | |
Tajik |
Tajik |
+ | |
Tatar |
Tatar |
+ | + |
Thai |
+ | + | |
Tinpo |
Jingpo |
+ | |
Tongan |
Tongan |
+ | |
Tswana |
Tswana |
+ | |
Tun |
Tun |
+ | |
Turkish |
Turkish |
+ | + |
Turkmen |
Turkmen |
+ | |
TurkmenLatin |
Turkmen (Latin) |
+ | |
Tuvin |
Tuvan |
+ | |
Udmurt |
Udmurt |
+ | |
UighurCyrillic |
Uighur (Cyrillic) |
+ | |
UighurLatin |
Uighur (Latin) |
+ | |
Ukrainian |
Ukrainian |
+ | + |
UzbekCyrillic |
Uzbek (Cyrillic) |
+ | |
UzbekLatin |
Uzbek (Latin) |
+ | |
Vietnamese |
Vietnamese |
+ | + |
Visayan |
Cebuano |
+ | |
Welsh |
Welsh |
+ | |
Wolof |
Wolof |
+ | |
Xhosa |
Xhosa |
+ | |
Yakut |
Yakut |
+ | |
Yiddish |
Yiddish |
+ | |
Zapotec |
Zapotec |
+ | |
Zulu |
Zulu |
+ |
* These are compound recognition languages. The compound predefined languages are to be removed in future versions.
Note: Support for historic fonts and the related "old" languages are not included per default in ABBYYs CLI OCR.
Language Keys when the CJK module is licensed
-
ChinesePRC
-
ChineseTaiwan
-
Japanese
-
Korean
-
KoreanHangul