ML Kit language identification: supported languages
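Supported languages, listed by BCP-47 code with the script each is written in. Codes ending in -Latn denote the same language written in Latin transliteration: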
BCP-47 Code | Language | Script |
af | Afrikaans | Latin |
am | Amharic | Ge'ez |
ar | Arabic | Arabic |
ar-Latn | Arabic | Latin |
az | Azerbaijani | Latin |
be | Belarusian | Cyrillic |
bg | Bulgarian | Cyrillic |
bg-Latn | Bulgarian | Latin |
bn | Bengali | Bengali |
bs | Bosnian | Latin |
ca | Catalan | Latin |
ceb | Cebuano | Latin |
co | Corsican | Latin |
cs | Czech | Latin |
cy | Welsh | Latin |
da | Danish | Latin |
de | German | Latin |
el | Greek | Greek |
el-Latn | Greek | Latin |
en | English | Latin |
eo | Esperanto | Latin |
es | Spanish | Latin |
et | Estonian | Latin |
eu | Basque | Latin |
fa | Persian | Arabic |
fi | Finnish | Latin |
fil | Filipino | Latin |
fr | French | Latin |
fy | Western Frisian | Latin |
ga | Irish | Latin |
gd | Scots Gaelic | Latin |
gl | Galician | Latin |
gu | Gujarati | Gujarati |
ha | Hausa | Latin |
haw | Hawaiian | Latin |
he | Hebrew | Hebrew |
hi | Hindi | Devanagari |
hi-Latn | Hindi | Latin |
hmn | Hmong | Latin |
hr | Croatian | Latin |
ht | Haitian | Latin |
hu | Hungarian | Latin |
hy | Armenian | Armenian |
id | Indonesian | Latin |
ig | Igbo | Latin |
is | Icelandic | Latin |
it | Italian | Latin |
ja | Japanese | Japanese |
ja-Latn | Japanese | Latin |
jv | Javanese | Latin |
ka | Georgian | Georgian |
kk | Kazakh | Cyrillic |
km | Khmer | Khmer |
kn | Kannada | Kannada |
ko | Korean | Korean |
ku | Kurdish | Latin |
ky | Kyrgyz | Cyrillic |
la | Latin | Latin |
lb | Luxembourgish | Latin |
lo | Lao | Lao |
lt | Lithuanian | Latin |
lv | Latvian | Latin |
mg | Malagasy | Latin |
mi | Maori | Latin |
mk | Macedonian | Cyrillic |
ml | Malayalam | Malayalam |
mn | Mongolian | Cyrillic |
mr | Marathi | Devanagari |
ms | Malay | Latin |
mt | Maltese | Latin |
my | Burmese | Myanmar |
ne | Nepali | Devanagari |
nl | Dutch | Latin |
no | Norwegian | Latin |
ny | Nyanja | Latin |
pa | Punjabi | Gurmukhi |
pl | Polish | Latin |
ps | Pashto | Arabic |
pt | Portuguese | Latin |
ro | Romanian | Latin |
ru | Russian | Cyrillic |
ru-Latn | Russian | Latin |
sd | Sindhi | Arabic |
si | Sinhala | Sinhala |
sk | Slovak | Latin |
sl | Slovenian | Latin |
sm | Samoan | Latin |
sn | Shona | Latin |
so | Somali | Latin |
sq | Albanian | Latin |
sr | Serbian | Cyrillic |
st | Sesotho | Latin |
su | Sundanese | Latin |
sv | Swedish | Latin |
sw | Swahili | Latin |
ta | Tamil | Tamil |
te | Telugu | Telugu |
tg | Tajik | Cyrillic |
th | Thai | Thai |
tr | Turkish | Latin |
uk | Ukrainian | Cyrillic |
ur | Urdu | Arabic |
uz | Uzbek | Latin |
vi | Vietnamese | Latin |
xh | Xhosa | Latin |
yi | Yiddish | Hebrew |
yo | Yoruba | Latin |
zh | Chinese | Chinese |
zh-Latn | Chinese | Latin |
zu | Zulu | Latin |
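The table pairs each BCP-47 code with a language name and a script, and romanized variants carry a `-Latn` script subtag. A minimal sketch of how an application might interpret an identified code follows. It assumes the code has already been obtained from the language identifier and does not show that call. The `SUPPORTED` dictionary is a hand-copied subset of the table, and `describe_tag` is a hypothetical helper name, not part of ML Kit.

```python
# Illustrative subset of the table above: BCP-47 code -> (language, script).
# A real application would include every row it cares about.
SUPPORTED = {
    "ar": ("Arabic", "Arabic"),
    "ar-Latn": ("Arabic", "Latin"),
    "en": ("English", "Latin"),
    "hi": ("Hindi", "Devanagari"),
    "hi-Latn": ("Hindi", "Latin"),
    "ja": ("Japanese", "Japanese"),
    "ja-Latn": ("Japanese", "Latin"),
}


def describe_tag(tag):
    """Hypothetical helper: look up a code and report whether it is a romanized variant.

    Returns (base_code, language, script, is_romanized), or None when the
    code is not in the subset above.
    """
    entry = SUPPORTED.get(tag)
    if entry is None:
        return None
    language, script = entry
    # Split "hi-Latn" into the base language code and the optional script subtag.
    base_code, _, script_subtag = tag.partition("-")
    return base_code, language, script, script_subtag == "Latn"


if __name__ == "__main__":
    for code in ("hi", "hi-Latn", "xx"):
        print(code, describe_tag(code))
```

Keeping the romanized variants as separate keys mirrors the table, where `hi` and `hi-Latn` are distinct rows. An application that only needs the language can group results by the base code.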
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-07-10 UTC.