What languages does the character encoding UTF-8 support

charset utf-8

What written and spoken languages does UTF-8 support?

How many languages does UTF-8 support?

Almost duplicate (closed): What languages does the encoding UTF-8 support?

Best Answer

UTF-8 supports any Unicode character, which in practice means any natural language (Coptic, Sinhala, Phoenician, Cherokee, etc.), as well as many non-spoken languages (music notation, mathematical symbols, APL).
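As a rough illustration, here is a minimal Python sketch that encodes one character from several of the scripts mentioned above and checks that each survives a UTF-8 round trip. The helper name `utf8_roundtrip` and the choice of sample characters are mine, picked only for the example.

```python
# One sample code point per script; the selection is illustrative only.
samples = {
    "Coptic": "\u2c80",
    "Sinhala": "\u0d85",
    "Phoenician": "\U00010900",
    "Cherokee": "\u13a0",
    "Music notation": "\U0001d11e",
    "APL": "\u2336",
}


def utf8_roundtrip(text):
    """Encode text as UTF-8 and report the bytes and whether decoding restores it."""
    encoded = text.encode("utf-8")
    return encoded, encoded.decode("utf-8") == text


for script, char in samples.items():
    encoded, ok = utf8_roundtrip(char)
    print(f"{script}: U+{ord(char):04X} -> {len(encoded)} bytes, round trip ok: {ok}")
```

Code points below U+10000 take at most three bytes, and everything above that (Phoenician and the musical symbols here) takes four.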

The stated objective of the Unicode Consortium is to encompass all communications. The few exceptions that are not well supported (like Klingon) usually have a Roman-alphabet equivalent and/or an unofficial assignment in a Unicode private use area.
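To make the private-use point concrete, here is a small sketch. It assumes the unofficial ConScript Unicode Registry convention of placing Klingon letters at U+F8D0 to U+F8FF. That range is not part of the Unicode standard, and a font has to agree on the same convention for anything sensible to be displayed. The helper `is_private_use` is a name invented for this example.

```python
import unicodedata

# Assumption: first Klingon letter under the unofficial ConScript convention.
KLINGON_A = "\uf8d0"


def is_private_use(ch):
    """Return True if the character's general category is 'Co' (private use)."""
    return unicodedata.category(ch) == "Co"


print(is_private_use(KLINGON_A))   # the code point lies in the Private Use Area
print(KLINGON_A.encode("utf-8"))   # UTF-8 still encodes it like any other code point
```

UTF-8 itself is indifferent to what a private-use code point means; it only carries the number.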

If you're concerned about a particular language, you're better off asking about that one specifically.

See http://www.unicode.org/charts/index.html, which shows all the major code blocks (character sets) supported by Unicode. Typically a block corresponds to a script or language family, but the correspondence is not exactly one-to-one.