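UTF-8(7)            Miscellaneous Information Manual            UTF-8(7)

NAME
       UTF-8 - an ASCII-compatible multibyte Unicode encoding

DESCRIPTION
       The Unicode 3.0 character set occupies a 16-bit code space.  The
       most obvious Unicode encoding (known as UCS-2) consists of a
       sequence of 16-bit words.  Such strings can contain, as part of
       many 16-bit characters, bytes such as '\0' or '/', which have a
       special meaning in filenames and in other C library function
       arguments.  In addition, the majority of UNIX tools expect ASCII
       files and cannot read 16-bit words as characters without major
       modifications.  For these reasons, UCS-2 is not a suitable
       external encoding of Unicode in filenames, text files,
       environment variables, and so on.  The ISO/IEC 10646 Universal
       Character Set (UCS), a superset of Unicode, occupies an even
       larger code space (31 bits), and the obvious UCS-4 encoding for
       it (a sequence of 32-bit words) has the same problems.

       The UTF-8 encoding of Unicode and UCS does not have these
       problems and is the common way in which Unicode is used on
       UNIX-style operating systems.

   Properties
       The UTF-8 encoding has the following properties:

       o  UCS characters 0x00000000 to 0x0000007f (the classic US-ASCII
          characters) are encoded simply as bytes 0x00 to 0x7f (ASCII
          compatibility).  This means that files and strings that
          contain only 7-bit ASCII characters have the same encoding
          under both ASCII and UTF-8.

       o  All UCS characters greater than 0x7f are encoded as a
          multibyte sequence consisting only of bytes in the range 0x80
          to 0xfd, so no ASCII byte can appear as part of another
          character and there are no problems with, for example, '\0'
          or '/'.

       o  The lexicographic sorting order of UCS-4 strings is
          preserved.

       o  All possible 2^31 UCS codes can be encoded using UTF-8.

       o  The bytes 0xc0, 0xc1, 0xfe, and 0xff are never used in the
          UTF-8 encoding.

       o  The first byte of a multibyte sequence that represents a
          single non-ASCII UCS character is always in the range 0xc2 to
          0xfd and indicates how long the multibyte sequence is.  All
          further bytes in a multibyte sequence are in the range 0x80
          to 0xbf.  This allows easy resynchronization and makes the
          encoding stateless and robust against missing bytes.

       o  UTF-8 encoded UCS characters may be up to six bytes long;
          however, the Unicode standard specifies no characters above
          0x10ffff, so Unicode characters can be at most four bytes
          long in UTF-8.

   Encoding
       The following byte sequences are used to represent a character.
       The sequence to be used depends on the UCS code number of the
       character:

       0x00000000 - 0x0000007F:
           0xxxxxxx

       0x00000080 - 0x000007FF:
           110xxxxx 10xxxxxx

       0x00000800 - 0x0000FFFF:
           1110xxxx 10xxxxxx 10xxxxxx

       0x00010000 - 0x001FFFFF:
           11110xxx 10xxxxxx 10xxxxxx 10xxxxxx

       0x00200000 - 0x03FFFFFF:
           111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx

       0x04000000 - 0x7FFFFFFF:
           1111110x 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx

       The xxx bit positions are filled with the bits of the character
       code number in binary representation, most significant bit first
       (big-endian).  Only the shortest possible multibyte sequence
       that can represent the code number of the character may be used.

       The UCS code values 0xd800-0xdfff (UTF-16 surrogates) as well as
       0xfffe and 0xffff (UCS noncharacters) should not appear in
       conforming UTF-8 streams.  According to RFC 3629, no code point
       above U+10FFFF should be used, which limits characters to four
       bytes.

   Example
       The Unicode character 0xa9 = 1010 1001 (the copyright sign) is
       encoded in UTF-8 as

           11000010 10101001 = 0xc2 0xa9

       and the character 0x2260 = 0010 0010 0110 0000 (the "not equal"
       symbol) is encoded as:

           11100010 10001001 10100000 = 0xe2 0x89 0xa0

   Application notes
       Users have to select a UTF-8 locale, for example with

           export LANG=en_GB.UTF-8

       in order to activate UTF-8 support in applications.

       Application software that has to be aware of the character
       encoding in use should always set the locale, for example with

           setlocale(LC_CTYPE, "")

       and programmers can then test the expression

           streq(nl_langinfo(CODESET), "UTF-8")

       to determine whether a UTF-8 locale has been selected and
       whether, therefore, all plaintext standard input and output,
       terminal communication, plaintext file content, filenames, and
       environment variables are encoded in UTF-8.

       Programmers accustomed to single-byte encodings such as US-ASCII
       or ISO/IEC 8859 have to be aware that two assumptions made so
       far are no longer valid in UTF-8 locales.  Firstly, a single
       byte does not necessarily correspond to a single character any
       more.  Secondly, since modern terminal emulators in UTF-8 mode
       also support double-width characters and nonspacing combining
       characters, outputting a single character does not necessarily
       advance the cursor by one position as it did in ASCII.  Library
       functions such as mbsrtowcs(3) and wcswidth(3) should be used to
       count characters and cursor positions.

       The official ESC sequence to switch from an ISO/IEC 2022
       encoding scheme (as used, for instance, by VT100 terminals) to
       UTF-8 is ESC % G ("\x1b%G").  The corresponding return sequence
       from UTF-8 to ISO/IEC 2022 is ESC % @ ("\x1b%@").  Other
       ISO/IEC 2022 sequences (such as those for switching the G0 and
       G1 sets) are not applicable in UTF-8 mode.

   Security
       The Unicode and UCS standards require that producers of UTF-8
       use the shortest form possible; for example, producing a
       two-byte sequence with first byte 0xc0 is nonconforming.
       Unicode 3.1 added the requirement that conforming programs must
       not accept non-shortest forms in their input.  This is for
       security reasons: if user input is checked for possible security
       violations, a program might check only for the ASCII version of
       "/../" or ";" or NUL and overlook that there are many non-ASCII
       ways to represent these things in a non-shortest UTF-8 encoding.

STANDARDS
       ISO/IEC 10646-1:2000, Unicode 3.1, RFC 3629, Plan 9.

SEE ALSO
       locale(1), nl_langinfo(3), setlocale(3), charsets(7), unicode(7)

Linux man-pages 6.18                22 2026                     UTF-8(7)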
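
The byte layouts in the encoding table can be illustrated with a short C sketch. The function name utf8_encode() is hypothetical and not part of any library. The sketch assumes the RFC 3629 limits mentioned in the text, so it emits at most four bytes and rejects UTF-16 surrogates and values above U+10FFFF. It does not filter the noncharacters 0xfffe and 0xffff.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Encode one UCS code point as UTF-8 (illustrative helper only).
 * Stores 1 to 4 bytes in out and returns the number of bytes stored,
 * or 0 if cp is a UTF-16 surrogate or lies above U+10FFFF.
 * Each range is tested in ascending order, so the shortest possible
 * form is always the one produced.
 */
size_t
utf8_encode(uint32_t cp, unsigned char out[4])
{
    assert(out != NULL);

    if (cp <= 0x7f) {                   /* 0xxxxxxx */
        out[0] = (unsigned char) cp;
        return 1;
    }
    if (cp <= 0x7ff) {                  /* 110xxxxx 10xxxxxx */
        out[0] = (unsigned char) (0xc0 | (cp >> 6));
        out[1] = (unsigned char) (0x80 | (cp & 0x3f));
        return 2;
    }
    if (cp >= 0xd800 && cp <= 0xdfff)   /* UTF-16 surrogates */
        return 0;
    if (cp <= 0xffff) {                 /* 1110xxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char) (0xe0 | (cp >> 12));
        out[1] = (unsigned char) (0x80 | ((cp >> 6) & 0x3f));
        out[2] = (unsigned char) (0x80 | (cp & 0x3f));
        return 3;
    }
    if (cp <= 0x10ffff) {               /* 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char) (0xf0 | (cp >> 18));
        out[1] = (unsigned char) (0x80 | ((cp >> 12) & 0x3f));
        out[2] = (unsigned char) (0x80 | ((cp >> 6) & 0x3f));
        out[3] = (unsigned char) (0x80 | (cp & 0x3f));
        return 4;
    }
    return 0;                           /* above U+10FFFF */
}
```

Called with 0xa9, the sketch stores 0xc2 0xa9 and returns 2; called with 0x2260, it stores 0xe2 0x89 0xa0 and returns 3. Both results match the worked examples in the text.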