Supported character sets and encoding

Exasol supports the Unicode and ASCII character sets. Unicode characters are defined using code points, which in Exasol are transformed to binary values using the UTF-8 character encoding standard (UTF = Unicode Transformation Format). The first 128 code points in Unicode are identical to the ASCII character set, which makes ASCII a subset of Unicode.

Exasol does not support SQL collation.

UTF‑8

Binary encoding of the code point value of Unicode characters, using 1 to 4 bytes per character depending on the value of the code point.

The first 128 code points are identical to the ASCII character set.

ASCII

Binary encoding using 1 byte per character, only the lower 7 bits allowed.

Extended ASCII (8 bits) is not supported in Exasol.