Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. Unicode contains a repertoire of 139 modern and historic scripts, as well as multiple symbol sets.
The standard has been implemented in modern operating systems, in XML, and in Java and other programming languages. The Unicode standard defines three encoding forms: UTF-8, UTF-16, and UTF-32.
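To make the difference between UTF-8, UTF-16, and UTF-32 concrete, the following minimal Python sketch encodes the same character in each one and returns the resulting bytes as hexadecimal. The helper name `encode_all` and the sample characters are illustrative choices, not part of the standard, and the big-endian codecs (`utf-16-be`, `utf-32-be`) are assumed only so that no byte order mark is prepended.

```python
def encode_all(ch: str) -> dict:
    """Return the code point and the UTF-8/16/32 bytes (as hex) for one character.

    Hypothetical helper for illustration only.
    """
    return {
        "code_point": f"U+{ord(ch):04X}",
        "utf-8": ch.encode("utf-8").hex(),
        # Big-endian variants are used so the output has no byte order mark.
        "utf-16": ch.encode("utf-16-be").hex(),
        "utf-32": ch.encode("utf-32-be").hex(),
    }


if __name__ == "__main__":
    # Latin capital A, the euro sign, and an emoji outside the Basic Multilingual Plane.
    for ch in ["A", "\u20ac", "\U0001F600"]:
        print(encode_all(ch))
```

The same code point takes a different number of bytes in each form: U+0041 needs one byte in UTF-8 but four in UTF-32, while U+1F600 needs four bytes in all three, with UTF-16 representing it as a surrogate pair.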
- Character set
- Character set encoding