Character representation

By Martin McBride, 2016-11-16

Categories: data representation

A text character is a letter, digit, punctuation mark, or other symbol that you might type in on your keyboard.

Computers store characters as numbers – each letter or other symbol has a unique number (for example, A might be represented by the number 65, B by 66 etc). A character string such as "Hello world!" is represented by a list of numbers.

A character set is a list of all the characters which are recognised by a computer's hardware and software. It also defines which number is used to represent each character. Here is a simple character set which just contains the capital letters, and gives each letter a different number to identify it:

character set

In order to transfer text data between different computer systems, we need to use a standard character set that all computers understand. There are two standard character sets which are very widely used - ASCII and Unicode.

See also

Sign up to the Creative Coding Newletter

Join my newsletter to receive occasional emails when new content is added, using the form below:

Popular tags

555 timer abstract data type abstraction addition algorithm and gate array ascii ascii85 base32 base64 battery binary binary encoding binary search bit block cipher block padding byte canvas colour coming soon computer music condition cryptographic attacks cryptography decomposition decryption deduplication dictionary attack encryption file server flash memory hard drive hashing hexadecimal hmac html image insertion sort ip address key derivation lamp linear search list mac mac address mesh network message authentication code music nand gate network storage none nor gate not gate op-amp or gate pixel private key python quantisation queue raid ram relational operator resources rgb rom search sort sound synthesis ssd star network supercollider svg switch symmetric encryption truth table turtle graphics yenc