Encoding Primer

Computers think in bits, which are just 0s and 1s. Humans read text. Encoding is the bridge between those two worlds, and it is what lets a 64-byte post-quantum public key of your avatar become 87 or 88 characters that you could share with the world and anyone, anywhere can decode it back into the exact same bytes.

Every .avtr domain name, every signed engram, and every verification you earn traces back to keys and hashes encoded this way. Before we can talk about what those keys do, we need to understand how they are written.

The number of bits you group together determines how many distinct values you can represent.

1 bit   = 0 or 1                            (2 possibilities)
4 bits  = 0000 to 1111                      (16 possibilities)
6 bits  = 000000 to 111111                  (64 possibilities)
8 bits  = 1 byte = 00000000 to 11111111     (256 possibilities)

That last row is the one that matters most. Eight bits grouped together form a byte, and the byte is the standard unit that computers use to store and transmit data. Cryptographic keys and hashes are just long sequences of these bytes.

To display those bytes as readable text, whether in a URL, a config file, or on screen, we encode them into printable characters. Each encoding scheme does this by slicing the bytes into smaller bit groups and mapping each group to a character. The only real question is how many bits each character carries.

Base16 (Hexadecimal)

4 bits per character. 16 possible values.

Hexadecimal is the simplest encoding because each character maps to exactly 4 bits, which is half a byte, so two hex characters together always form one full byte.

Alphabet (16 characters):
0 1 2 3 4 5 6 7 8 9 a b c d e f

Each value maps to a specific 4-bit pattern:

Bits	Value	Char	Bits	Value	Char
0000	0	`0`	1000	8	`8`
0001	1	`1`	1001	9	`9`
0010	2	`2`	1010	10	`a`
0011	3	`3`	1011	11	`b`
0100	4	`4`	1100	12	`c`
0101	5	`5`	1101	13	`d`
0110	6	`6`	1110	14	`e`
0111	7	`7`	1111	15	`f`

How 1 byte becomes 2 hex characters:

Byte: 01111010  (decimal 122)
      ├──┤├──┤
       ↓    ↓
       7    a    →  "7a"

If you encoded a 64-byte public key in hex, it would become 128 characters, because every byte turns into two characters. Base16 is the simplest encoding to implement, since you just slice the bytes into groups of four bits, and it is fast and universally understood by every tool that touches binary data. The tradeoff is that hex doubles the length of whatever you encode, which becomes noticeable once keys and signatures grow large.

Base64

6 bits per character. 64 possible values.

Base64 packs 50 percent more information into each character than hex by using 6-bit groups instead of 4-bit groups, drawing from all uppercase and lowercase letters, all ten digits, plus + and / to reach exactly 64.

Alphabet (64 characters):
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z   (26)
a b c d e f g h i j k l m n o p q r s t u v w x y z   (26)
0 1 2 3 4 5 6 7 8 9                                   (10)
+ /                                                   (2)
                                               Total: (64)

Each value maps to a specific 6-bit pattern:

Bits	Value	Char	Bits	Value	Char
000000	0	`A`	100000	32	`g`
000001	1	`B`	100001	33	`h`
000010	2	`C`	100010	34	`i`
000011	3	`D`	100011	35	`j`
000100	4	`E`	100100	36	`k`
000101	5	`F`	100101	37	`l`
000110	6	`G`	100110	38	`m`
000111	7	`H`	100111	39	`n`
001000	8	`I`	101000	40	`o`
001001	9	`J`	101001	41	`p`
001010	10	`K`	101010	42	`q`
001011	11	`L`	101011	43	`r`
001100	12	`M`	101100	44	`s`
001101	13	`N`	101101	45	`t`
001110	14	`O`	101110	46	`u`
001111	15	`P`	101111	47	`v`
010000	16	`Q`	110000	48	`w`
010001	17	`R`	110001	49	`x`
010010	18	`S`	110010	50	`y`
010011	19	`T`	110011	51	`z`
010100	20	`U`	110100	52	`0`
010101	21	`V`	110101	53	`1`
010110	22	`W`	110110	54	`2`
010111	23	`X`	110111	55	`3`
011000	24	`Y`	111000	56	`4`
011001	25	`Z`	111001	57	`5`
011010	26	`a`	111010	58	`6`
011011	27	`b`	111011	59	`7`
011100	28	`c`	111100	60	`8`
011101	29	`d`	111101	61	`9`
011110	30	`e`	111110	62	`+`
011111	31	`f`	111111	63	`/`

How 3 bytes become 4 base64 characters:

3 bytes = 24 bits
24 bits ÷ 6 bits per char = exactly 4 characters

Bytes:  01001101  01100001  01101110
        ├────┤├──────┤├──────┤├────┤
        010011 010110  000101 101110
          ↓       ↓       ↓      ↓
          T       W       F      u     →  "TWFu"

If you encoded the same 64-byte key in base64, it would become 88 characters once padding is added. That is only 33 percent overhead compared to hex's 100 percent, which is why base64 became the standard for embedding binary data in HTTP headers, JSON, and email attachments. It does have two rough edges that matter for identity systems. The + and / characters break when dropped into URLs without escaping, and characters like uppercase I, lowercase l, uppercase O, and the digit 0 look nearly identical in many fonts, which makes base64 risky when a human has to read a key aloud or copy it by hand.

Base58

Around 5.86 bits per character. 58 possible values.

Base58 was invented by Bitcoin to solve exactly the readability problem base64 suffered from. The idea is straightforward: start with base64's 64 characters and remove the 6 that cause the most trouble.

Base64 alphabet:  A-Z  a-z  0-9  + /      (64 characters)

Remove:  0  (looks like O)
         O  (looks like 0)
         I  (looks like l)
         l  (looks like I)
         +  (breaks URLs)
         /  (breaks URLs)

64 - 6 = 58 characters remain

The same grid with the removed characters struck through:

Alphabet (58 characters remaining):
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z   (24, removed I and O)
a b c d e f g h i j k l m n o p q r s t u v w x y z   (25, removed l)
0 1 2 3 4 5 6 7 8 9                                   (9, removed 0)
+ /                                                   (0, removed + and /)
                                               Total: (58)

The remaining 58 characters are assigned values 0 through 57:

Value	Char	Value	Char	Value	Char	Value	Char
0	`1`	15	`G`	30	`X`	45	`n`
1	`2`	16	`H`	31	`Y`	46	`o`
2	`3`	17	`J`	32	`Z`	47	`p`
3	`4`	18	`K`	33	`a`	48	`q`
4	`5`	19	`L`	34	`b`	49	`r`
5	`6`	20	`M`	35	`c`	50	`s`
6	`7`	21	`N`	36	`d`	51	`t`
7	`8`	22	`P`	37	`e`	52	`u`
8	`9`	23	`Q`	38	`f`	53	`v`
9	`A`	24	`R`	39	`g`	54	`w`
10	`B`	25	`S`	40	`h`	55	`x`
11	`C`	26	`T`	41	`i`	56	`y`
12	`D`	27	`U`	42	`j`	57	`z`
13	`E`	28	`V`	43	`k`
14	`F`	29	`W`	44	`m`

Note: I is skipped after H, O after N, l after k.

Why 5.86 bits and no clean bit mapping

58 is NOT a power of 2.

2^5 = 32   (too few, wastes 26 values)
2^6 = 64   (too many, wastes 6 values)

log₂(58) = 5.858 bits per character  (not a whole number)

Because there is no clean bit boundary, Base58 cannot slice bytes into fixed groups the way hex and base64 can. Instead it treats the entire input as one enormous number and divides by 58 repeatedly, collecting the remainders as it goes.

Input (64 bytes as one giant number):
  N = 0x7ab2c3d4e5f6...  (very large number)

Encoding loop:
  N ÷ 58 = quotient, remainder 23 → alphabet[23] = 'Q'
  Q ÷ 58 = quotient, remainder  7 → alphabet[7]  = '8'
  Q ÷ 58 = quotient, remainder 41 → alphabet[41] = 'i'
  ...repeat until quotient = 0, then reverse

A 64-byte Avatarnet public key becomes 87 or 88 Base58 characters. The formula gives 512 / 5.858 = 87.4, but because base58 has no padding and the output length depends on the actual numeric value of the input, that .4 has to round to a whole character. About 80 percent of random keys land on 88, the rest on 87. Either way it is on par with base64 for compactness while offering the payoff of readability: no visually ambiguous characters, no symbols that break URLs, and no embarrassing moments when someone misreads an identifier over the phone. The only real cost is that big-number division is slower than the simple bit slicing that hex and base64 rely on.

Comparison

The same 64-byte Avatarnet public key in three encodings:

Hex:    7ab2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0c1d2e3f4a5b6c7d8e9f0a1b2   (64)
        3c4d5e6f7a8b9c0d1e2f3a4b5c6d7e8f9a0b1c2d3e4f5a6b7c8d9e0f1a2b3c4d   (64)
        └─────────────────────── 128 characters ───────────────────────┘

Base64: erLD1OX2p7jJ0OHyo7TF1uf4qbDB0uP0pbbH2OnwobI8TV5veoucDR4vOktcbX6P   (64)
        mgscLT5PWmt8jZ4PGis8TQ==                                           (24)
        └──────────────────────── 88 characters ───────────────────────┘

Base58: 3THLVn365LxxWq8TXdj2U4h8tj8u5SVRW1otk5L7PuZhsBBkWMMVjS8cumi3o5Ew   (64)
        Xc6riUdH5N8hPzoMQqhF9AsN                                           (24)
        └──────────────────────── 88 characters ───────────────────────┘

The pattern is clear: more bits per character means fewer characters for the same data.

Encoding	Bits/Char	64-byte key	Clean slicing?
Base16	4.00	128 chars	Yes (4-bit groups)
Base64	6.00	88 chars	Yes (6-bit groups)
Base58	5.86	87-88 chars	No (division math)

64 Bytes in Every Base

An Avatarnet public key and a SHA-512 content hash are both 64 bytes, which is 512 bits. The three encodings above are the ones Avatarnet actually uses, but they sit on a spectrum that includes every common base. The table below shows how the same 512 bits expand or compress depending on how many bits each character carries.

The math

Base	Values/Char	Bits/Char	Formula	Chars for 64 bytes
2	2	1.00	512 ÷ 1 = 512	512 (binary)
8	8	3.00	512 ÷ 3 = 171	171 (octal)
10	10	3.32	512 ÷ 3.32 = 154.1	154-155 (decimal)
16	16	4.00	512 ÷ 4 = 128	128 (hex)
32	32	5.00	512 ÷ 5 = 103	103 (base32)
58	58	5.86	512 ÷ 5.86 = 87.4	87-88 (base58)
64	64	6.00	512 ÷ 6 = 86	88 (base64 + pad)

Visual: same key, every encoding

Binary (base2) -- 512 characters:
01111010101100101100001111010100111001011111011010100111101110001100100111010000111000011111001010100011101101001100010111010110111001111111100010101001101100001100000111010010111000111111010010100101101101101100011111011000111010011111000010100001101100100011110001001101010111100110111101111010100010111001110000001101000111100010111100111010010010110101110001101101011111101000111110011010000010110001110000101101001111100100111101011010011010110111110010001101100111100000111100011010001010110011110001001101

Octal (base8) -- 171 characters:
172545417247137324756144720703712435514272671774251541407227077224555543730723702415443611527467572427160150742747222656155375076320261605517447532326762154740743212636115

Decimal (base10) -- 154 characters:
6426231439428688780160821520481653369851945368757767980841938812350201104760978515516502572108208257329007500974646048271668522153118506373005704832957517

Hex (base16) -- 128 characters:
7ab2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0c1d2e3f4a5b6c7d8e9f0a1b23c4d5e6f7a8b9c0d1e2f3a4b5c6d7e8f9a0b1c2d3e4f5a6b7c8d9e0f1a2b3c4d

Base32 -- 103 characters:
pkzmhvhf62t3rsoq4hzkhngf23t7rknqyhjoh5ffw3d5r2pqugzdytk6n55ixhandyxtus24nv7i7gqldqwt4t22nn6i3hqpdivtyti

Base58 -- 88 characters:
3THLVn365LxxWq8TXdj2U4h8tj8u5SVRW1otk5L7PuZhsBBkWMMVjS8cumi3o5EwXc6riUdH5N8hPzoMQqhF9AsN

Base64 -- 88 characters:
erLD1OX2p7jJ0OHyo7TF1uf4qbDB0uP0pbbH2OnwobI8TV5veoucDR4vOktcbX6PmgscLT5PWmt8jZ4PGis8TQ==

Why more bits per character means shorter output

512 bits to encode:

Base2:   ████████████████████████████████████████████████████  512 chars
Base8:   █████████████████                                     171 chars
Base10:  ████████████████                                  154-155 chars
Base16:  █████████████                                         128 chars
Base32:  ██████████                                            103 chars
Base58:  █████████                                           87-88 chars
Base64:  █████████                                             88 chars

← fewer characters = more efficient

Each step up in base squeezes more information into each character. Base58 and Base64 land at nearly the same length because 5.86 and 6.00 bits per character are close, but Base58 trades that last fraction of efficiency for the readability gains described above.

The tradeoff

More compact ──────────────────────────────────────► More readable
Base64/58          Base32          Base16            Base2
(~88 chars)        (103 chars)     (128 chars)       (512 chars)
hard to read       Tor .onion      easy to read      impractical
hard to debug      addresses       easy to debug     but obvious

For Avatarnet:

Hex (Base16) for keys and hashes in logs, config files, and debugging output, because every byte is exactly two characters and the mapping is trivial to read
Base58 for Peer IDs, following the libp2p and Bitcoin convention, because humans may need to read, copy, or compare them
Base64 for signatures in wire formats and JSON transport, because compactness matters when a single signature is 49,856 bytes

The keys and signatures these encodings carry are dramatically larger than what Bitcoin or Signal use, and that raises an obvious question: why accept the extra weight? The answer starts with quantum computers and the algorithms that survive them. That is the subject of the next page, Post-Quantum Cryptography.

Value	Char	Value	Char	Value	Char	Value	Char
0	`1`	15	`G`	30	`X`	45	`n`
1	`2`	16	`H`	31	`Y`	46	`o`
2	`3`	17	`J`	32	`Z`	47	`p`
3	`4`	18	`K`	33	`a`	48	`q`
4	`5`	19	`L`	34	`b`	49	`r`
5	`6`	20	`M`	35	`c`	50	`s`
6	`7`	21	`N`	36	`d`	51	`t`
7	`8`	22	`P`	37	`e`	52	`u`
8	`9`	23	`Q`	38	`f`	53	`v`
9	`A`	24	`R`	39	`g`	54	`w`
10	`B`	25	`S`	40	`h`	55	`x`
11	`C`	26	`T`	41	`i`	56	`y`
12	`D`	27	`U`	42	`j`	57	`z`
13	`E`	28	`V`	43	`k`
14	`F`	29	`W`	44	`m`

Value	Char	Value	Char	Value	Char	Value	Char
0	`1`	15	`G`	30	`X`	45	`n`
1	`2`	16	`H`	31	`Y`	46	`o`
2	`3`	17	`J`	32	`Z`	47	`p`
3	`4`	18	`K`	33	`a`	48	`q`
4	`5`	19	`L`	34	`b`	49	`r`
5	`6`	20	`M`	35	`c`	50	`s`
6	`7`	21	`N`	36	`d`	51	`t`
7	`8`	22	`P`	37	`e`	52	`u`
8	`9`	23	`Q`	38	`f`	53	`v`
9	`A`	24	`R`	39	`g`	54	`w`
10	`B`	25	`S`	40	`h`	55	`x`
11	`C`	26	`T`	41	`i`	56	`y`
12	`D`	27	`U`	42	`j`	57	`z`
13	`E`	28	`V`	43	`k`
14	`F`	29	`W`	44	`m`

#Encoding Primer

#Base16 (Hexadecimal)

#Base64

#Base58

#Why 5.86 bits and no clean bit mapping

#Comparison

#64 Bytes in Every Base

#The math

#Visual: same key, every encoding

#Why more bits per character means shorter output

#The tradeoff