Paste any text and the tool encodes it to UTF-8 bytes (using TextEncoder), then formats each byte as an 8-bit binary string. ASCII characters become a single byte (e.g. 'A' = 01000001). Multi-byte UTF-8 characters — accented letters, CJK, emoji — produce 2, 3 or 4 bytes each, all shown sequentially.
Decoding accepts space-separated or contiguous 8-bit groups and reverses the process via TextDecoder. If the input has malformed binary or invalid UTF-8 sequences, you'll see the Unicode replacement character (U+FFFD) rather than a hard error. The byte separator (space, newline or none) is configurable to match whatever upstream format you're working with.
8-bit UTF-8. ASCII characters fit in one byte, but characters outside ASCII (é, 你, 🎉) become 2–4 bytes each, all output as 8-bit groups.
Yes — decoder mode accepts both 'space-separated' and 'contiguous' 8-bit groups. The decoder reads 8 bits at a time.
Trailing bits that don't form a full byte are ignored. The decoded output will be missing the corresponding character — a warning is shown.
Emoji are encoded in UTF-8 using 4 bytes (32 bits). Compound emoji like family glyphs use multiple 4-byte sequences joined with zero-width joiners, so they can be 8, 12 or more bytes.
Explore the full suite of Converter tools and 290+ other free utilities at Chunky Munster.