FerrLens
← Retour au changelog

· Nouveau

ASCII / Unicode inspector

Codepoint-by-codepoint breakdown of any string — UTF-8 bytes, UTF-16 units, HTML entities, CSS escape and Unicode block.

ⓘ Cette entrée est rédigée en anglais par les développeurs.

/tools/ascii-unicode takes any string and lists every codepoint with its encoding details:

  • Codepoint as U+XXXX and decimal.
  • UTF-8 byte sequence (1 to 4 bytes per codepoint).
  • UTF-16 code units (surrogate pairs decoded).
  • HTML entity in decimal (😀) and hex (😀) form.
  • CSS escape (\1F600 ).
  • Unicode block name (Basic Latin, Latin-1 Supplement, CJK Unified Ideographs, Emoticons, etc.).

Designed for debugging weird text — a zero-width space pasted into a config file, a Cyrillic look-alike in a domain, a combining diacritic that won’t normalise, or that one emoji that turns into a surrogate-pair mess in your backend log. Runs entirely in your browser.