Unicode Character Lookup

Inspect any character: codepoint, HTML entities, JS escape, UTF-8 bytes.

Paste a character or a codepoint (U+20AC, 0x20AC, 8364, \u20AC) and get its codepoint, HTML decimal/hex entity, JavaScript escape, URL-encoded form, UTF-8 byte sequence and Unicode block. Useful when wrangling encoding, escaping or i18n bugs.

Common use cases

  • Find the HTML entity or JS escape for a special character
  • Identify a mystery glyph copy-pasted from a document
  • Look up which Unicode block an emoji or symbol belongs to
  • Generate URL-safe or HTML-safe forms of a single character

Frequently asked questions

How do I enter a codepoint?
Any of these work: type the character itself, U+20AC, 0x20AC, the decimal 8364, or the JS escape \u20AC.
Are emoji supported?
Yes — including emoji above U+FFFF. JS escapes for these use the \u{XXXX} form rather than two surrogate halves.
Where does the character name come from?
We bundle a small lookup of common Latin, punctuation, currency and symbol characters. For arbitrary characters the block name is shown instead.

Related tools