All Four Unicode Normal Forms
Convert to NFC, NFD, NFKC, or NFKD — the complete set of Unicode normalization forms defined by the standard.
500+ fast, free tools. Most run in your browser only; Image & PDF tools upload files to the backend when you run them.
Normalize Unicode text across NFC, NFD, NFKC, and NFKD with diff summary.
The same visible character can be encoded multiple ways in Unicode — an 'é' can be stored as a single precomposed code point (U+00E9) or as a base 'e' followed by a combining acute accent (U+0301). This matters more than it appears: two strings that look identical on screen may fail a strict equality check, break text search, or produce duplicate entries in a database because they use different normalization forms. Unicode Normalizer converts text between the four Unicode normalization forms defined by the Unicode Standard — NFC, NFD, NFKC, and NFKD — and shows a summary of how many characters changed. NFC is the preferred form for most storage and interchange; NFKC additionally maps compatibility characters like full-width letters and ligatures to their canonical equivalents.
Convert to NFC, NFD, NFKC, or NFKD — the complete set of Unicode normalization forms defined by the standard.
See exactly how many characters were added, removed, or replaced during normalization so nothing changes unexpectedly.
Normalize before storing text to prevent the same visible string from producing duplicate records or search misses.
NFKC maps full-width characters, Roman numerals, ligatures, and compatibility forms to their standard equivalents.
NFC is the recommended form for most web content, JSON, and database storage — precomposed characters use fewer code units.
Normalization surface-level reveals combining characters and compatibility variants that are otherwise invisible in rendered text.
Input: café (composed as e + combining acute: NFD form)
Output: NFC: café (single code point U+00E9 for é — 4 chars) NFD: café (5 chars with combining accent)
Input: Abc123 (full-width Unicode)
Output: NFKC: Abc123 (standard ASCII equivalents)
Input: file and flow (fi and fl ligatures)
Output: NFKC: file and flow