← RFC Reference

RFC 6531 – SMTP Extension for Internationalized Email

Proposed Standard Core SMTP & Message Format Obsoletes RFC 5321 Published February 2012
ELI5: Traditional email addresses can only use English letters, numbers, and a few symbols. That means billions of people around the world can’t have email addresses in their own script — no Chinese characters, no Arabic, no Cyrillic. SMTPUTF8 fixes this by extending SMTP to handle full Unicode, so someone can have an address like émilie@exemple.fr or an address in Hindi, Japanese, or any other writing system.

Why This RFC Exists

Email was designed in the era of ASCII. The original SMTP specification (RFC 5321) restricts email addresses and envelope commands to 7-bit ASCII characters. This works fine for English, but excludes the majority of the world's languages. An email address like 田太郎@例.jp was simply impossible.

RFC 6531 defines the SMTPUTF8 extension, which allows UTF-8 encoding in the SMTP envelope — specifically in MAIL FROM, RCPT TO, and the EHLO domain. This is part of a suite of RFCs collectively known as Email Address Internationalization (EAI), which also includes RFC 6532 for internationalized message headers.

This extension opens email to every writing system supported by Unicode, which is essential for global email adoption.

How It Works

  1. The client sends EHLO and confirms the server advertises SMTPUTF8 in its capability list.
  2. When sending a message that uses non-ASCII addresses (in the envelope, headers, or both), the client adds the SMTPUTF8 parameter to the MAIL FROM command.
  3. The MAIL FROM and RCPT TO addresses may now contain UTF-8 characters.
  4. The message headers may also contain UTF-8 (per RFC 6532), replacing the old RFC 2047 encoded-word workarounds.
  5. If the next-hop server does not support SMTPUTF8, the sending server must either downgrade the message or reject it — it cannot silently strip the international characters.

SMTP Example

Sending a message with internationalized addresses:

S: 220 mx.example.com ESMTP C: EHLO sender.example.net S: 250-mx.example.com S: 250-SMTPUTF8 S: 250-8BITMIME S: 250-STARTTLS S: 250 SIZE 52428800 # SMTPUTF8 parameter required when using international addresses C: MAIL FROM:<émilie@exemple.fr> SMTPUTF8 S: 250 2.1.0 OK C: RCPT TO:<田太郎@例.jp> S: 250 2.1.5 OK C: DATA S: 354 Start mail input C: From: Émilie Dupont <émilie@exemple.fr> C: To: 田太郎 <田太郎@例.jp> C: Subject: Meeting confirmation C: Date: Wed, 11 Mar 2026 10:00:00 +0100 C: MIME-Version: 1.0 C: Content-Type: text/plain; charset=UTF-8 C: C: Confirming our meeting for next week. C: . S: 250 2.0.0 OK, queued

Key Technical Details

The SMTPUTF8 Parameter

The SMTPUTF8 keyword on MAIL FROM signals that this message uses internationalized content. It must be present whenever any of the following contain non-ASCII characters:

If the SMTPUTF8 parameter is not declared and the envelope contains non-ASCII characters, the server must reject the command.

Domain Names: IDN and UTF-8

Internationalized domain names (IDN) have existed for years using Punycode encoding (e.g., xn--e1afmapc.xn--p1ai for пример.рф). SMTPUTF8 allows the UTF-8 form of domain names directly in SMTP, though Punycode (A-labels) remain valid. For DNS lookups, the domain must still be converted to its A-label form.

Downgrading and Fallback

The biggest challenge with SMTPUTF8 is interoperability with servers that don't support it. When relaying a message to a non-SMTPUTF8 server:

Interaction with Authentication

Email authentication mechanisms need updates for internationalized addresses:

Common Mistakes

Deliverability Impact

Related RFCs