URLs with UTF-8 / Non-ASCII Characters

When choosing the URL for a web page, you often want to use keywords that accurately describe the page’s content. Sometimes these keywords aren’t in English and contain accented or other non-ASCII characters. One option is to pick one URL as the canonical URL and redirect to it from alternative spellings, such as the ASCII-equivalent version of the words, e.g.

Canonical: http://www.somedomain.com/nǐhǎo

Redirects:

  • http://www.somedomain.com/nihao
  • http://www.somedomain.com/你好
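
The redirect scheme above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `CANONICAL_PATH`, `REDIRECTS`, `ascii_equivalent`, and `resolve` are hypothetical, the ASCII form is derived by stripping combining marks after Unicode decomposition (which works for accented Latin letters but not for scripts like Chinese, so that alias is listed by hand), and a permanent (301) redirect is assumed.

```python
import unicodedata
from urllib.parse import quote, unquote

# Hypothetical canonical path, normalized to NFC so comparisons are stable.
CANONICAL_PATH = unicodedata.normalize("NFC", "/nǐhǎo")


def ascii_equivalent(path: str) -> str:
    """Strip accents by decomposing characters and dropping combining marks."""
    decomposed = unicodedata.normalize("NFKD", path)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Alias paths that should redirect to the canonical path.
REDIRECTS = {
    ascii_equivalent(CANONICAL_PATH): CANONICAL_PATH,  # "/nihao"
    "/你好": CANONICAL_PATH,  # added by hand; cannot be derived by stripping accents
}


def resolve(request_path: str):
    """Return (status, location) for a known alias, or None if no redirect applies.

    The incoming path is assumed to be percent-encoded UTF-8, as browsers send it.
    """
    decoded = unicodedata.normalize("NFC", unquote(request_path))
    target = REDIRECTS.get(decoded)
    if target is None:
        return None
    # Percent-encode the non-ASCII target for use in a Location header.
    return 301, quote(target)
```

For example, `resolve("/nihao")` and `resolve("/%E4%BD%A0%E5%A5%BD")` both point at the percent-encoded canonical path, while a request for the canonical path itself returns `None` and is served normally.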