Some unicode domain names aren't recognized as such
Reported by
sorokinm...@gmail.com,
Mar 16 2017
|
||||||
Issue descriptionUserAgent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.21 Safari/537.36 Example URL: ╯°□°╯︵┻━┻.tk Steps to reproduce the problem: 1. Enter the url `╯°□°╯︵┻━┻.tk' into the adress bar What is the expected behavior? Chromium recognizes the domain name as such and opens it What went wrong? It tries to google it Does it occur on multiple sites: N/A Is it a problem with a plugin? No Did this work before? N/A Does this work in other browsers? No chome for android, safari for ios (but macos version works properly), wget, curl, elinks Chrome version: 57.0.2987.21 Channel: beta OS Version: Linux benny-pc 4.9.8-gentoo #1 SMP Sat Feb 11 23:04:31 MSK 2017 x86_64 Intel(R) Core(TM) i7-3520M CPU @ 2.90GHz GenuineIntel GNU/Linux Flash Version: Shockwave Flash 24.0 r0
,
Mar 20 2017
,
Mar 20 2017
Are you sure this is a real domain name? DNS lookup on it fails for me. I ask because it seems to be forced to a search due to a failure to convert to valid punycode, and thus I wonder if it's a legal, navigable domain at all.
,
Mar 21 2017
IIRC, Emoji is disallowed per IDNA2008 ( https://tools.ietf.org/html/rfc5892 ), and when they work, it's due to IDNA2003 rules being applied. Adding jshin@ to make sure I didn't botch this.
,
Mar 21 2017
If it helps at all, removing the "︵" character causes this to be treated as a valid URL.
,
Mar 21 2017
Not a DNS issue - the domain name isn't even sent to the DNS resolver. Seems like this is most likely a GURL, icu, or omnibox issue.
,
Jun 26 2017
Closering per comment #4. Supporting the claim that emoji is invalid, according to a reputable site, https://www.punycoder.com/, the ︵ is invalid. (It would get encoded as something involving a "(", which is an illegal character in hostnames / domain names.) |
||||||
►
Sign in to add a comment |
||||||
Comment 1 by sorokinm...@gmail.com
, Mar 16 2017