New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 821800 link

Starred by 2 users

Issue metadata

Status: Duplicate
Merged: issue 53579
Owner: ----
Closed: Mar 2018
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 2
Type: Bug



Sign in to add a comment

Chromium just removes %E2%80%8B if entered somewhere in the path part of a URL

Reported by danst.t...@gmail.com, Mar 14 2018

Issue description

UserAgent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.118 Safari/537.36

Steps to reproduce the problem:
1. visit https://www.google.at/intl%E2%80%8B/en_en/policies/technologies/
2. visit https://www.google.at/intl/en_en/policies/technologies/

What is the expected behavior?
Just show the damn whole URL. Do not remove parts, do not hide parts.

What went wrong?
I have absolutely no idea as of WHY the browser should tamper with URLs in this way at all. This is especially annoying when someone provides you with a flawed link and you can't see that you actually requested a totally different site, because this change is not displayed. Please just display all characters within a URL.

Did this work before? N/A 

Chrome version: 65.0.3325.162  Channel: stable
OS Version: 
Flash Version: 

Related: https://bugs.chromium.org/p/chromium/issues/detail?id=53579&q=%25E2%2580%258B&colspec=ID%20Pri%20M%20Stars%20ReleaseBlock%20Component%20Status%20Owner%20Summary%20OS%20Modified
 

Comment 1 by mmenke@chromium.org, Mar 14 2018

Cc: mkwst@chromium.org
Components: UI>Browser>Omnibox Internals>Network UI>Internationalization
Labels: -OS-Linux
Summary: Chromium just removes %E2%80%8B if entered somewhere in the path part of a URL (was: Chromium just removes %E2%80%8B if entered somewhere in the URL hostname (after http(s):// and before /), and it does not display %E2%80%8B in the URL bar if it is actually written in the procotol or in the file name.)
Unfortunately, internatialized paths use escape-encoded UT8.  I believe %E2%80%8B is a zero-width space, which is visible and meaningful in the case of some languages, though obviously not others.  Not sure what the fix here.

Note that if you include it in the domain name, the characters are removed entirely for just this reason.  They are not removed, and displayed as UTF8 characters instead when they're in the path (so this is not a security issue - displaying the wrong domain, or a confusing one, would be a serious security issue).

CCing some possible interested parties, to see if anything feels it's worth investing some effort here.  Similar characters are the the non-breaking space, and the zero-width joiner / non-joiner characters.
Labels: Needs-Triage-M65
Mergedinto: 53579
Status: Duplicate (was: Unconfirmed)

Sign in to add a comment