Autoencoder fails on specific website
Reported by
dar...@gmail.com,
Sep 17 2017
|
||
Issue descriptionChrome Version: Google Chrome 60.0.3112.113 (Official Build) (64-bit) Revision 95c454326a7a3153e984e50a4719924968490717-refs/branch-heads/3112@{#744} OS Mac OS X JavaScript V8 6.0.286.56 Flash 27.0.0.130 /Library/Internet Plug-Ins/PepperFlashPlayer/PepperFlashPlayer.plugin User Agent Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.113 Safari/537.36 URLs (if applicable) : http://lend-lease.airforce.ru Other browsers tested: Add OK or FAIL, along with the version, after other browsers where you have tested this issue: Safari: FAIL* Firefox: FAIL* IE: N/A *Can be fixed through manually switching character encoding to Cyrillic (Win-1251) What steps will reproduce the problem? (1) Enter http://lend-lease.airforce.ru (2) Load website and observe What is the expected result? Legible text in Russian language What happens instead? Illegible symbols appear Please provide any additional information below. Attach a screenshot if possible.
,
Sep 18 2017
Header: Content-Type: text/html; charset=UTF-8 Page: <meta http-equiv="Content-Type" content="text/html; charset=windows-1251"> content.characterSet ends up as: "UTF-8" HTML spec has: https://html.spec.whatwg.org/multipage/parsing.html#determining-the-character-encoding Step 3: "If the transport layer specifies a character encoding, and it is supported, return that encoding with the confidence certain, and abort these steps." Which is to say: if the headers specify an encoding, that is taken as the encoding regardless of what the page content says. So behavior is per spec, and Chrome behaves the same as other browsers.
,
Sep 18 2017
Spec compliance is fine... except that means this website will be forever inaccessible for Chrome browser users. Other browsers have the workaround (manual page encoding selection) to bypass the wrong page encoding - Chrome doesn't. "I can't read the content of this page in any possible way" - from the user point of view the browser IS broken. Broken HTML is too common of feature to ignore, Regards, Dennis |
||
►
Sign in to add a comment |
||
Comment 1 by ajha@chromium.org
, Sep 18 2017Labels: Needs-Milestone OS-Linux OS-Mac OS-Windows