
Issue 766008 link

Starred by 3 users

Issue metadata

Status: WontFix
Owner: ----
Closed: Sep 2017
Components:
EstimatedDays: ----
NextAction: ----
OS: Linux, Windows, Mac
Pri: 3
Type: Bug




Automatic encoding detection fails on a specific website

Reported by dar...@gmail.com, Sep 17 2017

Issue description

Chrome Version: 
Google Chrome	60.0.3112.113 (Official Build) (64-bit)
Revision	95c454326a7a3153e984e50a4719924968490717-refs/branch-heads/3112@{#744}
OS	Mac OS X
JavaScript	V8 6.0.286.56
Flash	27.0.0.130 /Library/Internet Plug-Ins/PepperFlashPlayer/PepperFlashPlayer.plugin
User Agent	Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.113 Safari/537.36


URLs (if applicable) : http://lend-lease.airforce.ru
Other browsers tested:
  Add OK or FAIL, along with the version, after other browsers where you
have tested this issue:
     Safari: FAIL*
    Firefox: FAIL*
         IE: N/A

*Can be fixed through manually switching character encoding to Cyrillic (Win-1251)
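The effect of the manual switch can be reproduced outside the browser. The following is a minimal Python sketch, assuming the page body is encoded as windows-1251; the sample string is hypothetical and is not taken from the site:

```python
# Hypothetical sample text; the real page content is not reproduced here.
text = "Привет"

# Bytes as a server would send them for a windows-1251 (cp1251) page.
raw = text.encode("cp1251")

# Decoding those bytes as UTF-8 (what the HTTP header asks for) fails:
# the bytes are not valid UTF-8, so they turn into replacement characters.
as_utf8 = raw.decode("utf-8", errors="replace")

# Decoding with the encoding the page was actually written in
# (the manual "Cyrillic (Win-1251)" override) restores the text.
as_cp1251 = raw.decode("cp1251")

print(repr(as_utf8))
print(as_cp1251)
```

The same bytes are legible or illegible depending only on which decoder is applied, which is why the manual override works in the browsers that offer it.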

What steps will reproduce the problem?
(1) Enter http://lend-lease.airforce.ru
(2) Load website and observe

What is the expected result?
Legible text in Russian language

What happens instead?
Illegible symbols appear

Please provide any additional information below. Attach a screenshot if
possible.

 
Screen Shot 2017-09-17 at 17.15.58.png

Comment 1 by ajha@chromium.org, Sep 18 2017

Components: Blink>TextEncoding
Labels: Needs-Milestone OS-Linux OS-Mac OS-Windows
This seems to show the same behavior on Chrome, Firefox and IE. Adding the proper component so that someone from the respective team can look into this and provide more input.

Comment 2 by jsb...@chromium.org, Sep 18 2017

Status: WontFix (was: Unconfirmed)
Header: Content-Type: text/html; charset=UTF-8

Page: <meta http-equiv="Content-Type" content="text/html; charset=windows-1251">

content.characterSet ends up as: "UTF-8"

HTML spec has:

https://html.spec.whatwg.org/multipage/parsing.html#determining-the-character-encoding

Step 3: "If the transport layer specifies a character encoding, and it is supported, return that encoding with the confidence certain, and abort these steps."

Which is to say: if the headers specify an encoding, that is taken as the encoding regardless of what the page content says. So behavior is per spec, and Chrome behaves the same as other browsers.
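The precedence described above can be illustrated with a minimal Python sketch. It is a simplification, not the spec's algorithm: it omits BOM sniffing, user overrides and the full meta prescan, and the function names, regular expressions and default encoding are assumptions made for illustration only.

```python
import codecs
import re

# Simplified patterns (assumptions); the spec's real parsing is stricter.
HEADER_CHARSET = re.compile(r'charset\s*=\s*"?([A-Za-z0-9_\-]+)"?', re.IGNORECASE)
META_CHARSET = re.compile(rb'charset\s*=\s*["\']?\s*([A-Za-z0-9_\-]+)', re.IGNORECASE)


def is_supported(label):
    """Treat a label as supported if Python has a codec for it (an approximation)."""
    try:
        codecs.lookup(label)
        return True
    except LookupError:
        return False


def determine_encoding(content_type_header, body, default="windows-1252"):
    """Return (encoding_label, confidence) using a simplified precedence."""
    # Transport layer first: a supported charset in the header wins outright.
    if content_type_header:
        match = HEADER_CHARSET.search(content_type_header)
        if match and is_supported(match.group(1)):
            return match.group(1), "certain"
    # Only without a usable header charset is the <meta> declaration consulted.
    match = META_CHARSET.search(body[:1024])
    if match:
        label = match.group(1).decode("ascii")
        if is_supported(label):
            return label, "tentative"
    return default, "tentative"


header = "text/html; charset=UTF-8"
page = b'<meta http-equiv="Content-Type" content="text/html; charset=windows-1251">'

print(determine_encoding(header, page))       # ('UTF-8', 'certain')
print(determine_encoding("text/html", page))  # ('windows-1251', 'tentative')
```

With the header and meta tag from this page, the header's UTF-8 is returned as certain and the meta declaration is never reached; the windows-1251 declaration only takes effect if the server stops sending a charset in Content-Type.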




Comment 3 by dar...@gmail.com, Sep 18 2017

Spec compliance is fine... except that means this website will be forever inaccessible for Chrome browser users. 

Other browsers have the workaround (manual page encoding selection) to bypass the wrong page encoding - Chrome doesn't. 

"I can't read the content of this page in any possible way" - from the user's point of view the browser IS broken. 

Broken HTML is too common a problem to ignore.

Regards,
Dennis

