New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 765341 link

Starred by 1 user

Issue metadata

Status: Available
Owner: ----
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Linux , Android , Windows , Chrome , Mac
Pri: 3
Type: Task



Sign in to add a comment

Base + Combining mark sequence NOT supported by any font are turned into Tofus.

Project Member Reported by js...@chromium.org, Sep 14 2017

Issue description

Reported internally in b/64850414 .

Arguably, this issue can be considered 'Work As Intended'. Moreover, sequences affected by this bug would not show up in regular text (they'd only show up in text concocted to test or spoof or something interesting). So, this is not a high priority issue. 


How to reproduce:

A. 
Have a web page with "base + combining mark" sequences that are not likely to be supported. 

e.g. U+AC00 + U+0305  "가̅"    

B. 
1.  Open in Chrome   http://osagelanguagetools.appspot.com/OsageFonts/?utext=%F0%90%93%88%F0%90%92%B0%CD%98%20%F0%90%92%B9%F0%90%92%B7%20%F0%90%93%8D%F0%90%92%B2%F0%90%93%87%F0%90%92%B7&osageText=

2.  Enter U+0305 (Combining Overline) after the last character in the text field.

Actual:

  1. Both base character and combining mark are turned into tofus
  2. Or  the base character is turned into tofu while a free standing combining mark without a dotted circle is shown. 

What's expected:

  There's no clear-cut answer to this situation, especially considering that the script extension of U+0305 is still Common. Perhaps, it'd better be tightened to Latin-like (LGC + a few other scripts). 

If 'base + combining mark' is "invalid" ('definition of invalid' TBD),  we may as well just show 'base' character (e.g. Korean Hangul, CJK ideograph) intact followed by a combining mark (e.g. U+0305) with a dotted circle. 

In case of Firefox on Linux, U+0305 is attached to the base character (Hangul, Han, you name it) , which we can also consider doing, but my preference is in the previous paragraph (especially in light of potential spoofing attempts in domain names with Firefox-like-behavior). 


 

Comment 1 by js...@chromium.org, Sep 21 2017

> In case of Firefox on Linux, U+0305 is attached to the base character (Hangul, Han, you name it) , which we can also consider doing, but my preference is in the previous paragraph (especially in light of potential spoofing attempts in domain names with Firefox-like-behavior). 

See the following comment as to why I don't want Chrome to do that: 

https://bugs.chromium.org/p/chromium/issues/detail?id=756456#c3


In that comment, I have 'e' + U+0F37.  On my Mac, U+0F37 is shown without a dotted circle (U+25CC) after 'e'. (it's a security risk especially in the omnibox, but it can be an issue in web contents area as well). 

google༷.com   (U+0F37 after 'e').

I think what needs to be done for Base + Combining mark (of unrelated scripts) is:

1. Show base character as usual
2. Show an isolated combining mark (unmatched) with a dotted circle + a nominal glyph (it'd be rather automatic in some fonts.  In other fonts, Blink may have to trigger this behavior 'manually'). 


Project Member

Comment 2 by sheriffbot@chromium.org, Sep 24

Labels: Hotlist-Recharge-Cold
Status: Untriaged (was: Available)
This issue has been Available for over a year. If it's no longer important or seems unlikely to be fixed, please consider closing it out. If it is important, please re-triage the issue.

Sorry for the inconvenience if the bug really should have been left as Available.

For more details visit https://www.chromium.org/issue-tracking/autotriage - Your friendly Sheriffbot
Labels: -Type-Bug Type-Task
Status: Available (was: Untriaged)
I'll move this to type task. We'd need a definition of what an invalid combination it, and it really exists, and then we always need a font that has all combinations of dotted circle and those combining marks. I am not sure that's feasible.

Sign in to add a comment