
Issue 753262

Starred by 1 user

Issue metadata

Status: WontFix
Owner: ----
Closed: Aug 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Linux
Pri: 2
Type: Compat




Copying/pasting text from PDF produces wrong text

Reported by khym.cha...@gmail.com, Aug 8 2017

Issue description

UserAgent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.90 Safari/537.36

Example URL:

Steps to reproduce the problem:
1. Copy the word "collection" from the attached PDF.
2. Paste into anything which accepts text.

What is the expected behavior?
The word "collection" is pasted.

What went wrong?
The word "collec6on" is pasted.

Does it occur on multiple sites: N/A

Is it a problem with a plugin? Yes PDF

Did this work before? N/A 

Does this work in other browsers? N/A

Chrome version: 60.0.3112.90  Channel: stable
OS Version: Fedora 25
Flash Version: Shockwave Flash 26.0 r0

I have plenty of other PDF pages which display the same problem, if more examples are needed.  The problem PDFs seem to be created on a Mac by exporting PowerPoint to PDF via Quartz PDFContext.

The exact same problem happens when using the KDE PDF viewing program "okular".
 
bug.pdf
96.5 KB
Components: Internals>Plugins>PDF
The "ti" in "collection" seems to be a single character (a ligature) rather than two separate letters. I can't explain how that was introduced into the PDF, but if you look at all the "ti"s on the page, they are joined at the top and can only be selected as a single unit.
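
One plausible reading of the above, which this thread does not confirm, is that the PDF draws "ti" as one ligature glyph and the file's glyph-to-text mapping for that glyph is wrong. The Python sketch below illustrates the idea only; the glyph names and both mapping tables are hypothetical and are not taken from bug.pdf.

```python
# Illustrative sketch only: glyph names and mapping tables are hypothetical.
# A viewer turns drawn glyphs back into text through a lookup table stored
# in the file. If the entry for a ligature glyph is wrong, every viewer that
# trusts the table will copy the same wrong text.

# The word "collection" as it might be drawn: "t_i" is a single ligature glyph.
GLYPHS = ["c", "o", "l", "l", "e", "c", "t_i", "o", "n"]

# What the mapping should say for the ligature glyph.
CORRECT_MAP = {"t_i": "ti"}

# Assumed faulty mapping that would produce the reported output.
BROKEN_MAP = {"t_i": "6"}


def extract_text(glyphs, to_unicode):
    """Map each glyph to text; glyphs without an entry map to themselves."""
    return "".join(to_unicode.get(glyph, glyph) for glyph in glyphs)


if __name__ == "__main__":
    print(extract_text(GLYPHS, CORRECT_MAP))
    print(extract_text(GLYPHS, BROKEN_MAP))
```

Under this assumption the viewer has no way to recover the intended letters, which would be consistent with the same result appearing in okular.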
Cc: susanjuniab@chromium.org
Labels: Needs-Triage-M60 M-62
Status: Untriaged (was: Unconfirmed)
@khym.chanur@ Thanks for the issue.

I was able to reproduce this issue on Windows 7, Ubuntu 14.04 and Mac OS 10.12.6, using Chrome latest stable 60.0.3112.90 and canary 62.0.3181.0, with the steps below.

1. Opened the given .pdf file and copied the word 'collections'.
2. Pasted it into the Chrome URL bar and saw other characters in place of 'ti'.

This is not a regression; the behavior has been observed as far back as Chrome 45.0.2413.0.

Please see the attached screencast and let me know if anything was missed.

Thanks.
pdf_error.webm
3.9 MB
Status: WontFix (was: Untriaged)
This reproduces in every reader I've tested, so it's a problem with the file.

Marking as WAI (working as intended).
