Copying/pasting text from PDF produces wrong text
Reported by
khym.cha...@gmail.com,
Aug 8 2017
|
|||
Issue descriptionUserAgent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.90 Safari/537.36 Example URL: Steps to reproduce the problem: 1. Copy the word "collection" from the attached PDF. 2. Paste into anything which accepts text. What is the expected behavior? The word "collection" is pasted. What went wrong? The word "collec6on" is pasted. Does it occur on multiple sites: N/A Is it a problem with a plugin? Yes PDF Did this work before? N/A Does this work in other browsers? N/A Chrome version: 60.0.3112.90 Channel: stable OS Version: Fedora 25 Flash Version: Shockwave Flash 26.0 r0 I have plenty of other PDF pages which display the same problem, if more examples are needed. The problem PDFs seem to be created on a Mac by exporting PowerPoint to PDF via Quartz PDFContext. The exact same problem happens when using the KDE PDF viewing program "okular".
,
Aug 8 2017
The "ti" in "collection" seems to be a single character itself. I can't explain how that was introduced to the PDF, but if you look at all the "ti"s in the page, they are joined at the top and are selectable as a single unit.
,
Aug 10 2017
@khym.chanur@ Thanks for the issue. Able to reproduce this issue on Windows-7, Ubuntu 14.04 and Mac OS 10.12.6 using chrome latest stable 60.0.3112.90 and canary 62.0.3181.0 with the below steps. 1. Opened the given .pdf file and copied the word 'collections'. 2. Pasted this on Chrome URL and could see other characters in place of 'ti'. This is a Non-Regression issue which is observed from 45.0.2413.0 chrome version. Please find the attached screen cast and confirm if anything is missed here. Thanks..
,
Aug 10 2017
This repros in every reader I've tested, it's a problem with the file. Marking as WAI. |
|||
►
Sign in to add a comment |
|||
Comment 1 by hnakashima@chromium.org
, Aug 8 2017