New issue
Advanced search Search tips

Issue 845821 link

Starred by 2 users

Issue metadata

Status: WontFix
Owner: ----
Closed: May 2018
Components:
EstimatedDays: ----
NextAction: ----
OS: Mac
Pri: 2
Type: Bug



Sign in to add a comment

Can't copy text from PDF file displayed in chrome's PDF viewer

Reported by m...@sundryltd.com, May 23 2018

Issue description

UserAgent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/66.0.3359.181 Safari/537.36

Example URL:
http://www.amex-services.de/SEPA/lastschriftmandat.pdf

Steps to reproduce the problem:
1. Open a PDF file with the chrome PDF viewer
2. Attempt to copy text to paste somewhere else.
3. No copy option in right-click menu...command-c also doesn't work.

What is the expected behavior?
You should be able to copy and paste text from un-protected PDF files when viewing them with chrome.

What went wrong?
Cannot copy text from unprotected PDF files when viewed with the built-in chrome PDF viewer.   This definitely happens to me semi-regularly with different PDF files.  The link is the latest example.  I don't recall if I have seen PDFs where this works or if a browser restart makes any difference.  I'll update this if I have future observations or if someone has a specific question.

Does it occur on multiple sites: Yes

Is it a problem with a plugin? Yes PDF

Did this work before? N/A 

Does this work in other browsers? Yes

Chrome version: 66.0.3359.181  Channel: stable
OS Version: OS X 10.10.5
Flash Version: 

Copying text out of a PDF is a very common use case...for instance to get a URL to open or to copy a name to google.  To work around this (when it occurs), have to use other browser, disable chrome's PDF plugin or force download the file and open with a system viewer in order to copy text (e.g. a URL) out of a PDF file.Have to use other browser, disable chrome's PDF plugin or force download the file and open with a system viewer in order to copy text (e.g. a URL) out of a PDF file.
 

Comment 1 by woxxom@gmail.com, May 23 2018

This is a locked PDF that forbids copying.
This is a standard feature of PDF honored by Chrome so the behavior is correct.
The only enhancement Chrome could provide is an indication of this protection being present.
secure.png
24.5 KB View Download

Comment 2 by m...@sundryltd.com, May 23 2018

Interesting.  Preview (macos system viewer) reports that it is encrypted but different permissions.
Screen Shot 2018-05-23 at 11.38.04 .png
204 KB View Download

Comment 3 by m...@sundryltd.com, May 23 2018

There is a 5 year old firefox bug for this (copying works fine in firefox's pdf viewer).
Apple Preview also allows copying text, couldn't find anything about whether apple considers this a bug.

Often my use-case is copying and pasting a paragraph of text from a PDF into google translate...or copying and pasting a URL that wasn't properly hyperlinked in order to open a webpage.  I would consider both of these "accessibility" type features (my handicap being that I don't speak the native document language and can't read it without assistance).

I get that people want to protect their documents and the spec does allow for copy protection (although I read that the flag for accessibility copy protection goes away with PDF 2.0 because this should ALWAYS be allowed).  

Personally I think the permissions standard is a bit silly today since anyone that wants to can work around the restrictions...ie it doesn't prevent pirates from pirating "protected" PDFs and for the average user it is just a terrible user experience.  I am guessing that the spirit of the copy permission was to protect from people literally copying/pirating content and not to prevent people from accessing the content.  Not sure how many people would copy and paste text out of a PDF document into their own document and then re-apply a bunch of formatting and release it themselves...   I think many content authors don't think through the user experience either.  Protection sounds good so they do it even if it doesn't make sense or has negative side effects for users. For example, the example document I linked is a simple form that has to be mailed/faxed...it will either have the correct info when it arrives or it won't.  Nothing prevents a user from creating their own look-a-like (but slightly different) form and faxing it it.  Pretty sure the encryption is lose over post/fax.  ;)

But whether the standard makes sense, what the original intentions were and whether they still make sense today is obviously out of scope for chrome development.

I agree with you that chrome appears to be following the standard and all the other PDF reading applications on my machine don't.  Given that this standard doesn't seem to be widely followed and that when enforced it creates a very negative user experience (why can't I copy?!)...it would definitely be nice if there was some obvious indicators that copying isn't allowed by the document when a user tries to copy.

An alternative approach/interpretation might be to see copying small amounts of text (no images/formatting or full documents) as part of the accessibility permission. 
Components: -Blink Internals>Plugins>PDF
Status: Untriaged (was: Unconfirmed)
I can reproduce this issue. On the example PDF in this bug, text cannot be
cut-and-pasted. On other PDF documents it works fine.
Status: WontFix (was: Untriaged)
Yes, I agree the PDF permission bits are a bit silly. Our PDF Viewer honors them though, just like Acrobat Reader. This is not the first bug report about this issue, so we will stand by our previous decision to leave things as they are. If Acrobat's behavior changes in the future, we can certainly revisit the issue then.

I can't speak for other PDF viewers and their behaviors.

For UI to show permissions, that would fall under bug 652427.

Comment 7 by m...@sundryltd.com, May 24 2018

Thanks for the update.  Can you point me at any documentation around what accessibility features/apis/hooks chromium's PDF viewer provides?  The extract permission is given on these documents and I believe several of my use cases fall under use for accessibility purposes.  Looking for alternatives to just disabling the PDF plugin.
In the PDF Viewer, we do take the "copy for accessibility" bit into account, per spec. See PERMISSION_COPY_ACCESSIBLE in [1]. The PDF linked to from this bug report has that bit flipped to on.

Presumably screen readers that work with Chrome can "read" the text. I don't use them frequently enough to recommend one. Maybe try ChromeVox for starters?

[1] https://chromium.googlesource.com/chromium/src/+/524e60fb/pdf/pdfium/pdfium_engine.cc 

Sign in to add a comment