New issue
Advanced search Search tips

Issue 597370 link

Starred by 1 user

Issue metadata

Status: Assigned
Owner:
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 3
Type: Bug



Sign in to add a comment

Wrongly detected pagination leads to repeated content in DOM distiller

Project Member Reported by wychen@chromium.org, Mar 23 2016

Issue description

For the "pagenum" pagination algorithm, the following URL has false positive for "next page" link.

http://altwall.net/texts.php?show=bullet&number=2642

The detected next-page link is actually pagination for the comment only. This results in repeated content in the stitched view.

One easy workaround is to skip appending the content if it's exactly the same with the first page, but this probably won't be robust enough.
 

Comment 1 by wychen@chromium.org, Mar 23 2016

Issue 596310 and issue 448463 are related.
Status: Assigned (was: Untriaged)

Sign in to add a comment