I'm trying to crawl www.professeurphifix.net and I have an issue with embedded PDFs.
Let's focus on https://www.professeurphifix.net/orthographe_impression/ortho_a_1.html as an example.
The code showing the PDF is:
The PDF is therefore not explored by the crawler by default, but this is not a big deal thanks to the "recent" --selectLinks setting ;)
Command used:
With this "tweak", the resulting WARC contains the PDF, but "something" seems to prevent it from being displayed on replayweb.page (and, obviously, in the ZIM as well).
Am I missing something? Or is this rather a wombat.js issue?
Sample WARC with the HTML and the PDF:
rec-da74c0c8fc0b-20250328092919995-0.warc.gz
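For anyone who wants to check what was actually recorded for the PDF, here is a minimal sketch that lists the response records of a gzipped WARC together with a few HTTP headers. It uses only the Python standard library and a simplified reading of the WARC record layout (header lines, a blank line, then `Content-Length` bytes). The function names `parse_headers` and `iter_responses` and the selection in `INTERESTING` are my own illustrative choices, not part of the crawler or of replayweb.page, and I am only assuming these headers are worth a look; I am not claiming they are the cause.

```python
import gzip
import sys

# Headers that commonly influence whether a browser renders an embedded
# document inline (an assumption for this sketch, not a confirmed diagnosis).
INTERESTING = ("content-type", "content-disposition",
               "x-frame-options", "content-security-policy")


def parse_headers(raw: bytes) -> dict:
    """Parse 'Name: value' lines into a dict with lower-cased names."""
    headers = {}
    for line in raw.split(b"\r\n"):
        name, sep, value = line.partition(b":")
        if sep:  # lines without a colon (e.g. the HTTP status line) are skipped
            headers[name.strip().lower().decode("latin-1")] = (
                value.strip().decode("latin-1"))
    return headers


def iter_responses(stream):
    """Yield (target_uri, http_headers) for each 'response' record.

    `stream` is a binary file-like object over the decompressed WARC.
    """
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if not line.startswith(b"WARC/"):
            continue  # blank separator lines between records
        raw = b""
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the WARC header section
            raw += line
        warc = parse_headers(raw)
        # Read exactly the declared block so payload bytes are never parsed.
        block = stream.read(int(warc.get("content-length", "0")))
        if warc.get("warc-type") != "response":
            continue
        http_head = block.partition(b"\r\n\r\n")[0]
        yield warc.get("warc-target-uri", ""), parse_headers(http_head)


if __name__ == "__main__":
    # Usage: python list_warc.py rec-da74c0c8fc0b-20250328092919995-0.warc.gz
    with gzip.open(sys.argv[1], "rb") as f:
        for uri, headers in iter_responses(f):
            print(uri)
            for name in INTERESTING:
                if name in headers:
                    print(f"    {name}: {headers[name]}")
```

Running it against the sample WARC above should show whether the PDF response is present and which of these headers the server sent with it.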