How do I turn a collection of XHTML files into a PDF? (lemm.ee)

submitted 7 months ago by Irelephant@lemm.ee to c/piracy@lemmy.dbzer0.com

I ripped a lot of XHTML files from a crappy online ebook reader. How do I combine these into a PDF?

[–] deegeese@sopuli.xyz 2 points 7 months ago (1 children)

There are a ton of options depending on your tech level.

How are you with basic Python scripts?

[–] Irelephant@lemm.ee 1 points 7 months ago (3 children)

I wrote the script to rip them in Bash. I know Python, Lua, JS, Bash, and PowerShell, so anything using those works.

[–] danielquinn@lemmy.ca 3 points 7 months ago

I've used pdfkit with considerable success. It has a few system-level dependencies, but the instructions are pretty straightforward:

# apt-get install wkhtmltopdf
$ pip install pdfkit
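
For a folder of ripped pages, the pdfkit route might look roughly like the sketch below. It assumes the files sit in one directory and that their names fall into reading order once the numbers in them are compared numerically. The helper names (`natural_key`, `collect_pages`, `combine_to_pdf`) and the `pages` / `book.pdf` names are made up for illustration; passing a list of input files to `pdfkit.from_file` is described in the pdfkit README.

```python
import re
from pathlib import Path


def natural_key(path):
    """Sort key that compares digit runs numerically, so page2 < page10."""
    # re.split with a capturing group alternates text, digits, text, ...
    parts = re.split(r"(\d+)", Path(path).name)
    return [int(p) if i % 2 else p.lower() for i, p in enumerate(parts)]


def collect_pages(directory, pattern="*.xhtml"):
    """Return the matching files in natural (human) order, as strings."""
    return [str(p) for p in sorted(Path(directory).glob(pattern), key=natural_key)]


def combine_to_pdf(directory, output="book.pdf"):
    """Render every page found in `directory` into a single PDF."""
    import pdfkit  # imported here so the helpers above work without it

    pages = collect_pages(directory)
    if not pages:
        raise FileNotFoundError(f"no .xhtml files found in {directory}")
    # pdfkit hands the whole list to wkhtmltopdf, which emits one document.
    pdfkit.from_file(pages, output)


# Hypothetical usage, assuming the ripped files are in ./pages:
# combine_to_pdf("pages", "book.pdf")
```

If images or stylesheets stored next to the pages go missing in the output, it may be worth checking whether your wkhtmltopdf build restricts local file access; that depends on the installed version.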

[–] deegeese@sopuli.xyz 3 points 7 months ago (1 children)

Surely you can figure out how to use existing libraries for this task, or is there something you’re stuck on?

[–] Irelephant@lemm.ee 2 points 7 months ago (1 children)

Can't really find many good ones. Google isn't returning much, just PDFs about Python libraries and the odd abandoned GitHub repo.

[–] deegeese@sopuli.xyz 2 points 7 months ago (1 children)

I’d start with wkhtmltopdf/pdfkit

[–] Irelephant@lemm.ee 1 points 4 months ago

Just coming back to this a bit later: wkhtmltopdf is abandoned. Are there any alternatives? It works fine for now, but it may not in the future.

[–] undefined@lemmy.hogru.ch 2 points 7 months ago* (last edited 7 months ago)

In a production web app I use Gotenberg. It’s definitely overkill for the task at hand, but if you find yourself doing this often I would highly recommend it. It’s dead easy to convert HTML (and I imagine XHTML) to PDF.
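
As a rough idea of what that looks like from a script: Gotenberg exposes an HTTP API, and its Chromium HTML route takes a multipart form upload with a file named `index.html`. The sketch below uses only the Python standard library and assumes a Gotenberg instance is already running locally on its default port 3000. The `build_request` and `convert` helpers are hypothetical names, and whether a given XHTML page renders cleanly when submitted this way is untested here.

```python
import urllib.request
import uuid
from pathlib import Path

# Assumption: Gotenberg is reachable locally on its default port.
GOTENBERG_URL = "http://localhost:3000/forms/chromium/convert/html"


def build_request(html_bytes, url=GOTENBERG_URL):
    """Build a multipart/form-data POST carrying one file named index.html."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        b'Content-Disposition: form-data; name="files"; filename="index.html"\r\n',
        b"Content-Type: text/html\r\n\r\n",
        html_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def convert(source, output):
    """Send one (X)HTML file to Gotenberg and save the PDF it returns."""
    request = build_request(Path(source).read_bytes())
    with urllib.request.urlopen(request) as response:
        Path(output).write_bytes(response.read())


# Hypothetical usage:
# convert("pages/page1.xhtml", "page1.pdf")
```

This converts one page per request, so each page comes back as its own PDF and the results would still need merging afterwards.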