Skip to content
This repository was archived by the owner on Mar 28, 2025. It is now read-only.
/ pdfextbook Public archive

Extract pages from PDF on bookmark boundaries.

License

Notifications You must be signed in to change notification settings

raffaem/pdfextbook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdfextbook

Archived

Use PDFsam basic instead. It's FLOSS, has a GUI, and better maintained.

Introduction

Extract pages from PDF files on bookmark boundaries.

PDF bookmarks point to a single page, but all the pages starting from the one pointed to by the bookmark until the one pointed to by the next bookmark at the same hierarchical level (excluded) will be extracted.

Therefore this tool is useful to extract chapters or sections from books or articles.

It is more powerful than the "Extract Bookmarked Pages" tool of Adobe Acrobat (and it's also free).

You will need the following software present in your path:

  • pdftk (needed to extract bookmark metadata and to extract pages)
  • fzf (needed to select which bookmark to extract)

Installation

This software is available on PyPI, so the best way to install it is by using pipx:

pipx install pdfextbook

Alternatively, you can clone the repo and install it with pip:

pip install .

About

Extract pages from PDF on bookmark boundaries.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages