A backend for mdbook in Python for generating PDF based on Chrome DevTools Protocol.

Last update: Dec 27, 2022

Related tags

Overview

mdbook-pdf

A backend for mdbook written in Python for generating PDF based on Chrome DevTools Protocol.

Python library dependency

Usage

Put mdbook-pdf in PATH. Have google-chrome/chromium available (in PATH or location configured).

Ensure you have installed python selenium library, corresponding chromedriver is in PATH or in the book repo or location configured.

Build it with mdbook build command. Make sure the following exists in your book.toml:

[output.html]

[output.html.print]
enable = true

[output.pdf]

if you are using Windows, Put this script in the book repo, add the following line to [output.pdf] in your book.toml:

command = "python ../../mdbook-pdf"

Configuration

Check book.toml for available configurations of [output.pdf].

Known issue

Sometimes the program may crash with errors like this:

selenium.common.exceptions.WebDriverException: Message: unknown error: session deleted because of page crash
from unknown error: cannot determine loading status
from tab crashed

This may be led by resources outrage.

To-do

Rewrite the whole thing in Rust, directly call to Chrome DevTools Protocol instead of using selenium.

Python script that split PDF files.

Automatic PDF Splitter This script can create new single-page PDFs files from multipaged PDFs. Requirements Python 3.0+ # Debian distros sudo apt-get

5 Apr 2, 2022

borb is a library for reading, creating and manipulating PDF files in python.

2.9k Jan 1, 2023

Python lib for Simple PDF text extraction

651 Jan 1, 2023

x-ray is a Python library for finding bad redactions in PDF documents.

A tool to detect whether a PDF has a bad redaction

73 Dec 19, 2022

This book will take you on an exploratory journey through the PDF format, and the borb Python library.

281 Jan 1, 2023

Simple python tool created for downloading PDF.

PDFdownloader Usage Open PDF in full-screen mode Run scan.exe Enter how many pages you want to scan Focus PDF After scanning is done, run merge.exe En

5 Oct 27, 2021

A simple pdf size compressing telegram robot witten in python.

Pdf Compressor Telegram Bot ##About : A simple pdf size compressing telegram robot witten in python. Mostly useful for digital documentation. Deploy t

22 Oct 28, 2022

Converting Html files to pdf using python script, pdfkit module and wkhtmltopdf.

Html-to-pdf-pdfkit-wkhtml- This repository has code for converting local html files and online html resources into pdf. It is an python script which u

1 Nov 9, 2021

Small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface

1.8k Dec 29, 2022

Comments

Update mdbook to 0.4.21 and publish to crates.io

Would it be possible to update mdbook to version 0.4.21 and update the release on crates.io? This version contains a fix for https://github.com/rust-lang/mdBook/issues/1860 to enable building using Rust 1.64. In the meantime, I'm creating a patch for this in nixpkgs but would love to be able to pick it up just by updating the version.

opened by tjni 1
feature request: Generate PDF with page outlines
I'm not sure what you mean by "outlines"?

Maybe adding a new keyword in output.html.print, like outline = true/false

[output.html.print] enable = true page-break = true outline = true # new keyword to include the 'ToC' in pdf print.

Originally posted by @schwrzstrbn in https://github.com/rust-lang/mdBook/issues/1817#issuecomment-1163854290
duplicate
opened by schwrzstrbn 1

Releases(v0.1.4)

v0.1.4(Dec 29, 2022)

v0.1.4: add support for export theme that is different than the html version #5
Source code(tar.gz)
Source code(zip)
mdbook-pdf-v0.1.4-aarch64-apple-darwin(8.50 MB)
mdbook-pdf-v0.1.4-aarch64-pc-windows-msvc.exe(6.53 MB)
mdbook-pdf-v0.1.4-aarch64-unknown-linux-gnu(13.49 MB)
mdbook-pdf-v0.1.4-aarch64-unknown-linux-musl(13.42 MB)
mdbook-pdf-v0.1.4-x86_64-apple-darwin(8.58 MB)
mdbook-pdf-v0.1.4-x86_64-pc-windows-msvc.exe(7.29 MB)
mdbook-pdf-v0.1.4-x86_64-unknown-linux-gnu(13.16 MB)
mdbook-pdf-v0.1.4-x86_64-unknown-linux-musl(13.43 MB)
mdbook_pdf_outline-0.1.0-py3-none-any.whl(18.41 KB)
v0.1.3(Oct 5, 2022)

Resolves #4
Source code(tar.gz)
Source code(zip)
mdbook-pdf-v0.1.3-aarch64-apple-darwin(8.40 MB)
mdbook-pdf-v0.1.3-aarch64-pc-windows-msvc.exe(6.46 MB)
mdbook-pdf-v0.1.3-aarch64-unknown-linux-gnu(13.41 MB)
mdbook-pdf-v0.1.3-aarch64-unknown-linux-musl(13.35 MB)
mdbook-pdf-v0.1.3-x86_64-apple-darwin(8.50 MB)
mdbook-pdf-v0.1.3-x86_64-pc-windows-msvc.exe(7.21 MB)
mdbook-pdf-v0.1.3-x86_64-unknown-linux-gnu(13.05 MB)
mdbook-pdf-v0.1.3-x86_64-unknown-linux-musl(13.34 MB)
mdbook_pdf_outline-0.1.0-py3-none-any.whl(18.41 KB)
v0.1.2(Feb 8, 2022)

Enable to provide the static hosting site URL. If you have relative links that link outside the book, it can get fixed.
Source code(tar.gz)
Source code(zip)
mdbook-pdf-v0.1.2-aarch64-unknown-linux-gnu.zip(3.76 MB)
mdbook-pdf-v0.1.2-x86_64-apple-darwin.zip(2.75 MB)
mdbook-pdf-v0.1.2-x86_64-freebsd.zip(2.97 MB)
mdbook-pdf-v0.1.2-x86_64-pc-windows-msvc.zip(2.40 MB)
mdbook-pdf-v0.1.2-x86_64-unknown-linux-gnu.zip(3.74 MB)
v0.1.1(Feb 7, 2022)
Wait for MathJax loading finished before generating PDF.

Expand all the details elements before generating PDF.

Source code(tar.gz)
Source code(zip)
mdbook-pdf-v0.1.1-aarch64-unknown-linux-gnu.zip(3.74 MB)
mdbook-pdf-v0.1.1-x86_64-apple-darwin.zip(2.74 MB)
mdbook-pdf-v0.1.1-x86_64-pc-windows-msvc.zip(2.38 MB)
mdbook-pdf-v0.1.1-x86_64-unknown-linux-gnu.zip(3.74 MB)
v0.1.0(Jan 21, 2022)

Initial release for the mdbook-pdf.

A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol.
Source code(tar.gz)
Source code(zip)
mdbook-pdf-v0.1.0-aarch64-unknown-linux-gnu.zip(3.62 MB)
mdbook-pdf-v0.1.0-x86_64-apple-darwin.zip(2.63 MB)
mdbook-pdf-v0.1.0-x86_64-pc-windows-msvc.zip(2.28 MB)
mdbook-pdf-v0.1.0-x86_64-unknown-linux-gnu.zip(3.62 MB)

Owner

Hollow Man

Pursuing LZU BEng CS | '20 @alibaba SoC & @linuxfoundation LiFT Scholarship China | '21 GSoC @openSUSE | '20 & '21 @isrc-cas OSPP Summer

GitHub Repository

PyMuPDF is a Python binding with support for MuPDF

PyMuPDF is a Python binding with support for MuPDF (current version 1.18.*), a lightweight PDF, XPS, and E-book viewer, renderer, and toolkit, which is maintained and developed by Artifex Software, I

1.9k Jan 03, 2023

Small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface

1.8k Dec 29, 2022

A tool for certificate PDF generation.

certificate-pdf-generator 获奖证书PDF批量生成工具 | a Tool for certificate PDF generation. ⚠️ 下载前请注意本项目使用了LFS来存储PDF等大文件。在克隆或下载本仓库前，请先使用apt等包管理器安装git-lfs包。如果已经克

4 Nov 28, 2022

Performing the following operations using python on PDF.

Python PDF Handling Tutorial Python is a highly versatile language with a huge set of libraries. It is a high level language with simple syntax. Pytho

131 Dec 16, 2022

pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as input

pystitcher pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative input in the form of a mark

387 Dec 10, 2022

WeasyPrint is a smart solution helping web developers to create PDF documents.

WeasyPrint is a smart solution helping web developers to create PDF documents. It turns simple HTML pages into gorgeous statistical reports, invoices, tickets…

5.4k Jan 08, 2023

minipdf is a package for creating simple, single-page PDF documents.

minipdf minipdf is a package for creating simple, single-page PDF documents. Installation You can install the development version from GitHub with: #

41 Dec 19, 2022

DietPDF aims at reducing PDF file size while not degrading quality nor losing metadata

6 Jul 27, 2022

Extract the table in the PDF，outputs the data similar to the json format

extract the table in the PDF，outputs the data similar to the json format

3 Nov 25, 2021

Excalibur: A web interface to extract tabular data from PDFs

Excalibur: A web interface to extract tabular data from PDFs Excalibur is a web interface to extract tabular data from PDFs, written in Python 3! It i

1.2k Jan 04, 2023

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Multilingual PDF to Text Install Package from Pypi Install it using pip. pip install multilingual-pdf2text The library uses Tesseract which can be ins

49 Nov 07, 2022

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code. It then generates a report of its findings. Extract references (pdf, url, doi, arxiv) and metadata fr

22 Nov 21, 2022

Simple HTML and PDF document generator for Python - with built-in support for popular data analysis and plotting libraries.

Esparto is a simple HTML and PDF document generator for Python. Its primary use is for generating shareable single page reports with content from popular analytics and data science libraries.

76 Dec 12, 2022

Camelot is a Python library that makes it easy for anyone to extract tables from PDF files

Camelot: PDF Table Extraction for Humans Camelot is a Python library that makes it easy for anyone to extract tables from PDF files! Note: You can als

3.3k Jan 06, 2023

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

A bulk html pdf generator. This application can generate PDFs in bulk by using just one click. Screenshots Requirements 🧱 Your system must have the f

3 Apr 23, 2022

Merge multiple PDF files into one.

PDF Merger Merge multiple PDF files into one. Usage % python pdf_merger.py -h usage: pdf_merger.py [-h] [-o OUTPUT] [-f [FILES ...]] optional argumen

6 Oct 03, 2022

PyPDF2 is a pure-python PDF library capable of splitting, merging together, cropping, and transforming the pages of PDF files.

PyPDF2 is a pure-python PDF library capable of splitting, merging together, cropping, and transforming the pages of PDF files. It can also add custom data, viewing options, and passwords to PDF files

5k Jan 04, 2023

A backend for mdbook in Python for generating PDF based on Chrome DevTools Protocol.

Related tags

Overview

mdbook-pdf

Usage

Configuration

Known issue

To-do

You might also like...

Python script that split PDF files.

borb is a library for reading, creating and manipulating PDF files in python.

Python lib for Simple PDF text extraction

x-ray is a Python library for finding bad redactions in PDF documents.

This book will take you on an exploratory journey through the PDF format, and the borb Python library.

Simple python tool created for downloading PDF.

A simple pdf size compressing telegram robot witten in python.

Converting Html files to pdf using python script, pdfkit module and wkhtmltopdf.

Small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface

Comments

Update mdbook to 0.4.21 and publish to crates.io

feature request: Generate PDF with page outlines

Releases(v0.1.4)

v0.1.4(Dec 29, 2022)

v0.1.3(Oct 5, 2022)

v0.1.2(Feb 8, 2022)

v0.1.1(Feb 7, 2022)

v0.1.0(Jan 21, 2022)

Owner

Hollow Man

PyMuPDF is a Python binding with support for MuPDF

Small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface

A tool for certificate PDF generation.

Performing the following operations using python on PDF.

pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as input

WeasyPrint is a smart solution helping web developers to create PDF documents.

minipdf is a package for creating simple, single-page PDF documents.

DietPDF aims at reducing PDF file size while not degrading quality nor losing metadata

Extract the table in the PDF，outputs the data similar to the json format

Excalibur: A web interface to extract tabular data from PDFs

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Simple HTML and PDF document generator for Python - with built-in support for popular data analysis and plotting libraries.

Camelot is a Python library that makes it easy for anyone to extract tables from PDF files

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

Merge multiple PDF files into one.

PyPDF2 is a pure-python PDF library capable of splitting, merging together, cropping, and transforming the pages of PDF files.

x-ray is a Python library for finding bad redactions in PDF documents.

Table automatically extraction from PDF Document

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched