Compiled in Puppy 3.01 (works in Puppy 4.3.1 and hopefully in newer Puppies)
- http://dokupuppylinux.co.cc/programs:pdf_manipulation
includes:
cmapdump, fontdump, mupdfinfo, pdfclean, pdfdraw, pdfextract, pdfshow
cmapdump
usage: cmapdump output.c lots of cmap files
fontdump
usage: fontdump output.c input.dat
mupdfinfo
usage: pdfinfo [options] [file.pdf ... ]
-d - password for decryption
-f list fonts
-i list images
-m list dimensions
-p list patterns
-s list shadings
-x list form and postscript xobjects
NOTE: I renamed pdfinfo to mupdfinfo so that it does not conflict with pdfinfo from xpdf-utils.
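A quick overview of one or more PDFs could be scripted roughly like this. It is only a sketch: the function name pdf_overview is made up for illustration, it assumes the renamed binary mupdfinfo is in PATH, and it assumes the -f, -i and -m flags from the usage text can be combined in one call.

```shell
#!/bin/sh
# pdf_overview (hypothetical helper, not part of the package):
# list fonts, images and page dimensions of each PDF given as argument.
# Assumes the renamed binary "mupdfinfo" is in PATH.
pdf_overview() {
    for f in "$@"; do
        echo "== $f =="
        # -f fonts, -i images, -m dimensions (flags from the usage text)
        mupdfinfo -f -i -m "$f"
    done
}
```

It would be called as: pdf_overview manual.pdf book.pdf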
pdfclean
usage: pdfclean [options] input.pdf [output.pdf] [pages]
-p - password
-g garbage collect unused objects
-gg in addition to -g compact xref table
-ggg in addition to -gg merge duplicate objects
-d decompress streams
pages comma separated list of ranges
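As an illustration of the pdfclean options, here is a minimal sketch of a wrapper that rewrites a PDF with the most thorough clean-up level (-ggg). The name pdf_shrink is hypothetical, and the page list format "1-3,7" is my reading of "comma separated list of ranges", so treat it as an assumption.

```shell
#!/bin/sh
# pdf_shrink (hypothetical helper, not part of the package):
# rewrite a PDF with garbage collection, xref compaction and
# duplicate-object merging (-ggg in the usage text).
# $1 = input.pdf, $2 = output.pdf, $3 = optional page list, e.g. "1-3,7"
pdf_shrink() {
    if [ $# -lt 2 ]; then
        echo "usage: pdf_shrink input.pdf output.pdf [pages]" >&2
        return 1
    fi
    # ${3:+"$3"} adds the page list only when one was given.
    pdfclean -ggg "$1" "$2" ${3:+"$3"}
}
```

For example, pdf_shrink big.pdf small.pdf would clean the whole file, while pdf_shrink big.pdf part.pdf 1-3,7 would (under the assumption above) keep only those pages.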
pdfdraw
usage: pdfdraw [options] input.pdf [pages]
-o - output filename (%d for page number)
supported formats: pgm, ppm, pam, png
-p - password
-r - resolution in dpi (default: 72)
-A disable accelerated functions
-a save alpha channel (only pam and png)
-g render in grayscale
-m show timing information
-t show text (-tt for xml)
-x show display list
-5 show md5 checksums
pages comma separated list of ranges
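Rendering pages to image files is probably the most common use of pdfdraw, so here is a small sketch of it. The helper name pdf_to_png and the output naming scheme are made up for illustration; only the -o, -r and page list arguments come from the usage text, and the "1-3,7" page list format is an assumption.

```shell
#!/bin/sh
# pdf_to_png (hypothetical helper, not part of the package):
# render pages of a PDF to PNG files in the current directory.
# $1 = input.pdf
# $2 = resolution in dpi (defaults to 72, the same default as pdfdraw)
# $3 = optional page list, e.g. "1-3,7"
pdf_to_png() {
    pdf=$1
    dpi=${2:-72}
    # Build the output name from the PDF name; pdfdraw replaces %d
    # with the page number.
    base=$(basename "$pdf" .pdf)
    pdfdraw -o "${base}-%d.png" -r "$dpi" "$pdf" ${3:+"$3"}
}
```

For example, pdf_to_png book.pdf 150 1-3,7 should give files named like book-1.png, rendered at 150 dpi.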
pdfextract
usage: pdfextract [options] file.pdf [object numbers]
-p password
-r convert images to rgb
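The usage text does not say where pdfextract puts its output. The sketch below assumes the files are written into the current working directory, so it runs the tool inside a separate directory to keep things tidy. The helper name pdf_grab_images is hypothetical.

```shell
#!/bin/sh
# pdf_grab_images (hypothetical helper, not part of the package):
# run pdfextract inside a separate directory.
# ASSUMPTION: pdfextract writes its output files into the current
# working directory; the usage text does not say where they go.
# $1 = input.pdf, $2 = output directory, remaining args = object numbers
pdf_grab_images() {
    if [ $# -lt 2 ]; then
        echo "usage: pdf_grab_images input.pdf outdir [object numbers]" >&2
        return 1
    fi
    pdf=$1
    outdir=$2
    shift 2
    # Make the PDF path absolute, because we change directory below.
    case $pdf in
        /*) ;;
        *) pdf=$PWD/$pdf ;;
    esac
    mkdir -p "$outdir" || return 1
    # Run in a subshell so the cd does not affect the caller.
    # -r converts images to rgb (flag from the usage text).
    ( cd "$outdir" && pdfextract -r "$pdf" "$@" )
}
```

For example: pdf_grab_images scan.pdf extracted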
pdfshow
usage: pdfshow [options] file.pdf [xref] [trailer] [pagetree] [object numbers]
-b print streams as binary data
-e print encoded streams (don't decode)
-p password
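To look inside a PDF, something like the following sketch could be used. It assumes that trailer and pagetree in the usage line are literal keywords typed after the file name; the helper name pdf_peek is made up for illustration.

```shell
#!/bin/sh
# pdf_peek (hypothetical helper, not part of the package):
# print the trailer and the page tree of a PDF, followed by any
# object numbers given after the file name.
# ASSUMPTION: "trailer" and "pagetree" are literal keywords.
pdf_peek() {
    if [ $# -lt 1 ]; then
        echo "usage: pdf_peek file.pdf [object numbers]" >&2
        return 1
    fi
    pdf=$1
    shift
    pdfshow "$pdf" trailer pagetree "$@"
}
```

For example, pdf_peek file.pdf 3 7 would also print objects 3 and 7.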
MuPDF has support for all non-interactive PDF 1.7 features, and the toolkit provides a simple API for accessing the internal structures of the PDF document. Example code for navigating interactive links and bookmarks, encrypting PDF files, extracting fonts, images, and searchable text, and rendering pages to image files is provided.