I would like to compress scanned text (monochrome or few colours) and store it in pdf (maybe djvu) files. I remember that I got very good results with Windows/Acrobat and "ZRLE" compressed monochrome tiff embedded into pdf. The algorithm was loossless as far as I remember. Now I search a way to obtain good results on linux. It should be storage saving and avoid loss (I do not mind loosing colours, but I do not want e.g. jpeg compression which would create noisy results for text scans). I need it for batch conversion, so I was thinking of the ImageMagick convert command. But which output format should I use so I get good results and to be able to embed it into pdf files (for example using pdflatex)? Or is it generally better to use djvu files?
efficient image compression for pdf embedding with linux
2.6k views Asked by highsciguy At
2
There are 2 answers
1
yms
On
DJVU is not a bad choice, but if you want to stay in PDF for better compatibility you may want to look into lossless JBIG2 compression.
Quote from Wikipedia:
Overall, the algorithm used by JBIG2 to compress text is very similar to the JB2 compression scheme used in the DjVu file format for coding binary images.
Related Questions in LINUX
- Is there some way to use printf to print a horizontal list of decrementing hex digits in NASM assembly on Linux
- Why does Hugo generate different taxonomy-related HTML on different OS's?
- Writes in io_uring do not advance the file offset
- Why `set -o pipefail` gives different output even though the pipe is not failing
- what really controls the permissions: UID or eUID?
- Compiling eBPF program in Docker fails due to missing '__u64' type
- Docker container unable to make HTTPS requests to external API
- Whow to use callback_query_handler in Python 3.10
- Create kea runtime directory at startup in Yocto image
- Problem on CPU scheduling algorithms in OS
- How to copy files into the singularity sandbox?
- Android kernel error: undefined reference to `get_hw_version_platform'
- Is there a need for BPF Linux namespace?
- Error when trying to execute a binary compiled in a Kali Linux machine on an Ubuntu system
- Issue with launching application after updating ElectronJs to version 28.0.0 on Windows and Linux
Related Questions in PDF
- How to use custom font during html to pdf conversion?
- How to get content of BLOCK types LAYOUT_TITLE, LAYOUT_SECTION_HEADER and LAYOUT_xx in Textract
- PDF form checkbox/radio button ignores content stream
- Suggest python library for rendering html to pdf files
- Problems with the order in which PDF files are created
- Centering a map element on a generated PDF
- download all pdf files from website doesn't support wildcard
- How to enter external pdf into quarto book while keeping page layout+numbering
- How do I create a website that combines user input and standard text and converts it into a pdf?
- Excel VBA error 1004 on PDF export - not a path issue
- downloading pdf using requests not working
- Creating pdf on Firestore with Pdfplum: Template path "no such object"
- Export password protected PDF from QGIS
- XPS convert PDF with Ghostscript
- Download PDF in ASP.NET MVC application
Related Questions in IMAGE-PROCESSING
- RuntimeError: Given groups=1, weight of size [64, 1, 3, 3], expected input[1, 3, 416, 416] to have 1 channels, but got 3 channels instead
- Unable to open shape_predictor_68_face_landmarks.dat
- When transferring mri t1 to mni152 spaces, the dimensions change and lose information, is that not a problem?
- How to detect the exact boundary of a Sudoku using OpenCV when there are multiple external boundaries?
- Nuke BlinkScript: Why does the convolution kernel scale down the image?
- CV2 Python - image merging based on homography matrix - error in mergeing
- Python pillow library text align center
- Implementing Image Processing for Dimension Measurement in Arduino-based Packaging System
- AI tools for generating clean clipping paths
- efficient way to remove a background from an image in python
- I want to segment an MRI image of the spine and obtain only the vertebrae using Matlab
- Find Gradient Magnitude using skimage.feature.hog module
- AR Image Display Issue
- Using python OpenCV to crop an image based on reference marks
- Python: Generating an image using Multiprocessing freezes
Related Questions in IMAGEMAGICK
- ImageMagick / Bash : pipe ignored(?) when filename format variable used
- imagemagick for extracting the fewest continuous clockwise external boundary points from black and white image
- Converting Kodak PCD Files to TIF?
- How can I install ImageMagick and Imagick latest in a Plesk server
- How to remove grey/black artifacts at the edges of a png with transparent background image using ImageMagick
- How to retrieve the "Title" field from Lightroom on a JPEG using ImageMagick?
- Image Magick: insert image in another image: preserving aspect ratio with minimum instead of maximum size
- How to remove ringing/anti-aliasing/gradients from PNG with ImageMagick?
- flatten layers aligned by centers instead of aligned by top-left corners
- How to composite more than 2 images with imagemagick?
- How to stop gravity option screwing up imagemagick parenthesised inputs?
- ImageMagick convert composite many crops from the same image?
- using caret ^ with imagemagick on m1 mac, zsh: no matches found
- How to convert Magick::Image from ImageMagick 7 to QImage?
- Why is the use-Tag from SVG in ImageMagick ignored?
Related Questions in DJVU
- Where are PDFs stored from ddjvu or djvu2pdf?
- linux tools that deal with metadata
- Extracting text from DjVu with Apache Tika
- How to convert all djvu files to pdf
- How to view DJVU file in UIWebview
- DjVu thumbnails are not being generated at all
- ArrayBuffer to blob conversion
- How to access the custom class members from a call back method
- Loading files in-browser from a server and .djvu files do not load
- How to check if the user's browser can view DJVU files using JavaScript
- How to convert DJVU file into PDF
- PInvoke - Issue while calling DJVU function from C# code. Attempted to read or write protected memory
- Is there a good quality djvu to pdf converter out there?
- How to convert hundreds of DJVU files to TIFF files by parsing a JavaScript file?
- VB.NET set embedded object src to byte array? dynamically set src value
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
jbig2enc encoder for images using jbig2 compression, was originally written for GoogleBooks by Adam Langley
https://github.com/agl/jbig2enc
I forked to include latest improvements By Rubypdf and others
https://github.com/DingoDog/jbig2enc
I also built several binaries of jbig2enc for puppy linux (it can be working also on other distributions)
http://dokupuppylinux.info/programs:encoders