Font Fixing Using PDFix SDK

Automatically detects and fixes issues related to ISO 14289-1:2014, Clause 7.21.7 (Unicode character mapping requirements). It ensures that all text content has valid and complete Unicode mapping by repairing missing ToUnicode CMaps and applying OCR-based text reconstruction when Unicode information is absent, guaranteeing reliable text extraction and accessibility compliance.

Getting Started

To use this Docker application, you'll need to have Docker installed on your system. If Docker is not installed, please follow the instructions on the official Docker website to install it.

Run a Docker Container

The first run will pull the docker image, which may take some time. Make your own image for more advanced use.

Run Docker Container for Font Fixing

To run docker container as CLI you should share the folder with PDF to process using -v parameter. In this example it's current folder.

docker run -v $(pwd):/data -w /data --rm pdfix/font-fix-pdfix:latest fix-missing-unicode -i /data/input.pdf -o /data/output.pdf

If you want to use other OCR engine then default Tesseract OCR use parameter --engine with one of values Easy for Easy OCR or Rapid for Rapid OCR. If you want to fill other then space character when OCR fails to recognize character you can set it using parameter --default_char followed by your desired character.

For more detailed information about the available command-line arguments, you can run the following command:

docker run --rm pdfix/font-fix-pdfix:latest --help

Exporting Configuration for Integration

To export the configuration JSON file, use the following command:

docker run -v $(pwd):/data -w /data --rm pdfix/font-fix-pdfix:latest config -o config.json

License

PDFix SDK

Help & Support

To report an issue please contact us at support@pdfix.net. For more information visit https://pdfix.net

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
examples		examples
src		src
test		test
.gitignore		.gitignore
.mypy.ini		.mypy.ini
.ruff.toml		.ruff.toml
Dockerfile		Dockerfile
README.md		README.md
config.json		config.json
download_models.py		download_models.py
requirements.txt		requirements.txt
test.sh		test.sh
update_version.sh		update_version.sh
update_versions_repository.sh		update_versions_repository.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Font Fixing Using PDFix SDK

Table of Contents

Getting Started

Run a Docker Container

Run Docker Container for Font Fixing

Exporting Configuration for Integration

License

Help & Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Font Fixing Using PDFix SDK

Table of Contents

Getting Started

Run a Docker Container

Run Docker Container for Font Fixing

Exporting Configuration for Integration

License

Help & Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages