A Docker image that automatically detects the language of a PDF file. It uses a configuration file for customizable options and can be run with various command-line arguments.
To use this Docker application, you'll need to have Docker installed on your system. If Docker is not installed, please follow the instructions on the official Docker website to install it.
To run docker container as CLI you should share the folder with PDF to process using -i
parameter. In this example it's current folder.
docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest lang-detect -i input.pdf -o output.pdf
Just detect language and save language code to txt file
docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest lang-detect -i input.pdf -o output.txt
With PDFix License add these arguments.
--name ${LICENSE_NAME} --key ${LICENSE_KEY}
First run will pull the docker image, which may take some time. Make your own image for more advanced use.
For more detailed information about the available command-line arguments, you can run the following command:
docker run --rm pdfix/lang-detect:latest --help
Comming soon. Please contact us.
To export the configuration JSON file, use the following command:
docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest config -o config.json
- PDFix license https://pdfix.net/terms
Trial version of the PDFix SDK may apply a watermark on the page and redact random parts of the PDF including the scanned image in background. Contact us to get an evaluation or production license.
To obtain a PDFix SDK license or report an issue please contact us at [email protected]. For more information visit https://pdfix.net