Language Detection

A Docker image that automatically detects the language of a PDF file. It uses a configuration file for customizable options and can be run with various command-line arguments.

Getting Started

To use this Docker application, you'll need to have Docker installed on your system. If Docker is not installed, please follow the instructions on the official Docker website to install it.

Run using Command Line Interface

To run docker container as CLI you should share the folder with PDF to process using -i parameter. In this example it's current folder.

docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest lang-detect -i input.pdf -o output.pdf

Just detect language and save language code to txt file

docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest lang-detect -i input.pdf -o output.txt

With PDFix License add these arguments.

--name ${LICENSE_NAME} --key ${LICENSE_KEY}

First run will pull the docker image, which may take some time. Make your own image for more advanced use.

For more detailed information about the available command-line arguments, you can run the following command:

docker run --rm pdfix/lang-detect:latest --help

Run OCR using REST API

Comming soon. Please contact us.

Exporting Configuration for Integration

To export the configuration JSON file, use the following command:

docker run -v $(pwd):/data -w /data --rm pdfix/lang-detect:latest config -o config.json

License

PDFix license https://pdfix.net/terms

Trial version of the PDFix SDK may apply a watermark on the page and redact random parts of the PDF including the scanned image in background. Contact us to get an evaluation or production license.

Help & Support

To obtain a PDFix SDK license or report an issue please contact us at [email protected]. For more information visit https://pdfix.net

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.github/workflows		.github/workflows
example		example
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
config.json		config.json
requirements.txt		requirements.txt
test.sh		test.sh
update_version.sh		update_version.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language Detection

Table of Contents

Getting Started

Run using Command Line Interface

Run OCR using REST API

Exporting Configuration for Integration

License

Help & Support

About

Releases 2

Packages

Contributors 2

Languages

pdfix/lang-detect

Folders and files

Latest commit

History

Repository files navigation

Language Detection

Table of Contents

Getting Started

Run using Command Line Interface

Run OCR using REST API

Exporting Configuration for Integration

License

Help & Support

About

Topics

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages