Automatic reading of text in the image of a document

The aim of this repository is to create a program capable of reading a text written in an image. The image contains a photograph of a page of a document which can be taken with a free angle of view (inclination, perspective effect, etc.). It is therefore a question of estimating this angle of view in order to straighten the image so that it seems to be taken from the front. Then, it is necessary to implement a detection of lines, words and then letters in order to reconstitute the entire text.

Prerequisites

matplotlib
opencv
numpy
scipy
python=3.8
jupyter
pytesseract

See more in requirements.txt and environment.yml

Installation

Install Jupyter Notebook by following the instructions from the official Jupyter website.
Install the required Python libraries by running the following command in the terminal:

pip install -r requirements.txt

or create a virtual env with conda (preferred method)

conda env create -f environment.yml

Download and install Tesseract OCR from the official Tesseract GitHub repository here.

Usage

Clone or download the repository to your local machine.
Open the auto_read.ipynb file in Jupyter Notebook.
Replace the path to the sample image in the code with the path to your own image.
Replace the path to the tesseract.exe in the code with the path relative to your own installation.
Run the cells in the Jupyter Notebook to see the output.

With CLI

python auto_read.py [path to your image (.png or .jpg)]

Result

The output will be the extracted text from the document image. The text will be displayed in a readable format and can be used for further processing or analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
auto_read		auto_read
docs		docs
img		img
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
auto_read.ipynb		auto_read.ipynb
auto_read.py		auto_read.py
environment.yml		environment.yml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic reading of text in the image of a document

Prerequisites

Installation

Usage

With CLI

Result

About

Releases

Packages

Languages

License

bewygs/auto-readable

Folders and files

Latest commit

History

Repository files navigation

Automatic reading of text in the image of a document

Prerequisites

Installation

Usage

With CLI

Result

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages