Document Denoiser

Leveraging CNNs in Auto Encoder Architecture to remove noise from scanned documents and get encoder - decoder pair models for encoding and decoding clean documents.

This project aims to showcase the practical use of Autoencodersfor denoising documents while leveraging their inherent capacity for image encoding. My model, constructed using Convolutional Neural Networks within the Autoencoder Architecture, is trained on a dataset provided by RM.J. Castro-Bleda, S. España-Boquera, J. Pastor-Pellicer, F. Zamora-Martinez, available through the UCI machine learning repository.

We will also explore the Vanilla Auto Encoder for the same purpose.

The model's objective is to remove or reduce noise found in textual documents, such as watermarks and wrinkles, and provide clean versions along with their corresponding encoded images.

Quick Glance

Noisy Image - Encoded Image - Clean Image

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
a1		a1
test		test
train		train
train_cleaned		train_cleaned
.DS_Store		.DS_Store
Document Denoiser.ipynb		Document Denoiser.ipynb
README.md		README.md
aecnn_history.csv		aecnn_history.csv
autoencoder.h5		autoencoder.h5
decoder.h5		decoder.h5
encoder.h5		encoder.h5
img1.png		img1.png
img2.png		img2.png
img3.png		img3.png
img4.png		img4.png
img5.jpeg		img5.jpeg
img6.png		img6.png
img7.png		img7.png
img8.jpg		img8.jpg
sae_history.csv		sae_history.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document Denoiser

Quick Glance

About

Releases

Packages

Languages

itsadnanlone/documentDenoiser

Folders and files

Latest commit

History

Repository files navigation

Document Denoiser

Quick Glance

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages