Leveraging CNNs in an autoencoder architecture to remove noise from scanned documents and to obtain an encoder-decoder model pair for encoding and decoding clean documents.
This project aims to showcase the practical use of autoencoders for denoising documents while leveraging their inherent capacity for image encoding. My model, built from Convolutional Neural Networks (CNNs) within an autoencoder architecture, is trained on a dataset provided by RM.J. Castro-Bleda, S. España-Boquera, J. Pastor-Pellicer, F. Zamora-Martinez, available through the UCI Machine Learning Repository.
The project also explores the vanilla autoencoder for the same purpose.
The model's objective is to remove or reduce noise found in text documents, such as watermarks and wrinkles, and to output a clean version of each document along with its corresponding encoded image.
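
The denoising objective described above (noisy input, clean target, with a compact code in between) can be illustrated with a small, dependency-free sketch. This is not the project's CNN model: it is a toy vanilla (dense) autoencoder written in plain Python, trained on made-up 8-pixel "document strips" rather than the UCI dataset. The class name `TinyDenoisingAutoencoder`, the helpers `add_noise` and `mean_loss`, the layer sizes, and the learning rate are all assumptions chosen for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyDenoisingAutoencoder:
    """Hypothetical dense autoencoder: n_pixels -> n_code -> n_pixels."""

    def __init__(self, n_pixels, n_code, seed=0):
        rng = random.Random(seed)
        self.w_enc = [[rng.uniform(-0.5, 0.5) for _ in range(n_pixels)]
                      for _ in range(n_code)]
        self.b_enc = [0.0] * n_code
        self.w_dec = [[rng.uniform(-0.5, 0.5) for _ in range(n_code)]
                      for _ in range(n_pixels)]
        self.b_dec = [0.0] * n_pixels

    def encode(self, pixels):
        # Encoder half: compress the input into a short code.
        return [sigmoid(sum(w * p for w, p in zip(row, pixels)) + b)
                for row, b in zip(self.w_enc, self.b_enc)]

    def decode(self, code):
        # Decoder half: reconstruct pixel values in (0, 1) from the code.
        return [sigmoid(sum(w * c for w, c in zip(row, code)) + b)
                for row, b in zip(self.w_dec, self.b_dec)]

    def train_step(self, noisy, clean, lr=0.5):
        # Forward pass on the NOISY input; the loss compares to the CLEAN target.
        code = self.encode(noisy)
        out = self.decode(code)
        n = len(clean)
        # Gradient of mean squared error through the output sigmoid.
        d_out = [2.0 * (o - c) / n * o * (1.0 - o) for o, c in zip(out, clean)]
        # Backpropagate to the code layer (computed before any weight update).
        d_code = [sum(d_out[i] * self.w_dec[i][j] for i in range(n))
                  * code[j] * (1.0 - code[j]) for j in range(len(code))]
        for i in range(n):
            for j in range(len(code)):
                self.w_dec[i][j] -= lr * d_out[i] * code[j]
            self.b_dec[i] -= lr * d_out[i]
        for j in range(len(code)):
            for k in range(len(noisy)):
                self.w_enc[j][k] -= lr * d_code[j] * noisy[k]
            self.b_enc[j] -= lr * d_code[j]


def add_noise(clean, rng, scale=0.2):
    # Gaussian noise clipped to the valid pixel range [0, 1].
    return [min(1.0, max(0.0, p + rng.gauss(0.0, scale))) for p in clean]


def mean_loss(model, pairs):
    total = 0.0
    for noisy, clean in pairs:
        out = model.decode(model.encode(noisy))
        total += sum((o - c) ** 2 for o, c in zip(out, clean)) / len(clean)
    return total / len(pairs)


# Toy data: 1.0 = blank paper, 0.0 = ink; each clean strip gets 10 noisy copies.
rng = random.Random(42)
clean_patterns = [
    [1, 1, 0, 0, 1, 1, 0, 0],
    [0, 0, 1, 1, 0, 0, 1, 1],
    [1, 0, 1, 0, 1, 0, 1, 0],
    [1, 1, 1, 1, 0, 0, 0, 0],
]
pairs = [(add_noise(c, rng), c) for c in clean_patterns for _ in range(10)]

model = TinyDenoisingAutoencoder(n_pixels=8, n_code=3)
loss_before = mean_loss(model, pairs)
for _ in range(200):
    for noisy, clean in pairs:
        model.train_step(noisy, clean)
loss_after = mean_loss(model, pairs)

encoded = model.encode(pairs[0][0])   # the "encoded image" (3 values)
denoised = model.decode(encoded)      # the cleaned-up reconstruction (8 values)
print(f"loss before: {loss_before:.4f}, after: {loss_after:.4f}")
```

After training, `encode` and `decode` can be called separately, which mirrors the encoder-decoder pair mentioned at the top of this README. A convolutional version would replace the dense layers with convolution and upsampling layers so that the spatial layout of the page is preserved.
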
![Noisy Image - Clean Image](https://private-user-images.githubusercontent.com/155386596/294009334-fa32ae8a-44a9-4513-83e7-24ee740b70e6.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNjYyODQsIm5iZiI6MTczOTI2NTk4NCwicGF0aCI6Ii8xNTUzODY1OTYvMjk0MDA5MzM0LWZhMzJhZThhLTQ0YTktNDUxMy04M2U3LTI0ZWU3NDBiNzBlNi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjExJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMVQwOTI2MjRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03MjM5MTg5ZWNiMDE3ZDhhZDA4MTc4MGE5NTI1ZWE0OTUyYjY1YjBlMzU1MWUzNjg4ODM0YWY4YzBjZTY5ZGUzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.K4Vgkzt6uhikbP5uohvn7Yv1ePINqarm2QJWI0sP8yI)
![Noisy Image - Clean Image](https://private-user-images.githubusercontent.com/155386596/294009937-9b5fbcd9-d32f-489e-8905-d70787cce79d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNjYyODQsIm5iZiI6MTczOTI2NTk4NCwicGF0aCI6Ii8xNTUzODY1OTYvMjk0MDA5OTM3LTliNWZiY2Q5LWQzMmYtNDg5ZS04OTA1LWQ3MDc4N2NjZTc5ZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjExJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMVQwOTI2MjRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zNDMzYjNkNjI0MzdkZWJhNzA4ODliY2JjYjQwNmI2MjEyMzUzNjA2ZTkxYjMyZmVmNTAyNTRkOWM0YjQxODE0JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.e5fYGKoZ0mMMh0b2NOMl4lURuhBtg_qPBz_e1W7qlLg)
![Noisy Image - Encoded Image - Clean Image](https://private-user-images.githubusercontent.com/155386596/294010030-f042e089-24c5-45fb-b719-fe8524fec2bc.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNjYyODQsIm5iZiI6MTczOTI2NTk4NCwicGF0aCI6Ii8xNTUzODY1OTYvMjk0MDEwMDMwLWYwNDJlMDg5LTI0YzUtNDVmYi1iNzE5LWZlODUyNGZlYzJiYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjExJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMVQwOTI2MjRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00OTBmMzc1ZGIxNWNiZjRhYWNlZGMyODE3MjY3ZTUxM2VmNmIxM2E5NzhjZjA1Y2Q3ZmJlNjRkMTIzZjI3ZGI4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.e465xYdalYEhg8vA4vurIm9DrHFSXu8r59u1f1eNzig)