Neural Document Classification

Document image classification with neural networks on a subset of the RVL-CDIP dataset [1].

Getting Started

The classification problem is tackled with two different approaches:

Visual approach over the image pixels with dense only and convolutional neural networks: skeleton.ipynb
Textual approach over the recognized image words with bag-of-words and word embedding models: skeleton_ocr.ipynb

It is recommended to begin with the visual approach as it includes more details about the computing environment setup and the dataset.

For a better experience, execute the notebooks within a Google Colab environment.

[1] A. W. Harley, A. Ufkes, K. G. Derpanis, "Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval," in ICDAR, 2015

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
test		test
.gitignore		.gitignore
README.md		README.md
download_dataset.py		download_dataset.py
download_tobacco_original_dataset.py		download_tobacco_original_dataset.py
extract_dataset.py		extract_dataset.py
ocr_input.py		ocr_input.py
skeleton.ipynb		skeleton.ipynb
skeleton_ocr.ipynb		skeleton_ocr.ipynb