This repository has been archived by the owner on Jan 21, 2022. It is now read-only.

niklasbo/menu-reader-pdf2img


Menu Reader: PDF to Image Service

Crawls a menu website and converts the PDFs it finds into images.

Archive Note 📒

This service was archived because of the Covid-19 pandemic. Everybody works remotely, so there is no need for it anymore.

API

Path Function
/pdf2image crawls the website, finds the PDFs, and saves them as images in the database
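
The link-finding part of the crawl could look roughly like the sketch below. It is not the service's actual code: the names `PdfLinkParser` and `find_pdf_urls` are hypothetical, and it assumes that a PDF link is any `<a>` tag whose `href` contains the configured identifier and that relative links resolve against the base URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkParser(HTMLParser):
    """Collects href values of <a> tags that contain a given identifier."""

    def __init__(self, href_identifier: str) -> None:
        super().__init__()
        self.href_identifier = href_identifier
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags can carry the PDF links we are looking for.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and self.href_identifier in value:
                self.hrefs.append(value)


def find_pdf_urls(html: str, base_url: str, href_identifier: str) -> list[str]:
    """Return absolute URLs for all links whose href contains the identifier."""
    parser = PdfLinkParser(href_identifier)
    parser.feed(html)
    # Assumption: relative hrefs are resolved against the base URL.
    return [urljoin(base_url, href) for href in parser.hrefs]
```

Called with the overview page's HTML, the value of MENU_BASE_URL, and the value of MENU_OVERVIEW_HREF_IDENTIFIER, this would return the PDF URLs to download and convert.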

Environment Variables

  • MENU_BASE_URL, string, base page to crawl, example: https://www.my-menu-provider.de/

  • MENU_OVERVIEW_URL_ENDING, string, specific page which contains links to the pdfs, example: speiseplan.html

  • MENU_OVERVIEW_HREF_IDENTIFIER, string, the part of the href that identifies a link to a PDF (<a href=Identifier>), example: assets/context/my-menu-provider/Speiseplan/

  • MONGODB_CONNECTION_STRING, string, connection string, example: mongodb+srv://user:password@cluster/dbname?retryWrites=true&w=majority

  • MONGODB_DATABASE, string, database name, example: menudb

  • MONGODB_COLLECTION, string, collection to write the base64 encoded images to, example: weeknum-and-image

  • IMAGE_DPI, int, image resolution in DPI, example: 160

  • PDF_PAGE_NUMBER, int, page number of the menu within each PDF, example: 1
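
As a minimal sketch of how these variables might be read at startup, the example below loads them into a typed settings object. The `Settings` class, `load_settings` function, and `overview_url` property are hypothetical names. The sketch also assumes that every variable is required and that the overview URL is the base URL joined with the URL ending.

```python
import os
from dataclasses import dataclass
from typing import Mapping
from urllib.parse import urljoin


@dataclass(frozen=True)
class Settings:
    menu_base_url: str
    menu_overview_url_ending: str
    menu_overview_href_identifier: str
    mongodb_connection_string: str
    mongodb_database: str
    mongodb_collection: str
    image_dpi: int
    pdf_page_number: int

    @property
    def overview_url(self) -> str:
        # Assumption: the overview page is the base URL plus the ending.
        return urljoin(self.menu_base_url, self.menu_overview_url_ending)


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read all variables; raises KeyError if one is missing."""
    return Settings(
        menu_base_url=env["MENU_BASE_URL"],
        menu_overview_url_ending=env["MENU_OVERVIEW_URL_ENDING"],
        menu_overview_href_identifier=env["MENU_OVERVIEW_HREF_IDENTIFIER"],
        mongodb_connection_string=env["MONGODB_CONNECTION_STRING"],
        mongodb_database=env["MONGODB_DATABASE"],
        mongodb_collection=env["MONGODB_COLLECTION"],
        # The two numeric variables arrive as strings and are converted here.
        image_dpi=int(env["IMAGE_DPI"]),
        pdf_page_number=int(env["PDF_PAGE_NUMBER"]),
    )
```

Failing fast on a missing or non-numeric value keeps configuration mistakes visible at startup instead of surfacing later during a crawl.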
