Ignoring headers and footers with PyPDFLoader #8548
Unanswered
olegshirokikh
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Using
PyPDFLoader
orDirectoryLoader
withloader_cls=PyPDFLoader
, is there any way to ignore headers and/or footers on PDF pages?PyPDF
module itself supports some callback mechanism for this kind of task, but not sure if it's possible to integrate with Langchain's APIsHere's the standard usage example from Langchain docs.
Beta Was this translation helpful? Give feedback.
All reactions