-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(community): Extend DocxLoader to load .doc files #7421
base: main
Are you sure you want to change the base?
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎
1 Skipped Deployment
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you! Some small nits and questions
Can you please also run yarn format
from root? The .mdx
files get formatted as well
docs/core_docs/docs/integrations/document_loaders/file_loaders/docx.mdx
Outdated
Show resolved
Hide resolved
} catch (e) { | ||
console.error(e); | ||
throw new Error( | ||
"Failed to load word-extractor. Please install it with eg. `npm install word-extractor`." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No way to just use mammoth
is there?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, mammoth
can only handle .docx
Extends the existing
DocxLoader
to handle loading.doc
files.Uses word-extractor as a peer dependency for loading .doc