GROBID author extraction is broken #462
Labels
type: bug
Something isn't working
Comments
drjova added three commits to inspirehep/inspire-next that referenced this issue (Apr 17, 2024)
drjova added a commit to inspirehep/inspirehep that referenced this issue (Apr 17, 2024)
Due to kermitt2/grobid#1093, author extraction through GROBID is broken. This affects both the author extraction through the editor on hep and in the workflows on next.
As a workaround, we should explicitly request the XML format by passing an `Accept: application/xml` header in the GROBID requests made at https://github.com/inspirehep/inspire-next/blob/fab1c19d33d7a5c6ba543aedcc9f9ddef814787a/inspirehep/modules/workflows/tasks/actions.py#L1094 and https://github.com/inspirehep/inspirehep/blob/da50059baa6b7470681e8587c7934f9cc9d324f9/backend/inspirehep/matcher/api.py#L236.
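
A minimal sketch of the workaround, not the actual patch: it builds a POST request that asks GROBID for XML explicitly. The helper name `build_grobid_header_request`, the `/api/processHeaderDocument` endpoint, the `input` form field and the `paper.pdf` filename are assumptions for illustration; the code at the two locations linked in this issue may call GROBID differently.

```python
import uuid
import urllib.request


def build_grobid_header_request(grobid_url, pdf_bytes):
    """Build a POST request to GROBID that explicitly asks for XML.

    Hypothetical helper: the endpoint path and the "input" field name
    are assumptions, not taken from the INSPIRE code base.
    """
    boundary = uuid.uuid4().hex
    # Hand-rolled multipart/form-data body carrying the PDF in one part.
    body = b"".join([
        ("--%s\r\n" % boundary).encode("ascii"),
        b'Content-Disposition: form-data; name="input"; filename="paper.pdf"\r\n',
        b"Content-Type: application/pdf\r\n\r\n",
        pdf_bytes,
        ("\r\n--%s--\r\n" % boundary).encode("ascii"),
    ])
    return urllib.request.Request(
        url=grobid_url.rstrip("/") + "/api/processHeaderDocument",
        data=body,
        headers={
            # The workaround: do not rely on the server's default format.
            "Accept": "application/xml",
            "Content-Type": "multipart/form-data; boundary=" + boundary,
        },
        method="POST",
    )
```

The request can then be sent with `urllib.request.urlopen(req)`. With an HTTP client library, the equivalent change would be adding the same `Accept` entry to the headers passed with the existing call.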