-
-
Notifications
You must be signed in to change notification settings - Fork 114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement extract_from_text
to get neutral citations for pasuperct
#1251
Comments
@flooie is this a neutral citation? I was checking reporters-db, and it's listed as a variation of a state citation The indigo book lists it as a "public domain" citation |
This is working as far as parsing goes; but there is a validation bug in Courtlistener that makes us unable to ingest it. We have PRs addressing the problem. After that, it's a matter of re-running (Not so sure about the start date)
Output: |
We gained around 1638 citations from this run. There may be more due to a bug on the first backscrape (missing pagination); which I will collect in another issue |
Neutral citations are present inside the document's text, but we are not collecting them. Once we implement this, and freelawproject/courtlistener#4520 is merged, we can collect those citations
Example
Related to #858 (comment)
The text was updated successfully, but these errors were encountered: