- the Council votes API was discontinued in May 2024. The function
elx_council_votes()
is no longer exported
- tempfile created for XML download now gets deleted
- some http calls were still not failing gracefully
- some leftover .data in tidyselect
- minor changes to documentation
- cleaned up http calls code
- calls to
elx_council_votes()
andelx_curia_list()
now fail gracefully - .data replaced by quoted variables for tidyselect functions
- Internet-using vignettes moved to site-only articles
- breaking change:
elx_run_query()
now strips URIs (except Eurovoc ones) by default and keeps only the identifier to reduce object size - where
elx_fetch_data()
is used to retrieve texts from an html document, it now uses by defaultrvest::html_text2()
instead ofrvest::html_text()
. This is slower but more resembling of how the page renders in some cases. New argumenthtml_text = "text2"
controls the setting. - new feature:
elx_make_query(..., include_court_origin = TRUE)
retrieves the country of origin of a court case. As per Eur-Lex documentation, this is primarily intended to be the country of the national court referring a preliminary question, but other countries are present in the data as well at the moment. Recommended to interact with court procedure - new feature:
elx_make_query(..., include_original_language = TRUE)
retrieves the authentic language of a document, typically a court case
- new feature:
elx_make_query(..., include_directory_code = TRUE)
retrieves the directory code of the document in question rather than its label (which can still be retrieved usinginclude_directory = TRUE
). This is useful to distinguish identical labels in different directories
- minor changes to vignettes and examples to reduce build time
- more stable connection through
elx_curia_list()
- fun CRAN policy compliance stuff
- all date variables retrieved through
elx_make_query(include_... = TRUE)
are now properly named - new experimental feature:
elx_make_query(include_citations_detailed = TRUE)
retrieves additional details about the citation where available; the retrieval is currently slow
elx_make_query(include_directory = TRUE)
now retrieves the directory code instead of URI- minor clean up of internals
- vignette lightly touched up
- new feature:
elx_make_query(include_proposal = TRUE)
retrieves the CELEX of a proposal of a requested legal act - the returned results from
elx_make_query()
no longer include previous versions of the same record (new versions typically fix incorrect or missing metadata). This reduces the number of duplicates previously appearing in the results
elx_make_query(include_author = TRUE)
now returns the human-readable label (institutional authors) instead of URI
elx_fetch_data(type = "notice", notice = c("tree","branch", "object"))
now mirrors the behaviour ofelx_download_xml()
but instead of saving to path gives access to XML notice in R- retrieve data on the Judge-Rapporteur, Advocate-General, court formation and court-curated scholarship using new
include_
options inelx_make_query()
- fixed bug in
elx_download_xml()
parameter checking elx_download_xml(notice = "object")
now retrieves metadata correctly
- download XML notices associated with Cellar URLs with
elx_download_xml()
- retrieve European Case Law Identifier (ECLI) with
elx_make_query(include_ecli = TRUE)
- host of smaller code improvements in
elx_fetch_data()
- more consistent and strict error generation across all server-interacting functions
- started adding unit tests
elx_run_query()
now fails gracefully in presence of internet/server problemselx_fetch_data()
now automatically fixes urls with parentheses (e.g. "32019H1115(01)" used to fail)
- minor fixes to vignette
elx_parse_xml
no longer an exported function
- it is now possible to select all resource types available with
elx_make_query(resource_type = "any")
. Since there are nearly 1 million CELEX codes, use with discretion and expect long execution times - results can be restricted to a particular directory code with
elx_make_query(directory = "18")
(directory code "18" denotes Common Foreign and Security Policy) - results can be restricted to a particular sector with
elx_make_query(sector = 2)
(sector code 2 denotes EU international agreements)
- new feature: request date of court case submission
elx_make_query(include_date_lodged = TRUE)
- new feature: request type of court procedure and outcome
elx_make_query(include_court_procedure = TRUE)
- new feature: request directory code of legal act
elx_make_query(include_directory = TRUE)
elx_curia_list()
has a new default parameterparse = TRUE
which creates separate columns forecli
,see_case
,appeal
applying regular expressions oncase_info
- new feature: request citations referenced in target resource with elx_make_query(include_citations = TRUE); retrieved in CELEX form
- new feature: request document author(s) with
elx_make_query(include_author = TRUE)
- XML parsing is now more efficient due to utilising (rather than stripping) namespaces (but still room for improvement)
- fixed bug in elx_label_eurovoc whereby resulting data frames contained list-columns
- hotfix for critical bug in xml parsing that scrambled column with legal basis where this was requested
- improvement to legal basis harvesting thanks to help from Eur-Lex insiders
- legal basis results are now slightly more comprehensive and correct
- legal basis results now include a new column detailing the "suffix" (paragraph, subparagraph, etc.) in string form
- minor updates to documentation
elx_fetch_data()
now prefers CELEX-based URLs (instead of Cellar URIs) as input, as they appear to yield fewer missing documents
elx_fetch_data("text")
now retrieves plain text from html, pdf and MS Word documents- the type of source file is documented
- added handling of multiple files: all available text is retrieved and concatenated
- so far no support for images requiring OCR for text extraction for the sake of limiting dependencies and avoiding prolonging execution time
- fixed serious bugs in
elx_curia_list()
- fixed bug in
elx_label_eurovoc()
elx_council_votes()
made fully operational
- optimization, reducing dependencies, etc.
- addition of proposals and national implementing laws to possible SPARQL queries
- EuroVoc topics, retrievable in all EU languages, can now be included in SPARQL results
- new date options (force, end of validity, transposition)
- added
elx_curia_list()
to retrieve full list of EU court cases
- switch from XML to xml2
- SPARQL package dependency removed
- cascading language options for
elx_fetch_data()