Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update how "series" is indexed #247

Open
ggeisler opened this issue Jul 15, 2016 · 3 comments
Open

Update how "series" is indexed #247

ggeisler opened this issue Jul 15, 2016 · 3 comments

Comments

@ggeisler
Copy link

ggeisler commented Jul 15, 2016

A suggestion from the Metadata group for updating how we index series:

Currently the only way that archival series information is indexed in Spotlight was developed specifically for the Feigenbaum collection, and involves using a regular expression to pull series, box, and folder information out of the <location><physicalLocation> element. However, archival series is more appropriately and commonly expressed in <relatedItem type="host" displayLabel="Series">, as it represents intellectual rather than physical arrangement. The request is to add this element to Spotlight's indexing of series, in order to make it available for faceting and to avoid the need for future metadata remediation.

An example where series info is in the <relatedItem> field is the Posada collection: https://searchworks.stanford.edu/view/4561410

@caaster
Copy link

caaster commented Aug 5, 2016

@caaster to investigate with @arcadia in order to articulate acceptance criteria

@arcadiafalcone
Copy link

arcadiafalcone commented Aug 24, 2016

  1. If there is an instance of <relatedItem> with attributes type="host" and displayLabel="Series", concatenate the values of its subelements (concatenation string TBD) and index that combined value as a series.
  2. Series should remain a nonrepeatable element, with the criteria above as primary (continue to current parsing of <physicalLocation> if the above doesn't return a value).
  3. The end result for the user is that the combined value of the subelements of <relatedItem type="host" displayLabel="Series"> is available for the Series facet in Spotlight.

Current series indexing is in stanford-mods/physical_location.rb.

@cbeer
Copy link
Member

cbeer commented Jan 18, 2017

Is it possible to simply apply the same concatenation rules for the relatedItem's title as we are currently using for parsing the main object?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants