-
Notifications
You must be signed in to change notification settings - Fork 490
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
10288 Add keywordTermURI metadata in keyword block #10371
10288 Add keywordTermURI metadata in keyword block #10371
Conversation
…h its name Controlled Vocabulary URL
Heya @stevenferey. Have you had a chance to check out the guidelines for metadata text at https://docs.google.com/document/d/1tY5t3gjrIgAGoRxVMWQSCh46fnbSmnFDLQ7aLkNLhJ8? |
datasetfieldtype.keywordValue.title=Term | ||
datasetfieldtype.keywordVocabulary.title=Controlled Vocabulary Name | ||
datasetfieldtype.keywordVocabularyURI.title=Controlled Vocabulary URL | ||
datasetfieldtype.keywordVocabularyURL.title=Controlled Vocabulary URL |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why does this need to change to URL from URI? (While you're adding Term URI
?) Minimally I think we should be consistent and, unless we're sure that there are no URNs in use, sticking with URI might be the better option. (I do see that we focus on URLs in the descriptions and the idea of web locations for things instead of talking about URIs as identifiers that might also be resolvable on the web. I'm not sure what the best way to resolve that is, or that it needs to be handled in this PR.) @jggautier - any thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure and might not understand this enough. But maybe I could help see if there are URNs in use?
In case this info is helpful, I think we used "keywordVocabularyURI" for the database name because DDI Codebook uses "vocabURI". And maybe we thought that most people would be more familiar with the term "URLs", so that's what we used for the label people see on the page and in the tooltips. Of course this was all before I joined Dataverse.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After discussion with Recherche-Data-Gouv team, we decided to rollback modifications from keywordVocabularyURI
to keywordVocabularyURL
. Update is coming.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Update is done @qqmyers :)
…adata-in-Keyword-block_v2 10288 - rollback keywordVocabularyURL to keywordVocabularyURI
@@ -0,0 +1,5 @@ | |||
-- update of the "keywordVocabularyURI" metadata to make it consistent with its name "Controlled Vocabulary URL" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is obsolete now?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The PR description also still mentions changing from URI to URL (not in the code, but might confuse in review or QA)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes absolutely, I delete the script and update the PR description
@jggautier - any reason this can't get on the board now to move forward? |
Hmm maybe. Could this get on the board in the next sprint's "This Sprint" column while we wait to hear back from @stevenferey about the tooltip text and how it does or doesn't follow the guidelines for metadata text? Or should that be resolved first? I'd like to make sure that those guidelines are easy to find and are helpful when folks are working on the text associated with metadata, like the field labels and tooltips shown in the UI. @stevenferey, I asked about this a few weeks ago because the current tooltip text for this Term URI field makes me think that you haven't seen those guidelines because they're hard to find or that the guidelines could be clearer. |
Sorry for the late response, Is this description better? We listen to your suggestions for improvement. |
Hey @stevenferey. Thanks for taking a look! The more important thing is to avoid adding instructions in the tooltip for how depositors should add metadata, since people looking for data also see those tooltips and that information isn't relevant for them. And we hope that the watermark and the field validation will ensure that depositors are entering what you expect. So I think we should remove the second sentence about entering an absolute URI. And the rest of the changes I suggested below are mostly style changes for consistency: A URI that points to the web presence of the Keyword Term |
It would be cool if we could encourage folks to use a tool that would apply our style guidelines to their text. The paid versions of Grammarly and a less popular tool called ProWritingAid say they can do this. Maybe there are free tools, too! |
Thank you for this feedback, I pushed a label update to follow the recommendation. |
@qqmyers after you asked me last if this could get on the board to move forward, I asked if it could while we waited to hear back about the tooltip text or if that should be addressed first. I think the issue with that text's been addressed, so I can't think of any reason why this couldn't get on the board to move forward. |
} | ||
} | ||
|
||
if (StringUtils.isNotBlank(subject)) { | ||
subject_check = writeOpenTag(xmlw, "subjects", subject_check); | ||
writeSubjectElement(xmlw, subjectScheme, schemeURI, subject, language); | ||
// we prioritize the keywordTermURI metadata to populate schemeURI | ||
writeSubjectElement(xmlw, subjectScheme, StringUtils.isNotBlank(keywordTermURI) ? keywordTermURI : keywordVocabURI, subject, language); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@stevenferey - The DataCite Schema includes both a schemeURI and valueURI attributes - should this now include both rather than swapping the term URI into the schemeURI attribute?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@qqmyers , Thank you for this feedback
Indeed we can consider using the two metadata as specified in the xsd
We would obtain the following export:
<subjects>
<subject>Agricultural Sciences</subject>
<subject valueURI="https://term-uri.com/" schemeURI="https://vocabulary-uri.com/" subjectScheme="cvoc name">keyword term</subject>
</subjects>
Does this correspond to the proposal?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@qqmyers, The branch is updated with the modification of the OpenAire export. Thanks a lot
@stevenferey - I think the change looks good. There is a (probably unrelated) failing test though that may be related to destroying a dataset - can you merge with develop again to pick up #10566. Details: expected: <404> but was: <500> |
Hello @qqmyers, as @stevenferey is in holiday for a full month, I will handle the next exchanges. |
@luddaniel - thanks. We're seeing the test fail in other branches, one with only docs changes, so definitely unrelated. That said, we're looking to see if we can understand why it started failing recently and will probably want to pick up your try/catch fix. I'll let you know if there's anything else to do. |
Yes, I cherry-picked it here: Thanks, @luddaniel! |
@stevenferey we cherry-picked a commit from this PR into #10601 and merged it. Can you please merge develop into this PR? Thanks. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is done at this point. The last build failed with the AWS instance not starting but probably not related. (Should rerun at some point.)
I think this is how all fields of type url work, i.e. not related to the one field added in the PR. |
What this PR does / why we need it:
To continue on the subject of OntoPortal integration
Which issue(s) this PR closes:
Closes #10288
Special notes for your reviewer:
In addition to the addition of the
keywordTermURI
field, we added a script allowing administrators to optionally migrate data fromkeywordValue
tokeywordTermURI
when the data starts with "http".Does this PR introduce a user interface change? If mockups are available, please link/include them here:
Before data migration, here is an example with a
keywordValue
containing data starting with "http" :After optional data migration :
Is there a release notes update needed for this change?:
Yes, a proposal is visible in the file doc/release-notes/10288-add-term_uri-metadata-in-keyword-block.md