-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Details about SemanticTextAnnotation #640
Comments
Does the specification imply that the new type bioschema:SemanticTextAnnotation (STA in the follow) is being introduced too? Else, How can one distinguish a STA from any other schema:CreativeWork? Via dct:conformsTo? Is schema:CreativeWork the best base class for this? I've seen schemaorg/schemaorg#1905 that use schema:Action or schema:CreateAction. A STA could be modelled as an action in the sense of the action that created the association between the original text and some entity. An advantage of that would be that we could use schema:instrument (eg, to trace the software that produced the annotation). Also, schema:result would play the role that now is played by schema:mainEntity, while schema:object would be used in place of schema:text and schema:subjectOf. An advantage of that would be that it could link both the original document (eg, an article) and some structured information about the specific text fragment the annotation is derived from (eg, the sentence and its position within the article). Currently, a way to do the same is maybe: As an alternative, you might want to keep things under CreativeWork, but at least add bioschema:SemanticTextAnnotation to the domain of schema:instrument. |
Thanks, @gtsueng. What about As for the rest: If a new
Also, I think the general problem here is
Basically, yes, with this other approach, the annotation is an action, which of input is a text and the output is a defined term (or, as I wrote, something else). |
Hi @marco-brandizi-- On the matter of While the bioschemas:SemanticTextAnnotation has not been defined a new bioschemas Type that will be pushed to schema.org, it is still a bioschemas profile. From a JSON-schema/JSON-LD expression/definition standpoint, this is still a subclass of CreativeWork, even if it is not a bioschemas type. If you look at the JSON-LD file defining this "class", it's literally a subclass of CreativeWork. Hence, you could point your |
This profile has never been used and has been hanging there since the beginning. The date on the latest profile comes from a massive update with sameAs as recommended on all profiles. This profile is a candidate for deprecation. |
@ljgarcia thanks for the clarification. I think I'll stick to the action approach for now. But I think there's a number of text mining use cases (and in general, provenance tracking cases) that require some definitions like this profile. |
I'd like some clarifications on the SemanticTextAnnotation profile, which appears to be a new proposed profile here to associate texts with semantic annotations.
Does the specification imply that the new type
bioschema:SemanticTextAnnotation
(STA in the follow) is being introduced too? Else, How can one distinguish a STA from any otherschema:CreativeWork
? Viadct:conformsTo
?Restricting
schema:mainEntity
toschema:DefinedTerm
is very limiting to me: I have use cases where the output of a text mining tool can be: an ontology term, a recognised entity like a gene (schema:mainEntity
would point to its URL or even to its symbol or accession, via a plain string value), or a more complex structure, such as a reified statment, as shown here. In schema.org this hasschema:Thing
in the range, I think it would be useful to keep it like that for STAs too (most schema validators accepts plain text by default). Also, note that other annotation models have the same general approach (eg, Web Annotation Model)schema:CreativeWork
the best base class for this? I've seen similar derivation examples that useschema:Action
orschema:CreateAction
. A STA could be modelled as an action in the sense of the action that created the association between the original text and some entity.schema:instrument
(eg, to trace the software that produced the annotation).schema:result
would play the role that now is played byschema:mainEntity
, whileschema:object
would be used in place ofschema:text
andschema:subjectOf
. An advantage of that would be that it could link both the original document (eg, an article) and some structured information about the specific text fragment the annotation is derived from (eg, the sentence and its position within the article). Currently, a way to do the same is maybe:schema:isBasedOn <sentence as CreativeWork> [schema:partOf <article>]
, but to me, it looks like less explicit and more confusing.CreativeWork
, but at least addbioschema:SemanticTextAnnotation
to the domain ofschema:instrument
.The text was updated successfully, but these errors were encountered: