Replies: 2 comments 6 replies
-
Hi! You should choose an encoding scheme that is easy to query using regular expressions. What easy means is in the eye of the user 😃 E.g. when using a fixed length for a single year (like 0100 instead of 100) so you can make range queries like Also, you should consider whether it is more practical to have this encoded into two different annotations for the start and end date. This probably does not make it easier to find open-ranges, but it allows using the "value identity" operator |
Beta Was this translation helpful? Give feedback.
-
Generally I find ISO dates easiest to work with, because they sort properly and have predictable numbers of digits, e.g. today is: 2023-05-19 including leading zeros etc. So you can get the thirteenth century like this: meta::date=/12.*/ Or thirteenth to fourteenth: meta::date=/1[23].*/ If you need dates BC it's a little more complicated but if they're all BC it's easy, and otherwise you can use a disjunction. |
Beta Was this translation helpful? Give feedback.
-
I would like to specify the date of composition for each work in my corpus, which I am likely to represent as a range (a fictitious example:
-120:-110
). Ideally, users should be able to specify a range in their query in ANNIS, and be able to retrieve all works/tokens falling at least within part of that range (for example,-130:-115
should also retrieve-120:-110
, because the latter partly falls within the former).Are there corpora where the date issue has already been tackled? In any case, what would be the best strategy to express this kind of annotation in PAULA XML? Should one first identify one "chunck" for all tokens of a work and then attach the date annotation to it?
Beta Was this translation helpful? Give feedback.
All reactions