-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
what is ruby (was emphasis markers) #6
Comments
Please write the absolute URL. |
https://github.com/martindholmes/rubyForTEI/blob/main/Images/26478854.jpg they're all in the images folder in this repo |
Thank you very much!
I hope each URL would be indicated as full path. |
Ruby in HTML5 might be able to be utilized in various cases of appended small characters and marks as a layout function. However, semantically, ruby is different as we wrote in the original proposal. So, the emphasis marks should not be encoded by
|
I already picked an example that includes https://github.com/martindholmes/rubyForTEI/blob/main/ruby_samples.xml |
Before we make a decision on what should be included, we need to answer the basic questions about how we define/identify ruby (as opposed to other annotation types), and what the intention of this current sprint is. I don't think we have any chance of getting complex and exhaustive documentation into the Guidelines for this release. I would really like to get the basic use-cases covered and released before we go on to the complexities, but I think that's for Council to decide. |
Thank you for consideration of the example. Aside from inclusion of the example into the Guidelines, I wonder whether the IDS should be included directly in |
I like the CSS approach to encoding emphasis; that aligns with our current recommendations on writing modes too. |
I think the examples which are described in the original proposal are enough for basic usage of tons of Japanese modern books, comics, subtitles of movies and other types of modern Japanese texts. While consideration of complex examples is fun and important for some users, I would like to brush up the proposed ones to put into the Guidelines, if possible. I would be glad if you would list what should we do for it. |
I would suggest that we will form a specialized guidelines outside of the TEI P5 guidelines for the complex usages like some other TEI-related communities. It was an advice by some experienced TEIers. According to it, we are now voluntarily trying to make Japanese guidelines for East Asian materials so far: https://github.com/TEI-EAJ/jp_guidelines/wiki . As we don't have any budget for it, it is very slow. But we would like to extend it to other languages such as Chinese, Korean, Vietnamese, English, and so on, if you and others would collaborate with us. Are you interested in that? |
For the East Asian ignorant among us, could someone explain what an “emphasis marker” is? |
@sydb Here what mentioned as "emphasis markers" are those highlighted punctuation marks in the given image. The English Wikipedia also has a brief introduction about it: https://en.wikipedia.org/wiki/Emphasis_point It is, in form, each separate mark that is typically put on the center-right side of a character, but conceptually, highlighting of important spans of text, only embodied discretely. In this sense it functionally resembles Sperrsatz or such practices. Though emphasis markers are sometimes put by the author, they are traditionally more often done by the reader (for their own sake) or the reviewer before publishing. There is no standardized shape of emphasis markers, so they often take different appearances in the same text, and look same with other punctuations, even ambiguous on the layout (which is the case seen in the image; a clause delimiter interrupts a continuous emphasis span). |
@knagasaki I have offered my help repeatedly, that is still the case. I thought, however, that this is what we are doing here already? @martindholmes the current sprint started November 02/2020, these are all still my original issues, as for budget and timetable s.a., I was also unaware of the release schedule timing.
This is crucial, if
Yes for demonstration purposes I picked a sequence that contains such a character, the use of IDS inside @sydb there is a link to the CSS definitions in the xml, but @747 has covered most of it thank you.
I wasn't going to point out the mix-n-match of regular punctuation with emphasis markers quite yet, but this is why documents containing both ruby and emphasis markers are important to cover imv, they are by no means a rare occurrence. I would contest the |
Thank you for your repeated offers. We're welcome to involve your activities on our Github repositories. And then, I'm very interested in the budget, if possible. Our activities have been conducted by free... |
The definition of ruby in the original proposal can be summarized as a type of interlinear glosses which represents pronunciation of the targeted text and can be regarded as a parallel text. I would like to quote an example from the image below: in a part of the following image: The encoded text is below: <p type="長歌" xml:id="manyo0131">
<lb/><ruby xml:id="seg00007">
<rb xml:id="seg00008">
<note place="right" resp="#定家" subtype="摘句" targetEnd="#note0175e" type="合点" xml:id="note00175">〽石見乃海つのヽうらわ</note>
石見乃海角乃浦廻乎浦無等人社見良目瀉<note place="right" resp="#定家" type="異文" targetEnd="#note00176e">無</note>无
<anchor type="noteEnd" xml:id="note00176e"/>等 <anchor type="noteEnd" xml:id="note0175e"/><note resp="#万葉集" type="割書" xml:id="note00177">一云礒
<milestone unit="wbr" />无登</note>
</rb>
<rt rend="left" xml:id="seg00009">イハミノウミツノヽウラハ
<note place="right" resp="#定家" type="異訓" xml:id="note00178">
ヲ
</note>
ニ
<anchor type="noteEnd"/>
ウラナシトヒトコソミラメカタナシト
</rt>
</ruby>
<lb/>
<ruby xml:id="seg00010">
<rb xml:id="seg00011">
人社見良目能咲八師浦者無友縦畫屋師滷者
<note resp="#万葉集" type="割書" xml:id="note00179">一云礒者</note>
無勒
</rb>
<rt rend="left" xml:id="seg00012">
ヒトコソミラメヨシヱヤシウラハナクトモヨシヱヤシシカタハナクトモ
</rt>
</ruby>
</p> Most of the annotations are marked up by |
If the example which you provided fits the definition of the |
Thank you very much for your significant contribution to improve the section! As far as I see, I know the following passage:
and an example: <glyph xml:id="U507D-var">
<!-- more properties here -->
<mapping type="IDS">⿻人為</mapping>
<mapping type="standard">偽</mapping>
</glyph> It recommends to use IDS in the header, but I couldn't find recommendation to use IDS in the base text. Actually, it is a little difficult to process an element |
Thank you very much for the nice example. I think it would be great to add it to this repo, as we can discuss a number of issues on it. About the markup exampleIn principle, I see no advantages of: <seg type="ruby">
<seg type="rb">...
<note type="rt" subtype="xyz">…</note>
</seg>
</seg> over <ruby>
<rb> … </rb>
<rt type="xyz"> ..</rb>
</ruby>
The typology you use here would make for great suggested values for the use of Some more detailed commentsNote Given your example I cannot see how the left side annotations ( Note What is a rubyHowever the real crux I believe stems from the desire to quickly address
On both grounds I lean towards the broader definition, incorporating all East Asian documents, since their history of mutual influence is significant. Phonation is a spurious concept when 漢子 are involved. Think of the Yuan dynasty, which frequently sounded-out Mongolian phonetics using 漢子 in writing. I would argue that both the Manchu example and the one |
Actually, https://candra.dhii.jp/iipsrv/iipsrv.fcgi?IIIF=/ongi_pub/1240226/1240226_0102.tif/2219,1018,549,1330/100,/0/default.jpg https://candra.dhii.jp/iipsrv/iipsrv.fcgi?IIIF=/nincho/C40-4508-1_01/0037.tif/1566,766,259,909/100,/0/default.jpg The Chinese tradition spread as copied texts in the East Asian world. However, the marginal areas of China culture such as Korean Peninsula and Japanese islands also developed their way of reading the Chinese texts historically as However, I've heard that in modern textbooks in China Anyway, I think it is not necessary to show an example from ancient document in the Guidelines. (But I will do it in a specialized guidelines). |
While I couldn't read Manchu, it might be a case of <lb/>有
<note type="原評" targetEnd="#noteEnd01">豪傑也及于江上淂之<note>
一漁父<anchor xml:id="noteEnd01">
撑船知其意乃渡乃… However, in the case of <lb/>
<ruby><rt>原評豪傑也及于江上淂之</rt>
<rb>有一漁父撑船知其意乃渡</rb></ruby>乃… In these case, the relationships between the gloss and the targeted text are different, |
The benefits are:
I do like your approach with The point of introducing a new element in my mind is to more accurately describe the textual features we find, and to make encoding easier. I dislike that the current definition of ruby prevents their use on the examples i ve given here. I also think that #5 could significantly lower the markup burden, especially of modern cases, so irrespective of the status of other East Asian documents,
It took me 5 tries to find a page that combines commentarial half-width with what i d call source: 董子春秋繁露 seq. 68 |
Thank you for listing the three. They are important and I basically agree that we should aim the benefits.
Regarding the consistency with other markup standards,
I'm sorry for my misleading. I've just copied and pasted the encoded example from a current TEI-validated text. As the
While I suggested the previous comment comparing usage of <lb/>有
<ruby><rt>原評豪傑也及于江上淂之</rt>
<rb>一漁父</rb></ruby>撑船知其意乃渡乃…
In this case, the relationship between the annotation and the targeted text is represented, but rendering information, that is, the location of the And then, I would like to know your preferable (or acceptable) definition of |
If we focus only the layout, interlinear glosses and emphasis markers seem to be found in the following image of a manuscript. Do you think whether the glosses should be encoded via https://dutchanglosaxonist.files.wordpress.com/2017/10/dotglosses1.jpg?w=730 |
Yes, both the function of providing (early) dutch readings of the latin words, and the layout choice for presenting it, fits the definition of ruby. |
I actually prefer your final version, <lb/>有
<ruby>
<rt>原評豪傑也及于江上淂之</rt>
<rb>一漁父</rb>
</ruby>
撑船知其意乃渡乃… but this version seem similarly acceptable, it should be left to encoders to pick how they interpret the relationship between base and target, <lb/>
<ruby>
<rt>原評豪傑也及于江上淂之</rt>
<rb>有一漁父撑船知其意乃渡</rb>
</ruby>
乃… |
Thank you for reply. Then, |
Thank you for reply on your thought. If you accept both, I think it is better that |
Am I missing the image of this example? Just curious… |
@knagasaki I ll try to sum up our very helpful discussion of various examples so far. I think that we should try to bring it to a conclusion, before we can put a final shape on the Guidelines prose to be presented to council. We agree that What is still an open question is if
I'll try focus our discussion about some of the specific remaining question by posting in the other issues. In both case, I would argue that #5 should be considered for the draft of the submission regardless of broad or narrow definition. If we adopt a broad definition #4 and #10 should also be considered. |
@kzhr https://dutchanglosaxonist.files.wordpress.com/2017/10/dotglosses1.jpg?w=730 end of post Its a very nice find. I don't have non-East-Asian examples readily available. |
Thank you for summarizing both. My concern in the broad definition is that |
So, I prefer the narrow definition. As we wrote in the original document, By the way, the Guidelines are implicitly close to the Western documents from my view. To extend the user community of TEI globally, it is necessary to include various conventions of textual cultures in the world. Establishing the SIG East Asian / Japanese followed by the Indic one was a symbolic matter in the long history of the TEI, I think. As this proposal is the first experience for the SIG, we're preparing for the next proposal to encode characteristic textual elements in most part of East Asian area. |
fair enough let's see what council says about |
and narrow definition vs broad definition? |
I don't believe this is ruby: |
Just a clarification: our current schema prescribes one rb followed by one or more rts, rather than the other way around; some of the encodings above have the rt first. |
It is better to be able to make order of |
@knagasaki It's easy to do that, but what's the reason for it? In general, I tend to prefer loose ordering for element content, but in this case it seems to me that the rb is the basis (by definition) and the rt is supplementary, and depends on it. The position of the rt is handled by |
For processing, it is better that |
@knagasaki Shall I leave the schema as it is (with rb always first), and then we can see if people need to use the alternative sequence in their work? For something like this, I feel that the order of elements, if it's variable, should have significance, but I don't see any significance here. |
I see. I can not show the evidence for it now. So, if a certain number of people will request it, I will suggest it again in the future. |
My colleague gave me some examples of ruby in textbooks in China! |
Nobody is saying you need |
@duncdrum I think @knagasaki makes a good point about the fact that existing communities already have encoding norms. I see no reason why you shouldn't use ruby tags for this if you want, but I don't think we should recommend it or exemplify it, just to introduce confusion into communities completely unfamiliar with ruby. |
Again, nobody said the anglo-saxon example should go into the Guidelines, its not in this repo either. I m all for addressing the needs of existing communities, like editors working on Chinese documents where they already have to use But from the looks of it there will be a difference in semantics of |
Import from main repo
see
Images/26478854.jpg
Emphasis markers when interspersed with ruby are another special case. Special because of the markup headaches with frequent problems of overlap, not because this is a rare occurrence. Technically the markers belong to the
rb
text, but they appear as part of the ruby stream/lineThee Guidelines should feature one of these complex examples to clarify how we suggest encoders deal with this:
@style/@rend
, … what about texts with both emphasis markers and ruby?<gaiji>
can appear both in<rb>
and<rt>
see #2
Not too happy with that example, comments and suggestions welcome.
The text was updated successfully, but these errors were encountered: