diff --git a/documentation/Cyrillic/Cyrillic.md b/documentation/Cyrillic/Cyrillic.md index 479dd4436..24964fb55 100644 --- a/documentation/Cyrillic/Cyrillic.md +++ b/documentation/Cyrillic/Cyrillic.md @@ -4,34 +4,34 @@ ## Table of content + Introduction [:arrows_counterclockwise:](#intro) -+ [Cyrillic characters](#cyrch) -+ [Cyrillic extensions](#cyrext) -+ [Extended Cyrillic](#extcyr) -+ [Historic letters. Uncode range (0460 : 0481)](#histletr) -+ [Old Cyrillic](#oldcyr) -+ [Old Church Slavonic combining letters](#oldcs) -+ [Historic miscellaneous and numeric signs](#hmns) -+ [Abbreviation mark](#abrmrk) -+ [Punctuation mark](#puncmrk) -+ [Combining marks for Old Cyrillic](#cmbmrkoldcyr) -+ [Combining half marks](#cmbmhlfmrk) -+ [Modifier letter](#mdltr) -+ [Additions for Nivkh](#nivkh) -+ [Komi letters](#komi) -+ [Khanty letters](#khanty) -+ [Mordvin letters](#mordvin) -+ [Kurdish letters](#kurdish) -+ [Aleut letters](#aleut) -+ [Chuvash letters](#chuvash) -+ [Abkhazian letters](#abkhazian) -+ [Azerbaijani letters](#azerbaijani) -+ [Orok letters](#orok) -+ [Historic letter variants](#histltr) -+ [Miscellaneous characters](#miscltr) -+ [Letters for Old Abkhasian orthography](#oldabhltr) -+ [Intonation marks for Lithuanian dialectology](#intmrklith) -+ [Phonetic transcription](#phontrs) -+ [Sources](#src) ++ Cyrillic characters [:arrows_counterclockwise:](#cyrch) ++ Cyrillic extensions [:arrows_counterclockwise:](#cyrext) ++ Extended Cyrillic [:arrows_counterclockwise:](#extcyr) ++ Historic letters. Uncode range (0460 : 0481) [:arrows_counterclockwise:](#histletr) ++ Old Cyrillic [:arrows_counterclockwise:](#oldcyr) ++ Old Church Slavonic combining letters [:arrows_counterclockwise:](#oldcs) ++ Historic miscellaneous and numeric signs [:arrows_counterclockwise:](#hmns) ++ Abbreviation mark [:arrows_counterclockwise:](#abrmrk) ++ Punctuation mark [:arrows_counterclockwise:](#puncmrk) ++ Combining marks for Old Cyrillic [:arrows_counterclockwise:](#cmbmrkoldcyr) ++ Combining half marks [:arrows_counterclockwise:](#cmbmhlfmrk) ++ Modifier letter [:arrows_counterclockwise:](#mdltr) ++ Additions for Nivkh [:arrows_counterclockwise:](#nivkh) ++ Komi letters [:arrows_counterclockwise:](#komi) ++ Khanty letters [:arrows_counterclockwise:](#khanty) ++ Mordvin letters [:arrows_counterclockwise:](#mordvin) ++ Kurdish letters [:arrows_counterclockwise:](#kurdish) ++ Aleut letters [:arrows_counterclockwise:](#aleut) ++ Chuvash letters [:arrows_counterclockwise:](#chuvash) ++ Abkhazian letters [:arrows_counterclockwise:](#abkhazian) ++ Azerbaijani letters [:arrows_counterclockwise:](#azerbaijani) ++ Orok letters [:arrows_counterclockwise:](#orok) ++ Historic letter variants [:arrows_counterclockwise:](#histltr) ++ Miscellaneous characters [:arrows_counterclockwise:](#miscltr) ++ Letters for Old Abkhasian orthography [:arrows_counterclockwise:](#oldabhltr) ++ Intonation marks for Lithuanian dialectology [:arrows_counterclockwise:](#intmrklith) ++ Phonetic transcription [:arrows_counterclockwise:](#phontrs) ++ Sources [:arrows_counterclockwise:](#src) ## Introduction [:arrows_counterclockwise:](#tc_intro) @@ -50,7 +50,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | **Combining Half Marks: [U+FE2E–U+FE2F](https://www.unicode.org/charts/PDF/UFE20.pdf),** 2 Cyrillic characters | In the tables below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and canonical decompositions are included. | -## [Cyrillic characters](#tc_cyrch) +## Cyrillic characters [:arrows_counterclockwise:](#tc_cyrch) ### Basic Cyrillic alphabet. Unicode range (0410 : 044F) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | @@ -88,7 +88,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 042E | Ю | CYRILLIC CAPITAL LETTER YU | 044E | ю | CYRILLIC SMALL LETTER YU | | | 042F | Я | CYRILLIC CAPITAL LETTER YA | 044F | я | CYRILLIC SMALL LETTER YA | | -### [Cyrillic extensions. Unicode range (0400 : 040F, 0450 : 045F)](#tc_cyrext) +### Cyrillic extensions. Unicode range (0400 : 040F, 0450 : 045F) [:arrows_counterclockwise:](#tc_cyrext) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |--|--|--|--|--|--|--| @@ -111,7 +111,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: -### [Extended Cyrillic](#tc_extcyr) +### Extended Cyrillic [:arrows_counterclockwise:](#tc_extcyr) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -168,7 +168,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 04F8 | Ӹ | CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS042B 0308 | 04F9 | ӹ | CYRILLIC SMALL LETTER YERU WITH DIAERESIS044B 0308 | -### [Historic letters. Uncode range (0460 : 0481)](#tc_histletr) +### Historic letters. Uncode range (0460 : 0481) [:arrows_counterclockwise:](#tc_histletr) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |--|--|--|--|--|--|--| @@ -191,7 +191,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 0480 | Ҁ | CYRILLIC CAPITAL LETTER KOPPA | 0481 | ҁ | CYRILLIC SMALL LETTER KOPPA | From the ancient Greek Ϙ "Koppa (letter)" | -### [Old Cyrillic](#tc_oldcyr) +### Old Cyrillic [:arrows_counterclockwise:](#tc_oldcyr) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| | A640 | Ꙁ | CYRILLIC CAPITAL LETTER ZEMLYA | A641 | ꙁ | CYRILLIC SMALL LETTER ZEMLYA | @@ -222,7 +222,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A69A | Ꚛ | CYRILLIC CAPITAL LETTER CROSSED O | A69B | ꚛ | CYRILLIC SMALL LETTER CROSSED O | | -### [Old Church Slavonic combining letters](#tc_oldcs) +### Old Church Slavonic combining letters [:arrows_counterclockwise:](#tc_oldcs) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -270,7 +270,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A69F | ꚟ | COMBINING CYRILLIC LETTER IOTIFIED E | -### [Historic miscellaneous and numeric signs](#tc_hmns) +### Historic miscellaneous and numeric signs [:arrows_counterclockwise:](#tc_hmns) | Code | Char | Name Canonical decomposition | Comment | |--|--|--|--| @@ -287,14 +287,14 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A672 |   ꙲ | COMBINING CYRILLIC THOUSAND MILLIONS SIGN | | -### [Abbreviation mark](#tc_abrmrk) +### Abbreviation mark [:arrows_counterclockwise:](#tc_abrmrk) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| | A66F | ꙯ | COMBINING CYRILLIC VZMET | * used with Cyrillic letters to indicate abbreviation | | -### [Punctuation mark](#tc_puncmrk) +### Punctuation mark [:arrows_counterclockwise:](#tc_puncmrk) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -302,7 +302,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A67E | ꙾ | CYRILLIC KAVYKA | * used to mark off word that has alternative reading | -### [Combining marks for Old Cyrillic](#tc_cmbmrkoldcyr) +### Combining marks for Old Cyrillic [:arrows_counterclockwise:](#tc_cmbmrkoldcyr) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -310,7 +310,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A67D | ꙽ | COMBINING CYRILLIC PAYEROK | * may indicate an omitted yer | -### [Combining half marks](#tc_cmbmhlfmrk) +### Combining half marks [:arrows_counterclockwise:](#tc_cmbmhlfmrk) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -318,14 +318,14 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | FE2F | ︯ | COMBINING CYRILLIC TITLO RIGHT HALF | | -### [Modifier letter](#tc_mdltr) +### Modifier letter [:arrows_counterclockwise:](#tc_mdltr) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| | A67F | ꙿ | CYRILLIC PAYEROK | * may indicate an omitted yer | -### [Additions for Nivkh](#tc_nivkh) +### Additions for Nivkh [:arrows_counterclockwise:](#tc_nivkh) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -334,7 +334,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 04FE | Ӿ | CYRILLIC CAPITAL LETTER HA WITH STROKE | 04FF | ӿ | CYRILLIC SMALL LETTER HA WITH STROKE | | -### [Komi letters](#tc_komi) +### Komi letters [:arrows_counterclockwise:](#tc_komi) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -350,7 +350,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 052C | Ԭ | CYRILLIC CAPITAL LETTER DCHE | 052D | ԭ | CYRILLIC SMALL LETTER DCHE | | -### [Khanty letters](#tc_khanty) +### Khanty letters [:arrows_counterclockwise:](#tc_khanty) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -359,7 +359,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 052E | Ԯ | CYRILLIC CAPITAL LETTER EL WITH DESCENDER | 052F | ԯ | CYRILLIC SMALL LETTER EL WITH DESCENDER | | -### [Mordvin letters](#tc_mordvin) +### Mordvin letters [:arrows_counterclockwise:](#tc_mordvin) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -368,7 +368,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 0518 | Ԙ | CYRILLIC CAPITAL LETTER YAE | 0519 | ԙ | CYRILLIC SMALL LETTER YAE | Ligatures of Я and Е; я and е | -### [Kurdish letters](#tc_kurdish) +### Kurdish letters [:arrows_counterclockwise:](#tc_kurdish) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -376,14 +376,14 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 051C | Ԝ | CYRILLIC CAPITAL LETTER WE | 051D | ԝ | CYRILLIC SMALL LETTER WE | Based on the Latin letter W w | | | -### [Aleut letters](#tc_aleut) +### Aleut letters [:arrows_counterclockwise:](#tc_aleut) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| | 051E | Ԟ | CYRILLIC CAPITAL LETTER ALEUT KA | 051F | ԟ | CYRILLIC SMALL LETTER ALEUT KA | * used for [q] in Aleut | -### [Chuvash letters](#tc_chuvash) +### Chuvash letters [:arrows_counterclockwise:](#tc_chuvash) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| @@ -391,28 +391,28 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 0522 | Ԣ | CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK | 0523 | ԣ | CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK | = palatalized n | -### [Abkhazian letters](#tc_abkhazian) +### Abkhazian letters [:arrows_counterclockwise:](#tc_abkhazian) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| | 0524 | Ԥ | CYRILLIC CAPITAL LETTER PE WITH DESCENDER | 0525 | ԥ | CYRILLIC SMALL LETTER PE WITH DESCENDER | * used in modern Abkhaz orthography | -### [Azerbaijani letters](#tc_azerbaijani) +### Azerbaijani letters [:arrows_counterclockwise:](#tc_azerbaijani) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| | 0526 | Ԧ | CYRILLIC CAPITAL LETTER SHHA WITH DESCENDER | 0527 | ԧ | CYRILLIC SMALL LETTER SHHA WITH DESCENDER | -### [Orok letters](#tc_orok) +### Orok letters [:arrows_counterclockwise:](#tc_orok) | Code | Char | Name Canonical decomposition | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----|----|----|----| | 0528 | Ԩ | CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK | 0529 | ԩ | CYRILLIC SMALL LETTER EN WITH LEFT HOOK | -### [Historic letter variants](#tc_histltr) +### Historic letter variants [:arrows_counterclockwise:](#tc_histltr) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -427,7 +427,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 1C88 | ᲈ | CYRILLIC SMALL LETTER UNBLENDED UK | | -### [Miscellaneous characters](#tc_miscltr) +### Miscellaneous characters [:arrows_counterclockwise:](#tc_miscltr) | Code | Char | Name Canonical decomposition | Comment | | |----|----|----|----|----| @@ -436,7 +436,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 20DD | ⃝ | COMBINING ENCLOSING CIRCLE | = Cyrillic combining ten thousands sign; symbol for myriads | | -### [Letters for Old Abkhasian orthography](#tc_oldabhltr) +### Letters for Old Abkhasian orthography [:arrows_counterclockwise:](#tc_oldabhltr) | Code | Char | Name Canonical decomposition | Code |Char|Name Canonical decomposition|Comment| |----|----|----|----|----|----|----| @@ -454,14 +454,14 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | A696 | Ꚗ | CYRILLIC CAPITAL LETTER SHWE | A697 | ꚗ | CYRILLIC SMALL LETTER SHWE | -### [Intonation marks for Lithuanian dialectology](#tc_intmrklith) +### Intonation marks for Lithuanian dialectology [:arrows_counterclockwise:](#tc_intmrklith) | Code | Char | Name Canonical decomposition | Code |Char|Name Canonical decomposition|Comment| |----|----|----|----|----|----|----| | A69C | ꚜ | MODIFIER LETTER CYRILLIC HARD SIGN | A69D | ꚝ | MODIFIER LETTER CYRILLIC SOFT SIGN | | -### [Phonetic transcription](#tc_phontrs) +### Phonetic transcription [:arrows_counterclockwise:](#tc_phontrs) | Code | Char | Name Canonical decomposition | Comment | |----|----|----|----| @@ -530,7 +530,7 @@ As of Unicode version 15.1, Cyrillic script is encoded across several blocks: | 1E08F | 𞂏 | COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I | -### [Sources](#tc_src) +### Sources [:arrows_counterclockwise:](#tc_src) Wikipedia. [Cyrillic script in Unicode](https://en.wikipedia.org/wiki/Cyrillic_script_in_Unicode) Wikipedia. [List of Cyrillic letters](https://en.wikipedia.org/wiki/List_of_Cyrillic_letters) Wikipedia. [Cyrillic script](https://en.wikipedia.org/wiki/Cyrillic_script)