From 268eeee39191e4a51c1226f01d8280842c878be5 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 14 Feb 2025 00:06:03 +0100 Subject: [PATCH] Regenerate UCD again --- .../ucd/dev/auxiliary/GraphemeBreakTest.html | 68 +++++++++---------- .../ucd/dev/auxiliary/GraphemeBreakTest.txt | 14 ++-- 2 files changed, 41 insertions(+), 41 deletions(-) diff --git a/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.html b/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.html index 4a15a2af7..2e2a34ef7 100644 --- a/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.html +++ b/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.html @@ -7,7 +7,7 @@

Grapheme_Cluster_Break Chart

Unicode Version: 17.0.0

-

Date: 2025-02-13, 22:55:10 GMT

+

Date: 2025-02-13, 23:05:20 GMT

This page illustrates the application of the Grapheme_Cluster_Break specification. The material here is informative, not normative.

The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification.

Each cell shows the break-status for the position between the character(s) in its row header and the character(s) in its column header. The × symbol indicates no break, while the ÷ symbol indicates a break. The cells with × are also shaded to make it easier to scan the table. For example, in the cell at the intersection of the row headed by “CR” and the column headed by “LF”, there is a × symbol, indicating that there is no break between CR and LF.

After the heavy blue line in the table are additional rows, either with different sample characters or for sequences.

In the row and column headers of the Table, in the Rules, when hovering over characters in the Samples, and in the comments in the associated list of test cases GraphemeBreakTest.txt:

  1. The following sets are used:
      @@ -326,61 +326,61 @@

      Sample Strings

      37 -     -◌္   -   +     +◌္   +      ◌့   38 -     +     ◌်   -◌္   -   -◌္   -   +◌္   +   +◌္   +   39      ◌ᬁ   -   -   -   -   -   -   -   -   -   -   -   +   +   +   +   +   +   +   +   +   +   +   ◌ᬸ   40 -     -◌្   -   -◌្   -   +     +◌្   +   +◌្   +   ◌ី   41 -     -   -   -   +     +   +   +   42 -     -   -   -   -   +     +   +   +   +      diff --git a/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.txt b/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.txt index c3acbba71..83a08c153 100644 --- a/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.txt +++ b/unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.txt @@ -1,5 +1,5 @@ # GraphemeBreakTest-17.0.0.txt -# Date: 2025-02-13, 22:55:10 GMT +# Date: 2025-02-13, 23:05:20 GMT # © 2025 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -782,12 +782,12 @@ ÷ 003F × 094D ÷ 0924 ÷ # ÷ [0.2] QUESTION MARK (XXmLinkingConsonantmExtPict) × [9.0] DEVANAGARI SIGN VIRAMA (Extend_ConjunctLinker) ÷ [999.0] DEVANAGARI LETTER TA (LinkingConsonant) ÷ [0.3] ÷ 0915 × 094D × 094D × 0924 ÷ # ÷ [0.2] DEVANAGARI LETTER KA (LinkingConsonant) × [9.0] DEVANAGARI SIGN VIRAMA (Extend_ConjunctLinker) × [9.0] DEVANAGARI SIGN VIRAMA (Extend_ConjunctLinker) × [9.3] DEVANAGARI LETTER TA (LinkingConsonant) ÷ [0.3] ÷ 0AB8 × 0AFB × 0ACD × 0AB8 × 0AFB ÷ # ÷ [0.2] GUJARATI LETTER SA (LinkingConsonant) × [9.0] GUJARATI SIGN SHADDA (Extend_ConjunctExtendermConjunctLinker) × [9.0] GUJARATI SIGN VIRAMA (Extend_ConjunctLinker) × [9.3] GUJARATI LETTER SA (LinkingConsonant) × [9.0] GUJARATI SIGN SHADDA (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] -÷ 1019 × 1039 ÷ 1018 ÷ 102C × 1037 ÷ # ÷ [0.2] MYANMAR LETTER MA (XXmLinkingConsonantmExtPict) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] MYANMAR LETTER BHA (XXmLinkingConsonantmExtPict) ÷ [999.0] MYANMAR VOWEL SIGN AA (XXmLinkingConsonantmExtPict) × [9.0] MYANMAR SIGN DOT BELOW (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] -÷ 1004 × 103A × 1039 ÷ 1011 × 1039 ÷ 1011 ÷ # ÷ [0.2] MYANMAR LETTER NGA (XXmLinkingConsonantmExtPict) × [9.0] MYANMAR SIGN ASAT (Extend_ConjunctExtendermConjunctLinker) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] MYANMAR LETTER THA (XXmLinkingConsonantmExtPict) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] MYANMAR LETTER THA (XXmLinkingConsonantmExtPict) ÷ [0.3] -÷ 1B12 × 1B01 ÷ 1B32 × 1B44 ÷ 1B2F ÷ 1B32 × 1B44 ÷ 1B22 × 1B44 ÷ 1B2C ÷ 1B32 × 1B44 ÷ 1B22 × 1B38 ÷ # ÷ [0.2] BALINESE LETTER OKARA TEDUNG (XXmLinkingConsonantmExtPict) × [9.0] BALINESE SIGN ULU CANDRA (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER SA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER WA (XXmLinkingConsonantmExtPict) ÷ [999.0] BALINESE LETTER SA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER TA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER YA (XXmLinkingConsonantmExtPict) ÷ [999.0] BALINESE LETTER SA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER TA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE VOWEL SIGN SUKU (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] -÷ 179F × 17D2 ÷ 178F × 17D2 ÷ 179A × 17B8 ÷ # ÷ [0.2] KHMER LETTER SA (XXmLinkingConsonantmExtPict) × [9.0] KHMER SIGN COENG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] KHMER LETTER TA (XXmLinkingConsonantmExtPict) × [9.0] KHMER SIGN COENG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] KHMER LETTER RO (XXmLinkingConsonantmExtPict) × [9.0] KHMER VOWEL SIGN II (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] -÷ 1B26 ÷ 1B17 × 1B44 ÷ 1B13 ÷ # ÷ [0.2] BALINESE LETTER NA (XXmLinkingConsonantmExtPict) ÷ [999.0] BALINESE LETTER NGA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER KA (XXmLinkingConsonantmExtPict) ÷ [0.3] -÷ 1B27 ÷ 1B13 × 1B44 ÷ 1B0B ÷ 1B0B × 1B04 ÷ # ÷ [0.2] BALINESE LETTER PA (XXmLinkingConsonantmExtPict) ÷ [999.0] BALINESE LETTER KA (XXmLinkingConsonantmExtPict) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER RA REPA (XXmLinkingConsonantmExtPict) ÷ [999.0] BALINESE LETTER RA REPA (XXmLinkingConsonantmExtPict) × [9.1] BALINESE SIGN BISAH (SpacingMark) ÷ [0.3] +÷ 1019 × 1039 × 1018 ÷ 102C × 1037 ÷ # ÷ [0.2] MYANMAR LETTER MA (LinkingConsonant) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctLinker) × [9.3] MYANMAR LETTER BHA (LinkingConsonant) ÷ [999.0] MYANMAR VOWEL SIGN AA (XXmLinkingConsonantmExtPict) × [9.0] MYANMAR SIGN DOT BELOW (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] +÷ 1004 × 103A × 1039 × 1011 × 1039 × 1011 ÷ # ÷ [0.2] MYANMAR LETTER NGA (LinkingConsonant) × [9.0] MYANMAR SIGN ASAT (Extend_ConjunctExtendermConjunctLinker) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctLinker) × [9.3] MYANMAR LETTER THA (LinkingConsonant) × [9.0] MYANMAR SIGN VIRAMA (Extend_ConjunctLinker) × [9.3] MYANMAR LETTER THA (LinkingConsonant) ÷ [0.3] +÷ 1B12 × 1B01 ÷ 1B32 × 1B44 × 1B2F ÷ 1B32 × 1B44 × 1B22 × 1B44 × 1B2C ÷ 1B32 × 1B44 × 1B22 × 1B38 ÷ # ÷ [0.2] BALINESE LETTER OKARA TEDUNG (XXmLinkingConsonantmExtPict) × [9.0] BALINESE SIGN ULU CANDRA (Extend_ConjunctExtendermConjunctLinker) ÷ [999.0] BALINESE LETTER SA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER WA (LinkingConsonant) ÷ [999.0] BALINESE LETTER SA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER TA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER YA (LinkingConsonant) ÷ [999.0] BALINESE LETTER SA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER TA (LinkingConsonant) × [9.0] BALINESE VOWEL SIGN SUKU (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] +÷ 179F × 17D2 × 178F × 17D2 × 179A × 17B8 ÷ # ÷ [0.2] KHMER LETTER SA (LinkingConsonant) × [9.0] KHMER SIGN COENG (Extend_ConjunctLinker) × [9.3] KHMER LETTER TA (LinkingConsonant) × [9.0] KHMER SIGN COENG (Extend_ConjunctLinker) × [9.3] KHMER LETTER RO (LinkingConsonant) × [9.0] KHMER VOWEL SIGN II (Extend_ConjunctExtendermConjunctLinker) ÷ [0.3] +÷ 1B26 ÷ 1B17 × 1B44 × 1B13 ÷ # ÷ [0.2] BALINESE LETTER NA (LinkingConsonant) ÷ [999.0] BALINESE LETTER NGA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER KA (LinkingConsonant) ÷ [0.3] +÷ 1B27 ÷ 1B13 × 1B44 × 1B0B ÷ 1B0B × 1B04 ÷ # ÷ [0.2] BALINESE LETTER PA (LinkingConsonant) ÷ [999.0] BALINESE LETTER KA (LinkingConsonant) × [9.0] BALINESE ADEG ADEG (Extend_ConjunctLinker) × [9.3] BALINESE LETTER RA REPA (LinkingConsonant) ÷ [999.0] BALINESE LETTER RA REPA (LinkingConsonant) × [9.1] BALINESE SIGN BISAH (SpacingMark) ÷ [0.3] # # Lines: 764 #