/usr/share/unicode/auxiliary/GraphemeBreakTest.html is in unicode-data 10.0.0-3.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 | <!DOCTYPE HTML PUBLIC '-//W3C//DTD HTML 4.01 Transitional//EN' 'http://www.w3.org/TR/html4/loose.dtd'>
<html><head><meta http-equiv='Content-Type' content='text/html; charset=utf-8'>
<title>Grapheme Break Chart</title>
<style type='text/css'>
td, th { vertical-align: top }
</style></head>
<body bgcolor='#FFFFFF'>
<h2>Grapheme_Cluster_Break Chart</h2>
<p><b>Unicode Version:</b> 10.0.0</p>
<p><b>Date:</b> 2017-03-30, 17:12:51 GMT</p>
<p>This page illustrates the application of the Grapheme_Cluster_Break specification. The material here is informative, not normative.</p> <p>The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification.</p><p>Each cell shows the break-status for the position between the character(s) in its row header and the character(s) in its column header. The × symbol indicates no break, while the ÷ symbol indicated a break. The cells with × are also shaded to make it easier to scan the table. For example, in the cell at the intersection of the row headed by “CR” and the column headed by “LF”, there is a × symbol, indicating that there is no break between CR and LF.</p>
<p>After the heavy blue line in the table are additional rows, either with different sample characters or for sequences. Some column headers may be composed, reflecting “treat as” or “ignore” rules.</p>
<p>If your browser handles titles (tooltips), then hovering the mouse over the row header will show a sample character of that type. Hovering over a column header will show the sample character, plus its abbreviated general category and script. Hovering over the intersected cells shows the rule number that produces the break-status. For example, hovering over the cell at the intersection of LVT and T shows ×, with the rule 8.0. Checking below the table, rule 8.0 is “( LVT | T) × T”, which is the one that applies to that case. Note that a rule is invoked only when no lower-numbered rules have applied.</p>
<h3><a href='#table' name='table'>Table</a></h3>
<table border='1' cellspacing='0' width='100%'><tr><th width='5%'></th><th width='5%' class='lbclass' title='U+0020 SPACE, gc=Zs, sc=Zyyy'>Other</th><th width='5%' class='lbclass' title='U+000D <CARRIAGE RETURN (CR)>, gc=Cc, sc=Zyyy'>CR</th><th width='5%' class='lbclass' title='U+000A <LINE FEED (LF)>, gc=Cc, sc=Zyyy'>LF</th><th width='5%' class='lbclass' title='U+0001 <START OF HEADING>, gc=Cc, sc=Zyyy'>Control</th><th width='5%' class='lbclass' title='U+0300 COMBINING GRAVE ACCENT, gc=Mn, sc=Zinh'>Extend</th><th width='5%' class='lbclass' title='U+0600 ARABIC NUMBER SIGN, gc=Cf, sc=Arab'>Prepend</th><th width='5%' class='lbclass' title='U+0903 DEVANAGARI SIGN VISARGA, gc=Mc, sc=Deva'>SpacingMark</th><th width='5%' class='lbclass' title='U+1100 HANGUL CHOSEONG KIYEOK, gc=Lo, sc=Hang'>L</th><th width='5%' class='lbclass' title='U+1160 HANGUL JUNGSEONG FILLER, gc=Lo, sc=Hang'>V</th><th width='5%' class='lbclass' title='U+11A8 HANGUL JONGSEONG KIYEOK, gc=Lo, sc=Hang'>T</th><th width='5%' class='lbclass' title='U+AC00 HANGUL SYLLABLE GA, gc=Lo, sc=Hang'>LV</th><th width='5%' class='lbclass' title='U+AC01 HANGUL SYLLABLE GAG, gc=Lo, sc=Hang'>LVT</th><th width='5%' class='lbclass' title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A, gc=So, sc=Zyyy'>RI</th><th width='5%' class='lbclass' title='U+261D WHITE UP POINTING INDEX, gc=So, sc=Zyyy'>E_Base</th><th width='5%' class='lbclass' title='U+1F3FB EMOJI MODIFIER FITZPATRICK TYPE-1-2, gc=Sk, sc=Zyyy'>E_Modifier</th><th width='5%' class='lbclass' title='U+200D ZERO WIDTH JOINER, gc=Cf, sc=Zinh'>ZWJ</th><th width='5%' class='lbclass' title='U+2640 FEMALE SIGN, gc=So, sc=Zyyy'>Glue_After_Zwj</th><th width='5%' class='lbclass' title='U+1F466 BOY, gc=So, sc=Zyyy'>EBG</th></tr>
<tr><th class='lbclass' title='U+0020 SPACE'>Other</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+000D <CARRIAGE RETURN (CR)>'>CR</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='3.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+000A <LINE FEED (LF)>'>LF</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+0001 <START OF HEADING>'>Control</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+0300 COMBINING GRAVE ACCENT'>Extend</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+0600 ARABIC NUMBER SIGN'>Prepend</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.2' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0903 DEVANAGARI SIGN VISARGA'>SpacingMark</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+1100 HANGUL CHOSEONG KIYEOK'>L</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='6.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='6.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='6.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='6.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+1160 HANGUL JUNGSEONG FILLER'>V</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+11A8 HANGUL JONGSEONG KIYEOK'>T</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='8.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+AC00 HANGUL SYLLABLE GA'>LV</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+AC01 HANGUL SYLLABLE GAG'>LVT</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='8.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A'>RI</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='12.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+261D WHITE UP POINTING INDEX'>E_Base</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='10.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+1F3FB EMOJI MODIFIER FITZPATRICK TYPE-1-2'>E_Modifier</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+200D ZERO WIDTH JOINER'>ZWJ</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='11.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='11.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+2640 FEMALE SIGN'>Glue_After_Zwj</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+1F466 BOY'>EBG</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='10.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><td bgcolor='#0000FF' colSpan='19' style='font-size: 1px'> </td></tr>
<tr><th class='lbclass' title='U+0378 <reserved-0378>'>Other</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='5.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='9.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+D800 <surrogate-D800>'>Control</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th><th title='4.0' class='pairItem'>÷</th></tr>
</table>
<h3><a href='#rules' name='rules'>Rules</a></h3>
<p>This section shows the rules. They are mechanically modified for programmatic generation of the tables and test code, and thus do not match the UAX rules precisely. In particular:</p><ol><li>The rules are cast into a form that is more like regular expressions.</li><li>The rules “sot ÷”, “÷ eot”, and “÷ Any” are added mechanically, and have artificial numbers.</li><li>The rules are given decimal numbers using tenths, and are written without prefix. For example, rule GB9a is given the number 9.1.</li><li>Any “treat as” or “ignore” rules are handled as discussed in UAX #29, and thus reflected in a transformation of the rules usually not visible here. In addition, final rules like “Any ÷ Any” may be recast as the equivalent expression “÷ Any”.</li><li>In some cases, the numbering and form of a rule is changed due to “treat as” rules.</li></ol><p>For the original rules, see UAX #29.</p>
<table>
<tr><th style='text-align:right'><a href='#r0.2' name='r0.2'>0.2</a></th><td style='text-align:right'>sot </td><td>÷</td><td></td></tr>
<tr><th style='text-align:right'><a href='#r0.3' name='r0.3'>0.3</a></th><td style='text-align:right'></td><td>÷</td><td> eot</td></tr>
<tr><th style='text-align:right'><a href='#r3.0' name='r3.0'>3.0</a></th><td style='text-align:right'>CR </td><td>×</td><td> LF</td></tr>
<tr><th style='text-align:right'><a href='#r4.0' name='r4.0'>4.0</a></th><td style='text-align:right'>( Control | CR | LF ) </td><td>÷</td><td></td></tr>
<tr><th style='text-align:right'><a href='#r5.0' name='r5.0'>5.0</a></th><td style='text-align:right'></td><td>÷</td><td> ( Control | CR | LF )</td></tr>
<tr><th style='text-align:right'><a href='#r6.0' name='r6.0'>6.0</a></th><td style='text-align:right'>L </td><td>×</td><td> ( L | V | LV | LVT )</td></tr>
<tr><th style='text-align:right'><a href='#r7.0' name='r7.0'>7.0</a></th><td style='text-align:right'>( LV | V ) </td><td>×</td><td> ( V | T )</td></tr>
<tr><th style='text-align:right'><a href='#r8.0' name='r8.0'>8.0</a></th><td style='text-align:right'>( LVT | T) </td><td>×</td><td> T</td></tr>
<tr><th style='text-align:right'><a href='#r9.0' name='r9.0'>9.0</a></th><td style='text-align:right'></td><td>×</td><td> (Extend | ZWJ)</td></tr>
<tr><th style='text-align:right'><a href='#r9.1' name='r9.1'>9.1</a></th><td style='text-align:right'></td><td>×</td><td> SpacingMark</td></tr>
<tr><th style='text-align:right'><a href='#r9.2' name='r9.2'>9.2</a></th><td style='text-align:right'>Prepend </td><td>×</td><td></td></tr>
<tr><th style='text-align:right'><a href='#r10.0' name='r10.0'>10.0</a></th><td style='text-align:right'>(E_Base | EBG) Extend* </td><td>×</td><td> E_Modifier</td></tr>
<tr><th style='text-align:right'><a href='#r11.0' name='r11.0'>11.0</a></th><td style='text-align:right'>ZWJ </td><td>×</td><td> (Glue_After_Zwj | EBG)</td></tr>
<tr><th style='text-align:right'><a href='#r12.0' name='r12.0'>12.0</a></th><td style='text-align:right'>^ (RI RI)* RI </td><td>×</td><td> RI</td></tr>
<tr><th style='text-align:right'><a href='#r13.0' name='r13.0'>13.0</a></th><td style='text-align:right'>[^RI] (RI RI)* RI </td><td>×</td><td> RI</td></tr>
<tr><th style='text-align:right'><a href='#r999.0' name='r999.0'>999.0</a></th><td style='text-align:right'></td><td>÷</td><td> Any</td></tr>
</table>
<h3><a href='#samples' name='samples'>Sample Strings</a></h3>
<p>The following samples illustrate the application of the rules. The blue lines indicate possible break points. If your browser supports titles (tooltips), then positioning the mouse over each character will show its name, while positioning between characters shows the number of the rule responsible for the break-status.</p>
<table>
<tr><th style='text-align:right'><a href='#s1' name='s1'>1</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+000D <CARRIAGE RETURN (CR)> (CR)'>□</span><span title='3.0'><span> </span> </span><span title='U+000A <LINE FEED (LF)> (LF)'>□</span><span title='4.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='5.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+000A <LINE FEED (LF)> (LF)'>□</span><span title='4.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0308 COMBINING DIAERESIS (Extend)'>◌̈</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s2' name='s2'>2</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='9.0'><span> </span> </span><span title='U+0308 COMBINING DIAERESIS (Extend)'>◌̈</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s3' name='s3'>3</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0020 SPACE (Other)'> </span><span title='9.0'><span> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0646 ARABIC LETTER NOON (Other)'>ن</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s4' name='s4'>4</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0646 ARABIC LETTER NOON (Other)'>ن</span><span title='9.0'><span> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0020 SPACE (Other)'> </span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s5' name='s5'>5</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1100 HANGUL CHOSEONG KIYEOK (L)'>ᄀ</span><span title='6.0'><span> </span> </span><span title='U+1100 HANGUL CHOSEONG KIYEOK (L)'>ᄀ</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s6' name='s6'>6</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+AC00 HANGUL SYLLABLE GA (LV)'>가</span><span title='7.0'><span> </span> </span><span title='U+11A8 HANGUL JONGSEONG KIYEOK (T)'>ᆨ</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1100 HANGUL CHOSEONG KIYEOK (L)'>ᄀ</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s7' name='s7'>7</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+AC01 HANGUL SYLLABLE GAG (LVT)'>각</span><span title='8.0'><span> </span> </span><span title='U+11A8 HANGUL JONGSEONG KIYEOK (T)'>ᆨ</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1100 HANGUL CHOSEONG KIYEOK (L)'>ᄀ</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s8' name='s8'>8</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A (RI)'>🇦</span><span title='12.0'><span> </span> </span><span title='U+1F1E7 REGIONAL INDICATOR SYMBOL LETTER B (RI)'>🇧</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E8 REGIONAL INDICATOR SYMBOL LETTER C (RI)'>🇨</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s9' name='s9'>9</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A (RI)'>🇦</span><span title='13.0'><span> </span> </span><span title='U+1F1E7 REGIONAL INDICATOR SYMBOL LETTER B (RI)'>🇧</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E8 REGIONAL INDICATOR SYMBOL LETTER C (RI)'>🇨</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s10' name='s10'>10</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A (RI)'>🇦</span><span title='13.0'><span> </span> </span><span title='U+1F1E7 REGIONAL INDICATOR SYMBOL LETTER B (RI)'>🇧</span><span title='9.0'><span> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E8 REGIONAL INDICATOR SYMBOL LETTER C (RI)'>🇨</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s11' name='s11'>11</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A (RI)'>🇦</span><span title='9.0'><span> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E7 REGIONAL INDICATOR SYMBOL LETTER B (RI)'>🇧</span><span title='13.0'><span> </span> </span><span title='U+1F1E8 REGIONAL INDICATOR SYMBOL LETTER C (RI)'>🇨</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s12' name='s12'>12</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E6 REGIONAL INDICATOR SYMBOL LETTER A (RI)'>🇦</span><span title='13.0'><span> </span> </span><span title='U+1F1E7 REGIONAL INDICATOR SYMBOL LETTER B (RI)'>🇧</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F1E8 REGIONAL INDICATOR SYMBOL LETTER C (RI)'>🇨</span><span title='13.0'><span> </span> </span><span title='U+1F1E9 REGIONAL INDICATOR SYMBOL LETTER D (RI)'>🇩</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s13' name='s13'>13</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='9.0'><span> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s14' name='s14'>14</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='9.0'><span> </span> </span><span title='U+0308 COMBINING DIAERESIS (Extend)'>◌̈</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s15' name='s15'>15</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='9.1'><span> </span> </span><span title='U+0903 DEVANAGARI SIGN VISARGA (SpacingMark)'>ः</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s16' name='s16'>16</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (Other)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0600 ARABIC NUMBER SIGN (Prepend)'>□</span><span title='9.2'><span> </span> </span><span title='U+0062 LATIN SMALL LETTER B (Other)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s17' name='s17'>17</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+261D WHITE UP POINTING INDEX (E_Base)'>☝</span><span title='10.0'><span> </span> </span><span title='U+1F3FB EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier)'>🏻</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+261D WHITE UP POINTING INDEX (E_Base)'>☝</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s18' name='s18'>18</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F466 BOY (EBG)'>👦</span><span title='10.0'><span> </span> </span><span title='U+1F3FB EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier)'>🏻</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s19' name='s19'>19</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='11.0'><span> </span> </span><span title='U+1F466 BOY (EBG)'>👦</span><span title='10.0'><span> </span> </span><span title='U+1F3FB EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier)'>🏻</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s20' name='s20'>20</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='11.0'><span> </span> </span><span title='U+2640 FEMALE SIGN (Glue_After_Zwj)'>♀</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s21' name='s21'>21</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+200D ZERO WIDTH JOINER (ZWJ)'>□</span><span title='11.0'><span> </span> </span><span title='U+1F466 BOY (EBG)'>👦</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
<tr><th style='text-align:right'><a href='#s22' name='s22'>22</a></th><td><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F466 BOY (EBG)'>👦</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+1F466 BOY (EBG)'>👦</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></td></tr>
</table>
<hr width='50%'>
<div align='center'>
<center>
<table cellspacing='0' cellpadding='0' border='0'>
<tr>
<td><a href='http://www.unicode.org/unicode/copyright.html'>
<img src='http://www.unicode.org/img/hb_notice.gif' border='0' alt='Access to Copyright and terms of use' width='216' height='50'></a></td>
</tr>
</table>
<script language='Javascript' type='text/javascript' src='http://www.unicode.org/webscripts/lastModified.js'>
</script>
</center>
</div>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
|