This file is indexed.

/usr/share/doc/HOWTO/ja-html/Unicode-HOWTO-2.html is in doc-linux-ja-html 2006.05.25-1.1.

This file is owned by root:root, with mode 0o644.

The actual contents of the file can be viewed below.

  1
  2
  3
  4
  5
  6
  7
  8
  9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
 30
 31
 32
 33
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
 <META NAME="GENERATOR" CONTENT="SGML-Tools 1.0.9">
 <TITLE>The Unicode HOWTO: ¥¤¥ó¥È¥í¥À¥¯¥·¥ç¥ó</TITLE>
 <LINK HREF="Unicode-HOWTO-3.html" REL=next>
 <LINK HREF="Unicode-HOWTO-1.html" REL=previous>
 <LINK HREF="Unicode-HOWTO.html#toc2" REL=contents>
</HEAD>
<BODY>
<A HREF="Unicode-HOWTO-3.html">¼¡¤Î¥Ú¡¼¥¸</A>
<A HREF="Unicode-HOWTO-1.html">Á°¤Î¥Ú¡¼¥¸</A>
<A HREF="Unicode-HOWTO.html#toc2">Ìܼ¡¤Ø</A>
<HR>
<H2><A NAME="s2">2. ¥¤¥ó¥È¥í¥À¥¯¥·¥ç¥ó</A></H2>

<P>
<P>
<H2><A NAME="ss2.1">2.1 ¤Ê¤¼ Unicode ¤ò»È¤¦¤Î¤Ç¤¹¤«¡©</A>
</H2>

<P>
<P>°Û¤Ê¤Ã¤¿¹ñ¤Î¿Í¡¹¤Ï¡¢¤½¤ì¤¾¤ì¤ÎÊì¹ñ¸ì¤Îñ¸ì¤òɽ¸½¤¹¤ë¤Î¤Ë°Û¤Ê¤Ã¤¿Ê¸»ú¤ò»È
ÍѤ·¤Æ¤¤¤Þ¤¹¡£¸½ºß¤Ç¤Ï email ¥·¥¹¥Æ¥à¤ä web ¥Ö¥é¥¦¥¶¤Ê¤É¡¢¤Û¤È¤ó¤É¤Î¥¢¥×
¥ê¥±¡¼¥·¥ç¥ó¤Ï 8 ¥Ó¥Ã¥È¥¯¥ê¡¼¥ó¤Ç¤¹¡£¤Ä¤Þ¤ê ISO-8859-1 ¤Î¤è¤¦¤Ê 8 ¥Ó¥Ã¥È
ʸ»ú¥»¥Ã¥È¤Çɽ¸½¤µ¤ì¤ë¥Æ¥­¥¹¥È¤Î¼è¤ê°·¤¤¤äɽ¼¨¤òÀµ¤·¤¯¹Ô¤¨¤ë¤È¤¤¤¦¤³¤È¤Ç
¤¹¡£
<P>À¤³¦¤Ë¤Ï 256 ¤è¤ê¤âÍÚ¤«¤Ë¿¤¯¤Îʸ»ú¤¬¤¢¤ê¤Þ¤¹¡£Î㤨¤Ð¥­¥ê¥ëʸ»ú¡¢¥Ø¥Ö¥é
¥¤¸ì¡¢¥¢¥é¥Ó¥¢¸ì¡¢Ãæ¹ñ¸ì¡¢ÆüËܸ졢´Ú¹ñ¸ì¡¢¥¿¥¤¸ì¤Ê¤É¤Ç¤¹¡£¿·¤·¤¤Ê¸»ú¤â»þ¡¹
ºî¤é¤ì¤Æ¤¤¤Þ¤¹¡£ÍøÍѼԤËÌäÂê¤Ë¤Ê¤Ã¤Æ¤¯¤ë¤³¤È¤Ë¤Ï¼¡¤Î¤è¤¦¤Ê¤â¤Î¤¬¤¢¤ê¤Þ¤¹¡£
<P>
<UL>
<LI>°ì¤Ä¤Îʸ½ñ¤ÎÃæ¤Ë°Û¤Ê¤ëʸ»ú¥»¥Ã¥È¤Îʸ»ú¤òº®ºß¤µ¤»¤ë¤³¤È¤¬¤Ç¤­¤Ê¤¤¾ì¹ç¡£Îã
¤ò¤¢¤²¤ë¤È TeX, xdvi, PostScript ¤ò»È¤Ã¤Æ¤¤¤ë¾ì¹ç¤Ë¤Ï¡¢¥É¥¤¥Ä¸ì¤ä¥Õ¥é¥ó
¥¹¸ì¤Îʸ½ñ¤Ç¥í¥·¥¢¸ì¤Ç¤Î°úÍѤò¤¹¤ë¤³¤È¤¬¤Ç¤­¤Þ¤¹¤¬¡¢¤¿¤À¤Î¥Æ¥­¥¹¥È¥Õ¥¡¥¤
¥ë¤Ç¤Ï̵Íý¤Ç¤¹¡£
</LI>
<LI>¤½¤ì¤¾¤ì¤Îʸ½ñ¤¬¸ÇÍ­¤Îʸ»ú¥»¥Ã¥È¤ò»ý¤Á¡¢Ê¸»ú¥»¥Ã¥È¤Îǧ¼±¤¬¼«Æ°¤Ç¤Ê¤±¤ì¤Ð¡¢
¥æ¡¼¥¶¡¼¤¬²ðºß¤·¤Æ¤³¤ì¤ò¼êÆ°¤Ç¹Ô¤ï¤Ê¤±¤ì¤Ð¤Ê¤ê¤Þ¤»¤ó¡£Î㤨¤Ð XTeamLinux
distribution ¤Î¥Û¡¼¥à¥Ú¡¼¥¸ 
<A HREF="http://www.xteamlinux.com.cn/">http://www.xteamlinux.com.cn/</A> ¤ò¸«¤ë¤¿¤á¤Ë¤Ï¡¢Netscape ¤Ë¤½¤Î 
web ¥Ú¡¼¥¸¤Ï GB2312 ¥³¡¼¥É¤Ç¤¢¤ë¤È»Ø¼¨¤¹¤ëɬÍפ¬¤¢¤ê¤Þ¤¹¡£

</LI>
<LI>¥æ¡¼¥í¤Î¤è¤¦¤Ê¿·¤·¤¤¥·¥ó¥Ü¥ë¤âÀ¸¤ß½Ð¤µ¤ì¤Æ¤¤¤Þ¤¹¡£ISO ¤Ï¿·¤·¤¤É¸½à 
ISO-8859-15 ¤òȯɽ(issue)¤·¤Þ¤·¤¿¡£¤³¤ì¤Ï¤Û¤È¤ó¤É ISO-8859-1 ¤ÈƱ¤¸¤Ç¤¹
¤¬¡¢¤Û¤È¤ó¤É»È¤ï¤ì¤Ê¤¤Ê¸»ú(¸Å¤¤Ä̲ߤΥޡ¼¥¯)¤ò¼è¤ê½ü¤¤¤Æ¡¢¥æ¡¼¥í¤Î¥Þ¡¼¥¯
¤ÈÃÖ¤­´¹¤¨¤Þ¤·¤¿¡£¥æ¡¼¥¶¡¼¤¬¤³¤Îɸ½à¤ò»ÈÍѤ¹¤ë¤³¤È¤Ë¤·¤¿¾ì¹ç¡¢¥Ç¥£¥¹¥¯Æâ
¤Ë°ã¤Ã¤¿Ê¸»ú¥»¥Ã¥È¤Îʸ½ñ¤ò»ý¤Ä¤³¤È¤Ë¤Ê¤ê¤Þ¤¹¡£¤Ä¤Þ¤ê¡¢Ê¸»ú¥»¥Ã¥È¤Î¤³¤È¤ò
¾ï¤Ë¹Íθ¤¹¤ëÆü¡¹¤Î¤Ï¤¸¤Þ¤ê¤È¤¤¤¦¤³¤È¤Ç¤¹¡£¤Ç¤¹¤¬¥³¥ó¥Ô¥å¡¼¥¿¤Ïʪ»ö¤ò¥·¥ó
¥×¥ë¤Ë¤¹¤ë¤¿¤á¤Î¤â¤Î¤Ç¤¢¤ê¡¢¤è¤êÊ£»¨¤Ë¤¹¤ë¤â¤Î¤Ç¤Ï¤¢¤ê¤Þ¤»¤ó¡£</LI>
</UL>
<P>¤³¤ÎÌäÂê¤ò²ò·è¤¹¤ë¤Ë¤Ï¡¢¥ï¡¼¥ë¥É¥ï¥¤¥É¤Ë»ÈÍѤǤ­¤ëʸ»ú¥»¥Ã¥È¤ò»È¤¦¤³¤È¤Ç
¤¹¡£¤½¤Îʸ»ú¥»¥Ã¥È¤È¤Ï Unicode 
<A HREF="http://www.unicode.org/">http://www.unicode.org/</A> ¤Î¤³¤È¤Ç¤¹¡£Unicode ¤Ë´Ø¤¹¤ë¾ÜºÙ¤Ï 
`<CODE>man 7 unicode</CODE>' ¤ò¼Â¹Ô¤·¤Æ¤¯¤À¤µ¤¤¡£(manpage ¤Ï man-pages-1.20 ¥Ñ¥Ã
¥±¡¼¥¸¤Ë´Þ¤Þ¤ì¤Æ¤¤¤Þ¤¹)
<P>
<H2><A NAME="ss2.2">2.2 Unicode ¤Î¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°</A>
</H2>

<P>
<P>Unicode ¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤ò»È¤¦¤È¡¢Ê¸»ú¥»¥Ã¥È¤ò°·¤¦¥æ¡¼¥¶¡¼¥×¥í¥°¥é¥à¤ÎÌä
Âê¤Ï¡¢¡Ö¤É¤¦¤ä¤Ã¤Æ 1 ¥ª¥¯¥Æ¥Ã¥È(8 ¥Ó¥Ã¥È)¤Ç Unicode ʸ»ú¤òÁ÷¤ë¤Î¤«¡×¤È¤¤
¤¦µ»½ÑŪ¤ÊÌäÂê¤À¤±¤Ë¤Ê¤ê¤Þ¤¹¡£8 ¥Ó¥Ã¥È¤È¤¤¤¦Ã±°Ì¤Ï¡¢Â¿¤¯¤Î¥³¥ó¥Ô¥å¡¼¥¿¤Ç¡¢
¥¢¥É¥ì¥¹¤òɽ¸½¤¹¤ëºÇ¾®Ã±°Ì¤Ç¤¹¡£¤Þ¤¿¤³¤Î 8 ¥Ó¥Ã¥È¤È¤¤¤¦Ã±°Ì¤Ï¡¢TCP/IP ¥Í¥Ã
¥È¥ï¡¼¥¯¤Ç¤Î¥³¥Í¥¯¥·¥ç¥ó¤Ë¤â»ÈÍѤµ¤ì¤Æ¤¤¤Þ¤¹¡£1 ʸ»ú¤òɽ¸½¤¹¤ë¤Î¤Ë 1 ¥Ð
¥¤¥È¤ò»ÈÍѤ¹¤ë¤È¤¤¤¦¤Î¤ÏÎò»ËŪ¤Ê¶öÁ³¤Ç¤¢¤ê¡¢¤³¤ì¤Ï¥³¥ó¥Ô¥å¡¼¥¿¤Î³«È¯¤¬¥è¡¼
¥í¥Ã¥Ñ¤È¥¢¥á¥ê¥«¤Ç»Ï¤Þ¤Ã¤¿¤³¤È¤Ë¤è¤ê¤Þ¤¹¡£¤³¤ì¤é¤Î¹ñ¡¹¤Ç¤ÏŤ¤´Ö¡¢96 ¼ï
Îà¤Îʸ»ú¤Ç½¼Ê¬¤È¤µ¤ì¤Æ¤­¤Þ¤·¤¿¡£
<P>
<P>Unicode ʸ»ú¤ò¥Ð¥¤¥È¤Ç¥¨¥ó¥³¡¼¥É¤¹¤ëÊýË¡¤Ë¤Ï¡¢Ä̾ï 4 ¼ïÎढ¤ê¤Þ¤¹¡£
<P>
<DL>
<DT><B>UTF-8</B><DD><P>128 ʸ»ú¤¬ 1 ¥Ð¥¤¥È¤Ç¥¨¥ó¥³¡¼¥É¤µ¤ì¤Þ¤¹(ASCII ʸ»ú)¡£1920 ʸ»ú¤¬ 2 ¥Ð¥¤
¥È¤Ç¥¨¥ó¥³¡¼¥É¤µ¤ì¤Þ¤¹(¥í¡¼¥Þ»ú¡¢¥®¥ê¥·¥ãʸ»ú¡¢¥­¥ê¥ëʸ»ú¡¢¥³¥×¥È¸ì¡¢¥¢
¥ë¥á¥Ë¥¢¸ì¡¢¥Ø¥Ö¥é¥¤¸ì¡¢¥¢¥é¥Ó¥¢¸ì¤Îʸ»ú)¡£63488 ʸ»ú¤¬ 3¥Ð¥¤¥È¤Ç¥¨¥ó¥³¡¼
¥É¤µ¤ì¤Þ¤¹(Ãæ¹ñ¸ì¤äÆüËܸì¤Ê¤É)¡£»Ä¤ê¤Î 2147418112 ʸ»ú¤Ï 4 ¡Á 6 ¥Ð¥¤¥È¤ò
»È¤Ã¤Æ¥¨¥ó¥³¡¼¥É¤¹¤ë¤³¤È¤¬¤Ç¤­¤Þ¤¹(¤Þ¤À³ä¤êÅö¤Æ¤é¤ì¤Æ¤¤¤Þ¤»¤ó)¡£UTF-8 ¤Ë
´Ø¤¹¤ë¾ÜºÙ¤Ï `<CODE>man 7 utf-8</CODE>' ¤ò¼Â¹Ô¤·¤Æ¤¯¤À¤µ¤¤¡£(manpage ¤Ï 
ldpman-1.20 ¥Ñ¥Ã¥±¡¼¥¸¤Ë´Þ¤Þ¤ì¤Æ¤¤¤Þ¤¹)
<P>
<DT><B>UCS-2</B><DD><P>Á´¤Æ¤Îʸ»ú¤Ï 2 ¥Ð¥¤¥È¤Çɽ¸½¤µ¤ì¤Þ¤¹¡£¤³¤Î¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤Ç¤Ï Unicode¤Î
»Ï¤á¤Î 65536 ʸ»ú¤À¤±¤òɽ¸½¤Ç¤­¤Þ¤¹¡£
<P>
<DT><B>UTF-16</B><DD><P>¤³¤ì¤Ï UCS-2 ¤Î³ÈÄ¥¤Ç 1112064 ¤Î Unicode ʸ»ú¤òɽ¸½¤¹¤ë¤³¤È¤¬¤Ç¤­¤Þ¤¹¡£
Unicode ¤Î»Ï¤á¤Î 65536 ʸ»ú¤Ï 2 ¥Ð¥¤¥È¤Ç¡¢»Ä¤ê¤Ï 4 ¥Ð¥¤¥È¤Çɽ¸½¤µ¤ì¤Þ¤¹¡£
<P>
<DT><B>UCS-4</B><DD><P>Á´¤Æ¤Îʸ»ú¤Ï 4 ¥Ð¥¤¥È¤Çɽ¸½¤µ¤ì¤Þ¤¹¡£
<P>
</DL>
<P>¥Æ¥­¥¹¥È¤ò¥¨¥ó¥³¡¼¥É¤¹¤ë¤Î¤ËɬÍפȤʤëÍÆÎÌ(¥è¡¼¥í¥Ã¥Ñ¤Î¸À¸ì¤Ç¤Ï 1 ʸ»ú¤¢
¤¿¤ê 8¥Ó¥Ã¥È¤Ç¡¢Ãæ¹ñ¸ì¡¢ÆüËܸ졢´Ú¹ñ¸ì¤Ç¤Ï¤è¤ê¿¤¯¤Î¥Ó¥Ã¥È¿ô)¤ò¡¢¸½ºß»È
ÍѤµ¤ì¤Æ¤¤¤ë¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤ÈÈæ¤Ù¤¿¤â¤Î¤¬°Ê²¼¤Ë¤Ê¤ê¤Þ¤¹¡£¤³¤ì¤Ï¥Ç¥£¥¹¥¯
¤Ç»ÈÍѤ¹¤ëÍÆÎ̤䡢¥Í¥Ã¥È¥ï¡¼¥¯¤Ç¤Î¥À¥¦¥ó¥í¡¼¥É®Å٤˱ƶÁ¤·¤Þ¤¹¡Ê°µ½Ì¤ò¤·
¤Æ¤¤¤Ê¤¤¾ì¹ç¡Ë¡£
<P>
<DL>
<DT><B>UTF-8</B><DD><P>US ASCII ¤Ê¤éÊѲ½¤Ê¤·¡¢ISO-8859-1 ¤Ê¤é¿ô¥Ñ¡¼¥»¥ó¥ÈÁý¤¨¡¢Ãæ¹ñ¸ì¡¢ÆüËܸ졢
´Ú¹ñ¸ì¤Ç¤Ï 1.5 ÇÜ¡¢¥®¥ê¥·¥ãʸ»ú¤ä¥­¥ê¥ëʸ»ú¤Ç¤Ï 2 Çܤˤʤê¤Þ¤¹¡£
<P>
<DT><B>UCS-2 ¤ª¤è¤Ó UTF-16</B><DD><P>Ãæ¹ñ¸ì¡¢ÆüËܸ졢´Ú¹ñ¸ì¤Ç¤ÏÊѲ½¤Ê¤·¡¢ASCII¡¢ISO-8859-1¡¢¥®¥ê¥·¥ãʸ»ú¡¢
¥­¥ê¥ëʸ»ú¤Ç¤Ï 2 Çܤˤʤê¤Þ¤¹¡£
<P>
<DT><B>UCS-4</B><DD><P>Ãæ¹ñ¸ì¡¢ÆüËܸ졢´Ú¹ñ¸ì¤Ç¤Ï 2ÇÜ¡¢ASCII¡¢ISO-8859-1¡¢¥®¥ê¥·¥ãʸ»ú¡¢¥­¥ê¥ë
ʸ»ú¤Ç¤Ï 3 Çܤˤʤê¤Þ¤¹¡£
</DL>
<P>UCS-2, UTF-16, UCS-4 ¤Ç US ¤ä¥è¡¼¥í¥Ã¥Ñ¤Îʸ½ñ¤ò½ñ¤¯¾ì¹ç¤Ë¤ÏASCII ¤ä 
ISO-8859-1 ¤Ç½ñ¤¤¤¿¤È¤­¤è¤ê¤â¥µ¥¤¥º¤¬Â礭¤¯¤Ê¤ë¤³¤È¤¬¤¢¤ë¤¿¤á¡¢¤½¤ì¤é¤Î
¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤¬¹­¤¯»È¤ï¤ì¤ë¤³¤È¤Ï¤Ê¤µ¤½¤¦¤Ç¤¹¡£ Microsoft ¤Î Win32
API ¤Ï UCS-2 ¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤ò(¾¯¤Ê¤¯¤È¤â) 1995 ǯ¤«¤é¥µ¥Ý¡¼¥È¤·¤Æ¤¤¤Þ
¤¹¤¬¡¢UCS-2¤Ïʸ½ñ¤òµ­½Ò¤¹¤ë¤Î¤Ë¹­¤¯»È¤ï¤ì¤Æ¤Ï¤¤¤Þ¤»¤ó¡£ÆüËܤǤϥ·¥Õ¥È 
JIS ¤¬¤¤¤Þ¤À°ìÈÌŪ¤Ç¤¹¡£
<P>°ìÊý¡¢US ¤ä¥è¡¼¥í¥Ã¥Ñ¤ÎÍøÍѼԤˤϥڥʥë¥Æ¥£¤¬¤Ê¤¯¡¢¤Þ¤¿Â¿¤¯¤Î¥Æ¥­¥¹¥ÈÁà
ºî¤ò¹Ô¤¦¥×¥í¥°¥é¥à¤Ï UTF-8 ¥µ¥Ý¡¼¥È¤Î¤¿¤á¤ÎÊѹ¹¤¬É¬Íפʤ¤¤Î¤Ç¡¢UTF-8 ¤Ï
¹­¤¯»È¤ï¤ì¤ë²ÄǽÀ­¤¬¤¢¤ê¤Þ¤¹¡£
<P>¤³¤ì¤«¤é¡¢¥Æ¥­¥¹¥È¤Î¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤È¤·¤Æ UTF-8 ¤ò»È¤¦¤è¤¦¤Ë Linux ¥·¥¹
¥Æ¥à¤òÊѹ¹¤¹¤ëÊýË¡¤Ë¤Ä¤¤¤ÆÀâÌÀ¤·¤Æ¤¤¤­¤Þ¤¹¡£
<P>
<H3>C/C++ ³«È¯¼Ô¤Ø¤ÎÊä­ÀâÌÀ</H3>

<P>Microsoft ¤¬ Win32 API ¤Ç¼è¤Ã¤Æ¤¤¤ë¥¢¥×¥í¡¼¥Á¤Ç¤Ï¡¢³«È¯¼Ô¤¬ Unicode ÈǤÎ
¥×¥í¥°¥é¥à¤òºîÀ®¤¹¤ë¤³¤È¤Ï´Êñ¤Ç¤¹¡£"#define UNICODE" ¤ò¥×¥í¥°¥é¥à¤Î
ÀèƬ¤ÇÀë¸À¤·¤Æ¡¢¥³¥ó¥Ñ¥¤¥ë¥¨¥é¡¼¤¬¤Ê¤¯¤Ê¤ë¤Þ¤Ç `<CODE>char</CODE>' ¤ò 
`<CODE>TCHAR</CODE>' ¤ØÊѹ¹¤·¤Þ¤¹¡£¤³¤ÎÊýË¡¤ÎÌäÂê¤Ï¡¢ºÇ½ªÅª¤Ë 2 ¤Ä¤Î¥Ð¡¼¥¸¥ç
¥ó¤Î¥×¥í¥°¥é¥à¤¬¤Ç¤­¤Æ¤·¤Þ¤¦¤³¤È¤Ç¤¹¡£1 ¤Ä¤Ï UCS-2 ¤Î¥Æ¥­¥¹¥È¤ò°·¤¨¤Þ¤¹
¤¬¡¢8 ¥Ó¥Ã¥È¤Î¥¨¥ó¥³¡¼¥Ç¥£¥ó¥°¤ÏÂÌÌܤǤ¹¡£¤â¤¦ 1 ¤Ä¤ÏµìÍè¤Î 8 ¥Ó¥Ã¥È¥¨¥ó
¥³¡¼¥Ç¥£¥ó¥°¤·¤«°·¤¨¤Þ¤»¤ó¡£
<P>¤µ¤é¤Ë UCS-2 ¤È UCS-4 ¤Ë¤Ï¥¨¥ó¥Ç¥£¥¢¥ó¤ÎÌäÂ꤬¤¢¤ê¤Þ¤¹¡£The Internet
Assigned Numbers Authority (IANA) character set registry 
<A HREF="http://www.isi.edu/in-notes/iana/assignments/character-sets">http://www.isi.edu/in-notes/iana/assignments/character-sets</A> ¤Ï 
ISO-10646-UCS-2 ¤Ë¤Ä¤¤¤Æ¤³¤Î¤è¤¦¤Ë½Ò¤Ù¤Æ¤¤¤Þ¤¹¡§
<BLOCKQUOTE>
¡Ö¤³¤ì¤Ë¤Ï¥Í¥Ã¥È
¥ï¡¼¥¯¥Ð¥¤¥È¥ª¡¼¥À¡¼¤ò»ØÄꤹ¤ëɬÍפ¬¤¢¤ë: ɸ½à¤ÏÄê¤á¤é¤ì¤Æ¤¤¤Ê¤¤¡×
</BLOCKQUOTE>
 ¥Í¥Ã¥È¥ï¡¼¥¯¥Ð¥¤¥È¥ª¡¼¥À¡¼¤Ï¥Ó¥Ã¥°¥¨¥ó¥Ç¥£¥¢¥ó¤Ç¤¹¡£¤Þ¤¿ RFC
2152 ¤Ë¡¢¤è¤êÌÀ³Î¤Ëµ­½Ò¤µ¤ì¤Æ¤¤¤Þ¤¹¡§¡ÖISO/IEC 10646-1:1993(E) ¤Ë¤ÏUCS-2 
¤Îʸ»ú¤¬¥ª¥¯¥Æ¥Ã¥È¤Çɽ¸½¤µ¤ì¤ë»þ¤Ë¤Ï¡¢ºÇ¤âÂ礭¤¤¥ª¥¯¥Æ¥Ã¥È¤¬»Ï¤á¤ËÍè¤ë¤È
¼¨¤µ¤ì¤Æ¤¤¤Þ¤¹¡×¤È¤³¤í¤¬ Microsoft¤Ï¼«¼Ò¤Î C/C++ ³«È¯¥Ä¡¼¥ë¤Ç¤Ï¥Þ¥·¥ó°Í
¸¤Î¥¨¥ó¥Ç¥£¥¢¥ó(¤Ä¤Þ¤ê intel x86 ·Ï¤Î¥×¥í¥»¥Ã¥µ¤Ç¤Ï¥ê¥È¥ë¥¨¥ó¥Ç¥£¥¢¥ó)
¤ò»ÈÍѤ¹¤ë¤³¤È¤È¡¢¥É¥­¥å¥á¥ó¥È¤Î»Ï¤á¤Ë¥Ð¥¤¥È¥ª¡¼¥À¡¼¤Î¥Þ¡¼¥¯¤â¤·¤¯¤ÏÅý·×
Ū¸¡½ÐË¡(statistical heuristics)¤ò»ÈÍѤ¹¤ë¤³¤È¤ò¿ä¾©¤·¤Æ¤¤¤Þ¤¹¡£
<P>(ÌõÃí¡§ heuristics ¤È¤ÏÎ㤨¤Ð¡¢¥Ð¥¤¥È¥ª¡¼¥À¤¬Æþ¤ìÂؤï¤Ã¤Æ¤¤¤ë¤è¤¦¤Ê¾õÂÖ
¤Ç¤Ï²¿¤Î¼ê¤¬¤«¤ê¤â¤Ê¤¤¤È¡¢¤É¤ó¤Ê¥­¥ã¥é¥¯¥¿¥»¥Ã¥È¤«¤ï¤«¤é¤Ê¤¤¡£¤Ç¤âÎ㤨¤Ð
ÆüËܸì¤Îʸ¾Ï¤Î¾ì¹ç¤Ë¤ÏÅý·×Ū¤Ë¡¢'¡¢' ¤ä `¡£' ¤Ê¤É¤Ï¤½¤ì¤Ê¤ê¤ÎÉÑÅ٤Ǹ½¤ì
¤ë¤Èͽ¬¤µ¤ì¤ë¤Î¤Ç¡¢¤â¤·¤½¤¦¤Ê¤éÆüËܸ줸¤ã¤Ê¤¤¤«¤ÈȽÃǤ¹¤ë¤è¤¦¤Ê¤³¤È¤Ç¤¹)
<P>¤½¤ì¤ËÂФ·¤Æ UTF-8 ¤Î¥¢¥×¥í¡¼¥Á¤Ç¤Ï¡¢`<CODE>char*</CODE>' ¤ò C ¤Îɸ½à¤Îʸ»ú
Î󷿤ΤޤޤȤ·¤Æ¤¤¤Þ¤¹¡£·ë²Ì¤È¤·¤Æ¥×¥í¥°¥é¥à¤Ï ASCII ¥Æ¥­¥¹¥È¤ò´Ä¶­ÊÑ¿ô
¤Ë´Ø¤ï¤é¤º°·¤¦¤³¤È¤¬¤Ç¤­¡¢¤Þ¤¿ LANG ´Ä¶­ÊÑ¿ô¤ò»ØÄꤹ¤ì¤Ð ISO-8859-1 ¤È 
UTF-8 ¤Ç¥¨¥ó¥³¡¼¥É¤µ¤ì¤¿¥Æ¥­¥¹¥È¤ò¤â°·¤¦¤³¤È¤¬¤Ç¤­¤Þ¤¹¡£
<P>
<H2><A NAME="ss2.3">2.3 ´ØϢʸ½ñ</A>
</H2>

<P>
<P>Markus Kuhn ¤ÎºÇ¿·¥ê¥½¡¼¥¹¥ê¥¹¥È¡§
<P>
<UL>
<LI>
<A HREF="http://www.cl.cam.ac.uk/~mgk25/unicode.html">http://www.cl.cam.ac.uk/~mgk25/unicode.html</A></LI>
<LI>
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html">http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html</A></LI>
</UL>
<P>Roman Czyborra ¤Î Unicode¡¢UTF-8 ¤ª¤è¤Ó UTF-8 Âбþ¥×¥í¥°¥é¥à¤Î¥ª¡¼¥Ð¡¼¥Ó¥å¡¼¡§
<P>
<A HREF="http://czyborra.com/utf/#UTF-8">http://czyborra.com/utf/#UTF-8</A><P>UTF-8 ¥Õ¥¡¥¤¥ë¤ÎÎ㡧
<P>
<UL>
<LI>Markus Kuhn ¤Î ucs-fonts ¥Ñ¥Ã¥±¡¼¥¸ 
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/quickbrown.txt">quickbrown.txt</A>,
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt">UTF-8-test.txt</A>,
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-demo.txt">UTF-8-demo.txt</A>.</LI>
<LI>
<A HREF="ftp://ftp.cs.su.oz.au/gary/x-utf8.html">ftp://ftp.cs.su.oz.au/gary/x-utf8.html</A></LI>
<LI>Kosta Kostis ¤Î trans-1.1.1 ¥Ñ¥Ã¥±¡¼¥¸¤Î <CODE>iso10646</CODE> ¥Õ¥¡¥¤¥ë 
<A HREF="ftp://ftp.nid.ru/pub/os/unix/misc/trans111.tar.gz">ftp://ftp.nid.ru/pub/os/unix/misc/trans111.tar.gz</A></LI>
<LI>
<A HREF="ftp://ftp.dante.de/pub/tex/info/lwc/apc/utf8.html">ftp://ftp.dante.de/pub/tex/info/lwc/apc/utf8.html</A></LI>
<LI>
<A HREF="http://www.cogsci.ed.ac.uk/~richard/unicode-sample.html">http://www.cogsci.ed.ac.uk/~richard/unicode-sample.html</A></LI>
</UL>
<P>
<HR>
<A HREF="Unicode-HOWTO-3.html">¼¡¤Î¥Ú¡¼¥¸</A>
<A HREF="Unicode-HOWTO-1.html">Á°¤Î¥Ú¡¼¥¸</A>
<A HREF="Unicode-HOWTO.html#toc2">Ìܼ¡¤Ø</A>
</BODY>
</HTML>