<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
<META NAME="GENERATOR" CONTENT="SGML-Tools 1.0.9">
<TITLE>The Unicode HOWTO: Introduction</TITLE>
<LINK HREF="Unicode-HOWTO-3.html" REL=next>
<LINK HREF="Unicode-HOWTO-1.html" REL=previous>
<LINK HREF="Unicode-HOWTO.html#toc2" REL=contents>
</HEAD>
<BODY>
<A HREF="Unicode-HOWTO-3.html">Next</A>
<A HREF="Unicode-HOWTO-1.html">Previous</A>
<A HREF="Unicode-HOWTO.html#toc2">Contents</A>
<HR>
<H2><A NAME="s2">2. Introduction</A></H2>
<P>
<P>
<H2><A NAME="ss2.1">2.1 Why Unicode?</A>
</H2>
<P>
<P>People in different countries use different characters to represent the
words of their native languages. Nowadays most applications, including email
systems and web browsers, are 8-bit clean: they can handle and display text
correctly, provided it is represented in an 8-bit character set such as
ISO-8859-1.
<P>There are far more than 256 characters in the world: think of Cyrillic,
Hebrew, Arabic, Chinese, Japanese, Korean and Thai. New characters are also
invented from time to time. The problems this causes for users include the following.
<P>
<UL>
<LI>It is impossible to mix characters from different character sets in one
document. For example, with TeX, xdvi and PostScript you can quote Russian in
a German or French document, but you cannot do so in a plain text file.
</LI>
<LI>As long as every document has its own character set and recognition of
that character set is not automatic, the user has to intervene manually. For
example, to view the home page of the XTeamLinux distribution
<A HREF="http://www.xteamlinux.com.cn/">http://www.xteamlinux.com.cn/</A>, you have to tell Netscape
that the web page is encoded in GB2312.
</LI>
<LI>New symbols such as the Euro are being invented. ISO has issued a new
standard, ISO-8859-15, which is mostly the same as ISO-8859-1 except that it
removes a rarely used character (the old currency sign) and replaces it with
the Euro sign. Users who adopt this standard end up with documents in
different character sets on their disks and have to start thinking about
character sets every day. But computers should make things simpler, not more
complicated.</LI>
</UL>
<P>The solution to this problem is to adopt a character set that is usable
world-wide. That character set is Unicode
<A HREF="http://www.unicode.org/">http://www.unicode.org/</A>. For more information about Unicode,
run `<CODE>man 7 unicode</CODE>' (the manpage is contained in the man-pages-1.20
package).
<P>
<H2><A NAME="ss2.2">2.2 Unicode encodings</A>
</H2>
<P>
<P>With Unicode, the user's problem of dealing with character sets is reduced
to a technical one: how do you transport Unicode characters using octets
(8-bit bytes)? The 8-bit unit is the smallest addressable unit on most
computers, and it is also the unit used by TCP/IP network connections. Using
1 byte to represent 1 character is, however, an accident of history: computer
development started in Europe and the U.S., where 96 characters were
considered sufficient for a long time.
<P>
<P>There are basically four ways to encode Unicode characters in bytes:
<P>
<DL>
<DT><B>UTF-8</B><DD><P>128 characters are encoded using 1 byte (the ASCII characters). 1920
characters are encoded using 2 bytes (Roman, Greek, Cyrillic, Coptic,
Armenian, Hebrew and Arabic characters). 63488 characters are encoded using
3 bytes (Chinese and Japanese, among others). The remaining 2147418112
characters (not yet assigned) can be encoded using 4 to 6 bytes. For more
information about UTF-8, run `<CODE>man 7 utf-8</CODE>' (the manpage is
contained in the ldpman-1.20 package).
<P>
<DT><B>UCS-2</B><DD><P>Every character is represented as 2 bytes. This encoding can represent
only the first 65536 Unicode characters.
<P>
<DT><B>UTF-16</B><DD><P>This is an extension of UCS-2 that can represent 1112064 Unicode characters.
The first 65536 Unicode characters are represented as 2 bytes, the others as 4 bytes.
<P>
<DT><B>UCS-4</B><DD><P>Every character is represented as 4 bytes.
<P>
</DL>
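<P>The character counts quoted for UTF-8 follow directly from the code point
ranges of each length class. The following is a minimal sketch in Python
(chosen here only for illustration) that recomputes them and measures a few
sample characters. The sample characters are arbitrary choices, Python's
<CODE>utf-32-be</CODE> codec stands in for UCS-4, and Python's UTF-8 codec
follows the later definition that stops at U+10FFFF, so the 5- and 6-byte forms
are only counted, not produced.

```python
# Number of code points in each UTF-8 length class of the original
# 31-bit scheme described in the list above.
one_byte = 0x80                  # U+0000..U+007F
two_bytes = 0x800 - 0x80         # U+0080..U+07FF
three_bytes = 0x10000 - 0x800    # U+0800..U+FFFF
the_rest = 2**31 - 0x10000       # everything beyond the first 65536

counts = (one_byte, two_bytes, three_bytes, the_rest)


def encoded_lengths(ch):
    """Return the byte length of one character in UTF-8, UTF-16 and UCS-4."""
    return (
        len(ch.encode("utf-8")),
        len(ch.encode("utf-16-be")),  # explicit byte order, so no mark is added
        len(ch.encode("utf-32-be")),  # UTF-32 stands in for UCS-4 here
    )


# Arbitrary sample characters, one per length class (an assumption of
# this sketch, not part of the HOWTO).
samples = {
    "A": encoded_lengths("A"),                    # ASCII
    "\u0416": encoded_lengths("\u0416"),          # Cyrillic
    "\u65e5": encoded_lengths("\u65e5"),          # CJK ideograph
    "\U00010348": encoded_lengths("\U00010348"),  # outside the first 65536
}

if __name__ == "__main__":
    print(counts)
    for ch, lengths in samples.items():
        print(f"U+{ord(ch):04X}", lengths)
```

<P>Only the last sample needs 4 bytes in UTF-16, which is the case that UCS-2
cannot represent at all.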
<P>The space needed to encode a text, compared with the encodings currently in
use (8 bits per character for European languages, more for Chinese, Japanese
and Korean), is as follows. This affects disk usage and network download
speed (when no compression is used).
<P>
<DL>
<DT><B>UTF-8</B><DD><P>No change for US ASCII, a few percent more for ISO-8859-1, 1.5 times the
size for Chinese, Japanese and Korean, and twice the size for Greek and Cyrillic.
<P>
<DT><B>UCS-2 and UTF-16</B><DD><P>No change for Chinese, Japanese and Korean; twice the size for ASCII,
ISO-8859-1, Greek and Cyrillic.
<P>
<DT><B>UCS-4</B><DD><P>Twice the size for Chinese, Japanese and Korean; four times the size
(300% more) for ASCII, ISO-8859-1, Greek and Cyrillic.
</DL>
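<P>These ratios can be checked by encoding the same text in a legacy encoding
and in each Unicode encoding. The sketch below is only an illustration: the
sample strings are hypothetical, the legacy encodings (ASCII, ISO-8859-7,
EUC-JP) are assumptions picked as typical examples, and Python's
<CODE>utf-16-be</CODE> and <CODE>utf-32-be</CODE> codecs stand in for UCS-2
and UCS-4.

```python
def growth(text, legacy_encoding, unicode_encoding):
    """Ratio of the Unicode-encoded size to the legacy-encoded size."""
    legacy_size = len(text.encode(legacy_encoding))
    unicode_size = len(text.encode(unicode_encoding))
    return unicode_size / legacy_size


# Hypothetical sample texts, each paired with a typical legacy encoding.
SAMPLES = [
    ("hello world", "ascii"),
    ("\u03b1\u03b2\u03b3\u03b4", "iso8859_7"),  # Greek letters
    ("\u65e5\u672c\u8a9e", "euc_jp"),           # Japanese ideographs
]

# Explicit big-endian codecs are used so that no byte-order mark is counted.
TARGETS = ["utf-8", "utf-16-be", "utf-32-be"]

table = {
    (legacy, target): growth(text, legacy, target)
    for text, legacy in SAMPLES
    for target in TARGETS
}

if __name__ == "__main__":
    for (legacy, target), ratio in table.items():
        print(f"{legacy:10} to {target:10} x{ratio}")
```

<P>Real documents mix scripts, spaces and punctuation, so measured ratios
usually fall between these pure-script extremes.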
<P>Because US and European documents written in UCS-2, UTF-16 or UCS-4 are
larger than the same documents in ASCII or ISO-8859-1, these encodings seem
unlikely to come into wide use. The Microsoft Win32 API has supported the
UCS-2 encoding since (at least) 1995, yet UCS-2 has not been widely adopted
for documents; in Japan, Shift JIS remains prevalent.
<P>UTF-8, on the other hand, has the potential for wide use: it does not
penalize US and European users, and many text-processing programs need no
changes to support it.
<P>In the following, we describe how to change your Linux system so that it
uses UTF-8 as its text encoding.
<P>
<H3>Footnotes for C/C++ developers</H3>
<P>The approach Microsoft takes in the Win32 API makes it easy for developers
to produce Unicode versions of their programs: put "#define UNICODE" at the
top of the program and change `<CODE>char</CODE>' to
`<CODE>TCHAR</CODE>' until the compile errors are gone. The problem with this
approach is that you end up with two versions of the program: one that handles
UCS-2 text but no 8-bit encodings, and one that handles only the old 8-bit
encodings.
<P>Moreover, UCS-2 and UCS-4 have an endianness problem. The Internet
Assigned Numbers Authority (IANA) character set registry
<A HREF="http://www.isi.edu/in-notes/iana/assignments/character-sets">http://www.isi.edu/in-notes/iana/assignments/character-sets</A>
says about ISO-10646-UCS-2:
<BLOCKQUOTE>
"this needs to specify network byte order: the standard does not specify"
</BLOCKQUOTE>
Network byte order is big endian. RFC 2152 is even clearer: "ISO/IEC
10646-1:1993(E) specifies that when characters the UCS-2 form are serialized
as octets, that the most significant octet appear first." Microsoft, however,
recommends in its C/C++ development tools that you use machine-dependent
endianness (that is, little endian on Intel x86 processors), together with
either a byte-order mark at the beginning of the document or statistical
heuristics.
<P>(Translator's note: "heuristics" here means the following. If the byte
order may have been swapped and there is no other clue, you cannot tell which
character set you are looking at. In Japanese text, however, characters such
as `、' and `。' can be expected to appear with a certain frequency, so if
they do, the text is judged to be Japanese.)
<P>The UTF-8 approach, by contrast, keeps `<CODE>char*</CODE>' as the standard C
string type. As a result, a program handles ASCII text regardless of any
environment variables, and handles both ISO-8859-1 and UTF-8 encoded text
provided the LANG environment variable is set accordingly.
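<P>Two byte-level properties make this possible: ASCII text is byte-for-byte
identical in UTF-8, and UTF-8 never produces a zero byte for a non-NUL
character, so NUL-terminated C strings keep working. The sketch below
illustrates both in Python rather than C, only to stay short; the sample
strings are arbitrary assumptions.

```python
ascii_text = "plain ASCII"
mixed_text = "na\u00efve \u65e5\u672c\u8a9e"  # Latin-1 letter plus CJK

# ASCII text has the same bytes in ASCII, ISO-8859-1 and UTF-8.
same_bytes = (
    ascii_text.encode("ascii")
    == ascii_text.encode("iso8859_1")
    == ascii_text.encode("utf-8")
)

utf8 = mixed_text.encode("utf-8")
utf16 = mixed_text.encode("utf-16-be")

# A zero byte would end a C string early.
utf8_has_nul = b"\x00" in utf8
utf16_has_nul = b"\x00" in utf16

# Bytes of non-ASCII characters never look like ASCII in UTF-8, so
# byte-oriented code that splits on an ASCII space still works.
words = [part.decode("utf-8") for part in utf8.split(b" ")]

if __name__ == "__main__":
    print(same_bytes, utf8_has_nul, utf16_has_nul)
    print(words)
```

<P>The 16-bit form of the same text contains zero bytes, which is one reason
the Win32 approach needs a separate character type.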
<P>
<H2><A NAME="ss2.3">2.3 Related resources</A>
</H2>
<P>
<P>Markus Kuhn's up-to-date resource list:
<P>
<UL>
<LI>
<A HREF="http://www.cl.cam.ac.uk/~mgk25/unicode.html">http://www.cl.cam.ac.uk/~mgk25/unicode.html</A></LI>
<LI>
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html">http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html</A></LI>
</UL>
<P>Roman Czyborra's overview of Unicode, UTF-8 and UTF-8 aware programs:
<P>
<A HREF="http://czyborra.com/utf/#UTF-8">http://czyborra.com/utf/#UTF-8</A><P>Some example UTF-8 files:
<P>
<UL>
<LI>In Markus Kuhn's ucs-fonts package:
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/quickbrown.txt">quickbrown.txt</A>,
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt">UTF-8-test.txt</A>,
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-demo.txt">UTF-8-demo.txt</A>.</LI>
<LI>
<A HREF="ftp://ftp.cs.su.oz.au/gary/x-utf8.html">ftp://ftp.cs.su.oz.au/gary/x-utf8.html</A></LI>
<LI>The file <CODE>iso10646</CODE> in Kosta Kostis' trans-1.1.1 package
<A HREF="ftp://ftp.nid.ru/pub/os/unix/misc/trans111.tar.gz">ftp://ftp.nid.ru/pub/os/unix/misc/trans111.tar.gz</A></LI>
<LI>
<A HREF="ftp://ftp.dante.de/pub/tex/info/lwc/apc/utf8.html">ftp://ftp.dante.de/pub/tex/info/lwc/apc/utf8.html</A></LI>
<LI>
<A HREF="http://www.cogsci.ed.ac.uk/~richard/unicode-sample.html">http://www.cogsci.ed.ac.uk/~richard/unicode-sample.html</A></LI>
</UL>
<P>
<HR>
<A HREF="Unicode-HOWTO-3.html">Next</A>
<A HREF="Unicode-HOWTO-1.html">Previous</A>
<A HREF="Unicode-HOWTO.html#toc2">Contents</A>
</BODY>
</HTML>