/usr/share/doc/recode-doc/Micros.html is in recode-doc 3.6-22.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<!-- Created by GNU Texinfo 6.0, http://www.gnu.org/software/texinfo/ -->
<head>
<title>The recode reference manual: Micros</title>
<meta name="description" content="The recode reference manual: Micros">
<meta name="keywords" content="The recode reference manual: Micros">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="makeinfo">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<link href="index.html#Top" rel="start" title="Top">
<link href="Concept-Index.html#Concept-Index" rel="index" title="Concept Index">
<link href="Charset-and-Surface-Index.html#SEC_Contents" rel="contents" title="Table of Contents">
<link href="index.html#Top" rel="up" title="Top">
<link href="Miscellaneous.html#Miscellaneous" rel="next" title="Miscellaneous">
<link href="CDC.html#Bang_002dBang" rel="prev" title="Bang-Bang">
<style type="text/css">
<!--
a.summary-letter {text-decoration: none}
blockquote.indentedblock {margin-right: 0em}
blockquote.smallindentedblock {margin-right: 0em; font-size: smaller}
blockquote.smallquotation {font-size: smaller}
div.display {margin-left: 3.2em}
div.example {margin-left: 3.2em}
div.lisp {margin-left: 3.2em}
div.smalldisplay {margin-left: 3.2em}
div.smallexample {margin-left: 3.2em}
div.smalllisp {margin-left: 3.2em}
kbd {font-style: oblique}
pre.display {font-family: inherit}
pre.format {font-family: inherit}
pre.menu-comment {font-family: serif}
pre.menu-preformatted {font-family: serif}
pre.smalldisplay {font-family: inherit; font-size: smaller}
pre.smallexample {font-size: smaller}
pre.smallformat {font-family: inherit; font-size: smaller}
pre.smalllisp {font-size: smaller}
span.nocodebreak {white-space: nowrap}
span.nolinebreak {white-space: nowrap}
span.roman {font-family: serif; font-weight: normal}
span.sansserif {font-family: sans-serif; font-weight: normal}
ul.no-bullet {list-style: none}
-->
</style>
</head>
<body lang="en">
<a name="Micros"></a>
<div class="header">
<p>
Next: <a href="Miscellaneous.html#Miscellaneous" accesskey="n" rel="next">Miscellaneous</a>, Previous: <a href="CDC.html#CDC" accesskey="p" rel="prev">CDC</a>, Up: <a href="index.html#Top" accesskey="u" rel="up">Top</a> [<a href="Charset-and-Surface-Index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html#Concept-Index" title="Index" rel="index">Index</a>]</p>
</div>
<a name="Other-micro_002dcomputer-charsets"></a>
<h2 class="chapter">11 Other micro-computer charsets</h2>
<a name="index-NeXT-charsets"></a>
<p>The <code>NeXT</code> charset, which used to be especially provided in releases of
<code>recode</code> before 3.5, has been integrated since as one RFC 1345<!-- /@w --> table.
</p>
<table class="menu" border="0" cellspacing="0">
<tr><td align="left" valign="top">• <a href="#Apple_002dMac" accesskey="1">Apple-Mac</a>:</td><td> </td><td align="left" valign="top">Apple’s Macintosh code
</td></tr>
<tr><td align="left" valign="top">• <a href="#AtariST" accesskey="2">AtariST</a>:</td><td> </td><td align="left" valign="top">Atari ST code
</td></tr>
</table>
<hr>
<a name="Apple_002dMac"></a>
<div class="header">
<p>
Next: <a href="#AtariST" accesskey="n" rel="next">AtariST</a>, Previous: <a href="#Micros" accesskey="p" rel="prev">Micros</a>, Up: <a href="#Micros" accesskey="u" rel="up">Micros</a> [<a href="Charset-and-Surface-Index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html#Concept-Index" title="Index" rel="index">Index</a>]</p>
</div>
<a name="Apple_0027s-Macintosh-code"></a>
<h3 class="section">11.1 Apple’s Macintosh code</h3>
<a name="index-Apple_002dMac"></a>
<a name="index-Macintosh-charset"></a>
<p>This charset is available in <code>recode</code> under the name <code>Apple-Mac</code>.
The shortest way of specifying it in <code>recode</code> is <code>ap</code>.
</p>
<p>The charset is aimed towards a Macintosh micro-computer from Apple.
This is an eight bit code. The file is the data fork only. This charset
is fairly old in <code>recode</code>, its tables were produced a long while ago
by mere inspection of a printed chart of the Macintosh codes and glyph.
</p>
<a name="index-CR-surface_002c-in-Macintosh-charsets"></a>
<p>It has <code>CR</code> as its implied surface. This means that, if the original
end of lines have to be preserved while going out of <code>Apple-Mac</code>, they
should currently be added back through the usage of a surface on the other
charset, or better, just never removed. Here are examples for both cases:
</p>
<div class="example">
<pre class="example">recode ap..l2/cr < <var>input</var> > <var>output</var>
recode ap/..l2 < <var>input</var> > <var>output</var>
</pre></div>
<p>RFC 1345<!-- /@w --> brings into <code>recode</code> 2 other Macintosh charsets. You can
discover them by using <code>grep</code> over the output of ‘<samp>recode -l</samp>’:
</p>
<div class="example">
<pre class="example">recode -l | grep -i mac
</pre></div>
<a name="index-macintosh_002c-a-charset_002c-and-its-aliases"></a>
<a name="index-macintosh_005fce_002c-and-its-aliases"></a>
<a name="index-mac-1"></a>
<a name="index-macce-1"></a>
<p>Charsets <code>macintosh</code> and <code>macintosh_ce</code>, as well as their aliases
<code>mac</code> and <code>macce</code> also have <code>CR</code> as their implied surface.
</p>
<p>There are a few discrepancies between the <code>Apple-Mac</code> charset and
the very similar RFC 1345<!-- /@w --> charset <code>macintosh</code>, which have not been
analysed yet, so the charsets are being kept separate for now. This might
change in the future, and the <code>Apple-Mac</code> charset might disappear.
Wizards would be interested in comparing the output of these two commands:
</p>
<div class="example">
<pre class="example">recode -vh Apple-Mac..Latin-1
recode -vh macintosh..Latin-1
</pre></div>
<p>The first command use the charset prior to RFC 1345<!-- /@w --> introduction.
Both methods give different recodings. These differences are annoying,
the fuzziness will have to be explained and settle down one day.
</p>
<a name="index-recode_002c-a-Macintosh-port"></a>
<p>As a side note, some people ask if there is a Macintosh port of the
<code>recode</code> program. I’m not aware of any. I presume that if the tool
fills a need for Macintosh users, someone will port it one of these days?
</p>
<hr>
<a name="AtariST"></a>
<div class="header">
<p>
Previous: <a href="#Apple_002dMac" accesskey="p" rel="prev">Apple-Mac</a>, Up: <a href="#Micros" accesskey="u" rel="up">Micros</a> [<a href="Charset-and-Surface-Index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html#Concept-Index" title="Index" rel="index">Index</a>]</p>
</div>
<a name="Atari-ST-code"></a>
<h3 class="section">11.2 Atari ST code</h3>
<a name="index-AtariST"></a>
<p>This charset is available in <code>recode</code> under the name <code>AtariST</code>.
</p>
<p>This is the character set used on the Atari ST/TT/Falcon. This is similar
to <code>IBM-PC</code>, but differs in some details: it includes some more accented
characters, the graphic characters are mostly replaced by Hebrew characters,
and there is a true German <kbd>sharp s</kbd> different from Greek <kbd>beta</kbd>.
</p>
<p>About the end-of-line conversions: the canonical end-of-line on the
Atari is ‘<samp>\r\n</samp>’, but unlike <code>IBM-PC</code>, the OS makes no
difference between text and binary input/output; it is up to the
application how to interpret the data. In fact, most of the libraries
that come with compilers can grok both ‘<samp>\r\n</samp>’ and ‘<samp>\n</samp>’ as end
of lines. Many of the users who also have access to Unix systems prefer
‘<samp>\n</samp>’ to ease porting Unix utilities. So, for easing reversibility,
<code>recode</code> tries to let ‘<samp>\r</samp>’ undisturbed through recodings.
</p>
<hr>
<div class="header">
<p>
Previous: <a href="#Apple_002dMac" accesskey="p" rel="prev">Apple-Mac</a>, Up: <a href="#Micros" accesskey="u" rel="up">Micros</a> [<a href="Charset-and-Surface-Index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html#Concept-Index" title="Index" rel="index">Index</a>]</p>
</div>
</body>
</html>
|