1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
|
=====
a8cat
=====
-------------------------------------------------------------
Convert Atari 8-bit text to UTF-8 encoded Unicode (and back).
-------------------------------------------------------------
.. include:: manhdr.rst
SYNOPSIS
========
*a8cat* [**-r**] [**-i**] [**-u**] [**-t**] [*infile*] [*infile ...*]
DESCRIPTION
===========
Convert Atari 8-bit ATASCII or XL ICS (International Character
Set) text to UTF-8 encoded Unicode. Control graphics characters are
replaced with their nearest Unicode equivalents (mostly from the Box
Drawing block, or from the Basic Latin block with **-i** option).
If no *infile*\s are given, input is read from standard input. Output always
goes to standard output; to write to a file, use a command like::
a8cat atari.txt > converted.txt
The output is UTF-8 Unicode, without BOM, but possibly with
ANSI/VT-100 control sequences. It will display correctly in modern
terminals that support Unicode and UTF-8. If you get lots of "empty
rectangle" characters, it means your font lacks the glyphs for the
codepoints; try using the Deja Vu Sans and/or Symbola fonts.
If you want to pipe the output to a pager, **less -MR** is
recommended. It will display the inverse characters correctly.
It's even possible to edit the converted text and turn it back into
ATASCII, if you're very careful. See the **-r** option, below.
Inverse video (characters codes above **$80**) are translated using
the ANSI/VT-100 reverse video escape sequences. Exception: **$9B**
(Atari EOL) is translated to **\\n** (newline).
OPTIONS
=======
-i
Input uses Atari XL/XE International Character Set encoding, rather than
ATASCII graphics.
-u
Use "underlining" for inverse video. Each inverse character is followed by
a backspace, then a *_* character. When viewed in a pager such as **less**\(1),
this causes the characters to appear underlined. Output created with this
option cannot be converted back to ATASCII with the **-r** option.
-t
Text mode. Normally, everything but EOL (**$9B**) is converted to a
Unicode graphics character. In text mode, ATASCII tabs, backspace,
and bells are translated to the ASCII versions.
-r
Reverse conversion: Input is UTF-8, output is ATASCII (or XL ICS, with **-i**).
Beware that printing ATASCII to a terminal may look funny, and may
even confuse the terminal. Redirecting to a file is safe; piping
to a pager usually is. Only the UTF-8 codepoints that correspond to
ATASCII characters will be converted. If the **-i** option was used to
create the input, it must be used with **-r** also.
-m
Magazine listing mode. Rather than Unicode graphics characters,
ATASCII characters are printed in symbolic form, e.g. *{clear}* or
*{ctrl-A}*. The result is similar to type-in listings in magazines
like Antic, Analog, or Compute! Output created with this option
cannot be converted back to ATASCII with the **-r** option.
-s
Strip the inverse video bit (bit 7) from all characters except the
EOL (**$9B**). Output created with this option can be converted
back to ATASCII with the **-r** option, but of course there won't
be any inverse-video characters in the result.
-v
Verbose operation.
**--**
End of options; the rest of the arguments are filenames. Use this if you're
trying to work with files whose names begin with *-*.
**-h**, **--help**
Show built-in help and exit.
**--version**
Show version number and exit.
.. include:: manftr.rst
|