aboutsummaryrefslogtreecommitdiff
path: root/a8cat.rst
blob: 8367e35eed20241dbefa8d1ade363d461e247cd9 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
=====
a8cat
=====

-------------------------------------------------------------
Convert Atari 8-bit text to UTF-8 encoded Unicode (and back).
-------------------------------------------------------------

.. include:: manhdr.rst

SYNOPSIS
========

*a8cat* [**-r**] [**-i**] [**-u**] [**-t**] [*infile*] [*infile ...*]

DESCRIPTION
===========

Convert Atari 8-bit ATASCII or XL ICS (International Character
Set) text to UTF-8 encoded Unicode. Control graphics characters are
replaced with their nearest Unicode equivalents (mostly from the Box
Drawing block, or from the Basic Latin block with **-i** option).

If no *infile*\s are given, input is read from standard input. Output always
goes to standard output; to write to a file, use a command like::

  a8cat atari.txt > converted.txt

The output is UTF-8 Unicode, without BOM, but possibly with
ANSI/VT-100 control sequences. It will display correctly in modern
terminals that support Unicode and UTF-8. If you get lots of "empty
rectangle" characters, it means your font lacks the glyphs for the
codepoints; try using the Deja Vu Sans and/or Symbola fonts.

If you want to pipe the output to a pager, **less -MR** is
recommended. It will display the inverse characters correctly.

It's even possible to edit the converted text and turn it back into
ATASCII, if you're very careful. See the **-r** option, below.

Inverse video (characters codes above **$80**) are translated using
the ANSI/VT-100 reverse video escape sequences. Exception: **$9B**
(Atari EOL) is translated to **\\n** (newline).

OPTIONS
=======

-i
  Input uses Atari XL/XE International Character Set encoding, rather than
  ATASCII graphics.

-u
  Use "underlining" for inverse video. Each inverse character is followed by
  a backspace, then a *_* character. When viewed in a pager such as **less**\(1),
  this causes the characters to appear underlined. Output created with this
  option cannot be converted back to ATASCII with the **-r** option.

-t
  Text mode. Normally, everything but EOL (**$9B**) is converted to a
  Unicode graphics character. In text mode, ATASCII tabs, backspace,
  and bells are translated to the ASCII versions.

-r
  Reverse conversion: Input is UTF-8, output is ATASCII (or XL ICS, with **-i**).
  Beware that printing ATASCII to a terminal may look funny, and may
  even confuse the terminal. Redirecting to a file is safe; piping
  to a pager usually is. Only the UTF-8 codepoints that correspond to
  ATASCII characters will be converted. If the **-i** option was used to
  create the input, it must be used with **-r** also.

-m
  Magazine listing mode. Rather than Unicode graphics characters,
  ATASCII characters are printed in symbolic form, e.g. *{clear}* or
  *{ctrl-A}*. The result is similar to type-in listings in magazines
  like Antic, Analog, or Compute! Output created with this option
  cannot be converted back to ATASCII with the **-r** option.

-s
  Strip the inverse video bit (bit 7) from all characters except the
  EOL (**$9B**). Output created with this option can be converted
  back to ATASCII with the **-r** option, but of course there won't
  be any inverse-video characters in the result.

-v
  Verbose operation.

**--**
  End of options; the rest of the arguments are filenames. Use this if you're
  trying to work with files whose names begin with *-*.

**-h**, **--help**
  Show built-in help and exit.

**--version**
  Show version number and exit.

.. include:: manftr.rst