The title screen uses a crude form of compression, which I'll call ZRLE
(zero-run length encoding).

Theory of operation:
--------------------

Unlike real RLE, ZRLE only compresses runs of 0-bytes. Non-zero bytes
are stored as-is, not compressed at all. Also unlike real RLE, ZRLE
won't work on arbitrary input data. Why?

ZRLE relies on some byte values being unused in the input. In other words,
if the file contains every byte value 0 to 255, it can't be compressed
with ZRLE. There needs to be at least one unused value per length of
null run found in the file.

In other words, if the input looks like:

01 00 00 02 00 00 03 00 00 00 04

...there are 2 lengths of zero-run: 2 and 3. The fact that there are
two runs of length 2 doesn't affect the algorithm (in fact it helps
the compression ratio). Here are the runs, illustrated in glorious
ASCII art:

01 00 00 02 00 00 03 00 00 00 04
   \___/    \___/    \______/
   run of   run of    run of
  2 zeroes  2 zeroes  3 zeroes

For each length of zero-run found in the file, an otherwise-unused byte
is assigned as a marker for that length. The above input might be encoded
as:

01 05 02 05 03 06 04

...where 05 represents a run of two zeroes, and 06 represents a run of
three zeroes.

This means there are no escape bytes, markers, or counts stored in
the file. Each run of zero-bytes collapses into one byte of data,
which we'll call a code value. All bytes that aren't code values, are
just uncompressed data bytes, used as-is.

If there aren't enough unused values, the encoder will fail, saying it's
out of codes.

But how can this be decoded?

The encoder creates a table for the decoder to use, of course. The
table is written to the file comptitle.s, along with the rest of the
decoder code from comptitle.s.in. This means the decoder is specific to
the encoded file, not general-purpose. If we weren't on such a limited
system as the Atari 800, the table would be part of the output file,
not the code (like the "dictionary" in a deflated file, etc).

Each byte value in the compressed data either represents itself (is a
plain data value) or some number of consecutive zero bytes (is a code
value). The table has an entry for each byte value 0-255 (conceptually
anyway; see below), and if the byte value represents itself, the table
entry will be zero. If it represents a zero-run, the table entry will
be non-zero, and will be the number of bytes in the run.

For a "run" of one zero-byte, the encoder doesn't bother to encode it as
a run: a plain zero is stored in the output file instead. This doesn't
affect the file size, but does save one encoded value.

Here's the above example again:

01 00 00 02 00 00 03 00 00 00 04

It encoded to:

01 05 02 05 03 06 04

The table built into the decoder will look like:

byte value | definition
-----------+-----------
01         | 0
02         | 0
03         | 0
04         | 0
05         | 2
06         | 3

This tells the decoder that 01, 02, 03 represent themselves (they're
copied from compressed input to decompressed output as-is)... the value
05 represents a run of 2 consecutive zeroes, and 06 represents a run
of 3 zeroes.

The decoder will look at the input, one byte at a time, and consult
the table to decide what to do with each byte. "Output" here is the
decompressed data, which you can think of as a list of bytes, which
starts out empty.

input| table lookup |                 |
data | result       | action          | output after action is taken
-----+--------------+-----------------+---------------------------------
01   | 0            | append 01       | 01
     |              |                 |
05   | 2            | append 2 zeroes | 01 00 00
     |              |                 |
02   | 0            | append 02       | 01 00 00 02
     |              |                 |
05   | 2            | append 2 zeroes | 01 00 00 02 00 00
     |              |                 |
03   | 0            | append 03       | 01 00 00 02 00 00 03
     |              |                 |
06   | 3            | append 3 zeroes | 01 00 00 02 00 00 03 00 00 00
     |              |                 |
04   | 0            | append 04       | 01 00 00 02 00 00 03 00 00 00 04

As you can see, the final output after the last byte of data is read,
matches the original input exactly.

In this dumb example, it's pretty obvious that the encoded data plus
the table size is bigger than the original input, so the "compression"
is actually making it bigger. But it serves to illustrate, I hope.

For Taipan, the title screen (newtitle.png) contains large black areas.
A black pixel is a 0 bit. The title screen data ends up packed 8 pixels
per byte (1 bit per pixel), so those black areas are represented by runs
of consecutive zero bytes.

Each row of pixels in the image has 256 bits, or 32 bytes. The bytes
are arranged in consecutive order: the leftmost 8 visible pixels are the
first byte of the first row... after enough bytes to fill up the row, the
next byte in the file is the leftmost 8 pixels of the 2nd line, etc etc.

The algorithm treats the input as a stream of bytes, and doesn't know
nor care how they're arranged on the screen, but if you know how they're
stored you can see that a black area at the right-hand end of one line
is "joined" with a black area at the left-hand end of the next line,
as far as the compression is concerned.

Specifics:
----------

The original image newtitle.png is the authoritative source for the title
screen that ends up being built into the game binary. Its resolution is
256x184 pixels. If this image is ever modified, the Makefile will cause
all the stuff mentioned below to be rebuilt.

First, a script called newtitle.pl reads the PNG image and creates an
Atari loadable file called titledata.xex. It contains a 6-byte Atari
header followed by the pixel data, one bit per pixel. The pixel data
is 5888 bytes. Formerly, before the compression scheme was invented,
titledata.xex was used as-is in creating the game binary taipan.xex.

The compression code is written in Perl, in the file titlecomp.pl. It
reads titledata.xex, applies the compression, and writes a file called
comptitle.dat, containing the raw compressed data (minus the table
definition). It also creates an assembly source file comptitle.s,
containing the table definition and the rest of the decoding code.

If you look at titlecomp.pl, you'll see it takes an optional argument,
forcing it to use a particular byte value as the first code value. This
number is determined by running the script repeatedly with different
arguments, to find the one with the smallest table size.

Wait, what do I mean by smallest table size? Well, I said the table
contains one entry for each byte value 0 to 255... This isn't really
true. Byte values less than 128 *always* represent themselves, so I
don't bother storing them in the table. Also, the perl script knows the
lowest and highest numbered code values, so it only emits the portion
of the table between these two numbers (and modifies the code to not do
a table lookup, if it's outside the range). If you can read 6502
assembly, looking at the code will make this much clearer.

comptitle.s is then assembled, to create comptitle.xex.

When comptitle.s is assembled, it includes the contents of comptitle.dat
with an ".incbin", so the data ends up in comptitle.xex along with the
table and decoder. This file contains 2 Atari binary load segments:
the data + table + decoder, and an initialization segment telling the
Atari to run the decoder subroutine immediately after loading.

comptitle.xex ends up being the first part of the complete taipan.xex game
binary, so after its init routine exits, the rest of the game continues
to load. In particular, the next segment is another init routine that
tells the Atari to actually display the decoded title screen data.

Compression Ratio:
------------------

If we want to calculate the compression ratio, the size of comptitle.xex
is the compressed file size, and the size of titledata.xex is the original
file size:

$ ls -l comptitle.xex titledata.xex 
-rw-r--r-- 1 urchlay users 3602 Jan  6 02:08 comptitle.xex
-rw-r--r-- 1 urchlay users 5894 Jan  6 01:55 titledata.xex

The percentage is:

$ perl -e 'printf "%.1f\n", 3602/5894*100'
61.1

Actually, we have to do the calculations in disk sectors. A standard Atari
floppy disk holds 125 data bytes per sector (or less, but we assume full
sectors because that's how taipan.xex is generated). It's impossible
to read less than a full sector from the drive. So titledata.xex is 48
sectors, and comptitle.xex is 29.

What we really care about is loading time, not disk space: the title
screen should be visible as soon as possible after the program starts
loading (to give the user something to look at, while the huge bloated
game code loads). Standard 810 or 1050 drives load around 10 sectors
per second, we saved 19 sectors or around 2 seconds.  The decompression
takes 1/5 of a second, so it's a net gain.