aboutsummaryrefslogtreecommitdiff
path: root/uxd.rst
diff options
context:
space:
mode:
authorB. Watson <urchlay@slackware.uk>2024-12-21 04:34:10 -0500
committerB. Watson <urchlay@slackware.uk>2024-12-21 04:34:10 -0500
commit255436113adb73659428e18fd3f8e54858025ffe (patch)
tree874624736c8def7ff01ce641571561e49bd7a98d /uxd.rst
parentd7546f793ca01866ba1a1351e2f139a7c0ed9711 (diff)
downloaduxd-255436113adb73659428e18fd3f8e54858025ffe.tar.gz
update man page
Diffstat (limited to 'uxd.rst')
-rw-r--r--uxd.rst33
1 files changed, 18 insertions, 15 deletions
diff --git a/uxd.rst b/uxd.rst
index 6069e4e..7e35440 100644
--- a/uxd.rst
+++ b/uxd.rst
@@ -260,13 +260,13 @@ changed with the **-c** option (see above).
**green**, **yellow**
Printable characters (except the space, U+0020) alternate between green and yellow.
-**purple**
+**purple**, **cyan**
Spaces and unprintable characters ("control" characters, newlines,
- tabs, etc). These are printed as "visible" characters, e.g. ␣ for
- the space, ↵ for a newline. Hopefully this is an improvement over
- the usual practice of printing these as periods, like standard hex
- dumpers do. The Unicode BOM (byte order marker, U+FEFF) is printed
- as a purple letter B.
+ tabs, etc) alternate between purple and cyan. These are printed as
+ "visible" characters, e.g. ␣ for the space, ↵ for a newline.
+ Hopefully this is an improvement over the usual practice of printing
+ these as periods, like standard hex dumpers do. The Unicode BOM
+ (byte order marker, U+FEFF) is printed as a purple letter B.
**red**
Invalid UTF-8 sequences. These are rendered as � (U+0FFD) with
@@ -274,11 +274,14 @@ changed with the **-c** option (see above).
sequences are:
- Prefix bytes (>= 0x80) which are not followed by the correct number of continuation
- bytes (with their high 2 bits set to **10**). Rendered as **�**.
+ bytes (with their high 2 bits set to **10**).
- - Continuation bytes that aren't preceded by a valid prefix byte. Rendered as **�**.
+ - Continuation bytes that aren't preceded by a valid prefix byte.
- - Truncated UTF-8 sequence at EOF. Rendered as **�**.
+ - Truncated UTF-8 sequence at EOF.
+
+ Also, there are sequences that are valid UTF-8 encodings, but not valid Unicode.
+ These are normally rendered with a red background.
- UTF-16 surrogates (codepoints U+D800 to U+DFFF) [**\***]. Rendered as **S**.
@@ -288,12 +291,12 @@ changed with the **-c** option (see above).
- Overlong encodings (e.g. codepoints U+0000 to U+007F encoded
as 2 or more bytes) [**\***]. Rendered as **O**.
- For items marked with [**\***], the **-j**, **-p**, and/or **-w**
- options can disable error highlighting for this type of error. They
- will be displayed in purple rather than red.
+ Each error-highlighted sequence will increment the "Bad
+ sequences" count, if the **-i** option is used.
- Each error-highlighted sequence will increment the "Bad
- sequences" count, if the **-i** option is used.
+ For items marked with [**\***], the **-j**, **-p**, and/or **-w**
+ options can disable error highlighting for this type of error. They
+ will be displayed in purple or cyan rather than red.
TERMINAL SUPPORT
================
@@ -313,7 +316,7 @@ Known **not** to work: rxvt (doesn't support Unicode at all), and its
derivatives such as aterm.
**uxd** also builds and runs correctly on a Mac running a recent
-version of OSX (though I'm not sure what terminal was used).
+version of OSX with Terminal.app.
FONTS
=====