diff options
author | B. Watson <urchlay@slackware.uk> | 2024-12-21 04:34:10 -0500 |
---|---|---|
committer | B. Watson <urchlay@slackware.uk> | 2024-12-21 04:34:10 -0500 |
commit | 255436113adb73659428e18fd3f8e54858025ffe (patch) | |
tree | 874624736c8def7ff01ce641571561e49bd7a98d /uxd.rst | |
parent | d7546f793ca01866ba1a1351e2f139a7c0ed9711 (diff) | |
download | uxd-255436113adb73659428e18fd3f8e54858025ffe.tar.gz |
update man page
Diffstat (limited to 'uxd.rst')
-rw-r--r-- | uxd.rst | 33 |
1 files changed, 18 insertions, 15 deletions
@@ -260,13 +260,13 @@ changed with the **-c** option (see above). **green**, **yellow** Printable characters (except the space, U+0020) alternate between green and yellow. -**purple** +**purple**, **cyan** Spaces and unprintable characters ("control" characters, newlines, - tabs, etc). These are printed as "visible" characters, e.g. ␣ for - the space, ↵ for a newline. Hopefully this is an improvement over - the usual practice of printing these as periods, like standard hex - dumpers do. The Unicode BOM (byte order marker, U+FEFF) is printed - as a purple letter B. + tabs, etc) alternate between purple and cyan. These are printed as + "visible" characters, e.g. ␣ for the space, ↵ for a newline. + Hopefully this is an improvement over the usual practice of printing + these as periods, like standard hex dumpers do. The Unicode BOM + (byte order marker, U+FEFF) is printed as a purple letter B. **red** Invalid UTF-8 sequences. These are rendered as � (U+0FFD) with @@ -274,11 +274,14 @@ changed with the **-c** option (see above). sequences are: - Prefix bytes (>= 0x80) which are not followed by the correct number of continuation - bytes (with their high 2 bits set to **10**). Rendered as **�**. + bytes (with their high 2 bits set to **10**). - - Continuation bytes that aren't preceded by a valid prefix byte. Rendered as **�**. + - Continuation bytes that aren't preceded by a valid prefix byte. - - Truncated UTF-8 sequence at EOF. Rendered as **�**. + - Truncated UTF-8 sequence at EOF. + + Also, there are sequences that are valid UTF-8 encodings, but not valid Unicode. + These are normally rendered with a red background. - UTF-16 surrogates (codepoints U+D800 to U+DFFF) [**\***]. Rendered as **S**. @@ -288,12 +291,12 @@ changed with the **-c** option (see above). - Overlong encodings (e.g. codepoints U+0000 to U+007F encoded as 2 or more bytes) [**\***]. Rendered as **O**. - For items marked with [**\***], the **-j**, **-p**, and/or **-w** - options can disable error highlighting for this type of error. They - will be displayed in purple rather than red. + Each error-highlighted sequence will increment the "Bad + sequences" count, if the **-i** option is used. - Each error-highlighted sequence will increment the "Bad - sequences" count, if the **-i** option is used. + For items marked with [**\***], the **-j**, **-p**, and/or **-w** + options can disable error highlighting for this type of error. They + will be displayed in purple or cyan rather than red. TERMINAL SUPPORT ================ @@ -313,7 +316,7 @@ Known **not** to work: rxvt (doesn't support Unicode at all), and its derivatives such as aterm. **uxd** also builds and runs correctly on a Mac running a recent -version of OSX (though I'm not sure what terminal was used). +version of OSX with Terminal.app. FONTS ===== |