-rw-r--r-- | uxd.1 | 16 |
-rw-r--r-- | uxd.rst | 16 |
2 files changed, 16 insertions, 16 deletions
@@ -92,7 +92,7 @@ Don\(aqt alternate colors.
 .TP
 .B \-a
 Don\(aqt dump lines that consist entirely of ASCII characters (codepoints
-U+00 to U+7f).
+\fBU+00\fP to \fBU+7F\fP).
 .UNINDENT
 .\" don't dump ASCII-only lines.
 .
@@ -197,7 +197,7 @@ bad sequences.
 .TP
 .B \-j
 Java mode (aka MUTF\-8). Identical to UTF\-8 except the
-overlong \fB0xc0 0x80\fP encoding for codepoint U+0000 (aka NUL),
+overlong \fB0xc0 0x80\fP encoding for codepoint \fBU+0000\fP (aka NUL),
 is highlighted in purple and not counted as an error. This may be
 useful for looking at serialized data created by Java programs.
@@ -393,7 +393,7 @@ changed with the \fB\-c\fP option (see above).
 .INDENT 0.0
 .TP
 .B \fBgreen\fP, \fByellow\fP
-Printable characters (except the space, U+0020) alternate between green and yellow.
+Printable characters (except the space, \fBU+0020\fP) alternate between green and yellow.
 .TP
 .B \fBpurple\fP, \fBcyan\fP
 Spaces and unprintable characters ("control" characters, newlines,
@@ -401,10 +401,10 @@ tabs, etc) alternate between purple and cyan. These are printed as
 "visible" characters, e.g. ␣ for the space, ↵ for a newline. Hopefully
 this is an improvement over the usual practice of printing these as
 periods, like standard hex dumpers do. The Unicode BOM
-(byte order marker, U+FEFF) is printed as a purple letter B.
+(byte order marker, \fBU+FEFF\fP) is printed as a purple letter B.
 .TP
 .B \fBred\fP
-Invalid UTF\-8 sequences. These are rendered as � (U+0FFD) with
+Invalid UTF\-8 sequences. These are rendered as � (\fBU+0FFD\fP) with
 a red background, to make them stand out. Invalid sequences are:
 .INDENT 7.0
@@ -427,12 +427,12 @@ These are normally rendered with a red background.
 .INDENT 3.5
 .INDENT 0.0
 .IP \(bu 2
-UTF\-16 surrogates (codepoints U+D800 to U+DFFF) [\fB*\fP]. Rendered as \fBS\fP\&.
+UTF\-16 surrogates (codepoints \fBU+D800\fP to \fBU+DFFF\fP) [\fB*\fP]. Rendered as \fBS\fP\&.
 .IP \(bu 2
-Codepoints above U+10FFFF, which are disallowed by the Unicode standard [\fB*\fP].
+Codepoints above \fBU+10FFFF\fP, which are disallowed by the Unicode standard [\fB*\fP].
 Rendered as \fB>\fP\&.
 .IP \(bu 2
-Overlong encodings (e.g. codepoints U+0000 to U+007F encoded
+Overlong encodings (e.g. codepoints \fBU+0000\fP to \fBU+007F\fP encoded
 as 2 or more bytes) [\fB*\fP]. Rendered as \fBO\fP\&.
 .UNINDENT
 .UNINDENT

@@ -77,7 +77,7 @@ Also, a decimal point can be used: **1.5K** is 1500 bytes, **1.5k** is
 -a
     Don't dump lines that consist entirely of ASCII characters (codepoints
-    U+00 to U+7f).
+    **U+00** to **U+7F**\).

 .. don't dump ASCII-only lines.
@@ -134,7 +134,7 @@ Also, a decimal point can be used: **1.5K** is 1500 bytes, **1.5k** is
 -j
     Java mode (aka MUTF-8). Identical to UTF-8 except the
-    overlong **0xc0 0x80** encoding for codepoint U+0000 (aka NUL),
+    overlong **0xc0 0x80** encoding for codepoint **U+0000** (aka NUL),
     is highlighted in purple and not counted as an error. This may be
     useful for looking at serialized data created by Java programs.
@@ -294,7 +294,7 @@ The colors in this description are the default ones. They can be
 changed with the **-c** option (see above).

 **green**, **yellow**
-    Printable characters (except the space, U+0020) alternate between green and yellow.
+    Printable characters (except the space, **U+0020**\) alternate between green and yellow.

 **purple**, **cyan**
     Spaces and unprintable characters ("control" characters, newlines,
@@ -302,10 +302,10 @@ changed with the **-c** option (see above).
     "visible" characters, e.g. ␣ for the space, ↵ for a newline. Hopefully
     this is an improvement over the usual practice of printing these as
     periods, like standard hex dumpers do. The Unicode BOM
-    (byte order marker, U+FEFF) is printed as a purple letter B.
+    (byte order marker, **U+FEFF**\) is printed as a purple letter B.

 **red**
-    Invalid UTF-8 sequences. These are rendered as � (U+0FFD) with
+    Invalid UTF-8 sequences. These are rendered as � (**U+0FFD**\) with
     a red background, to make them stand out. Invalid sequences are:
@@ -319,12 +319,12 @@ changed with the **-c** option (see above).
     Also, there are sequences that are valid UTF-8 encodings, but not valid Unicode.
     These are normally rendered with a red background.

-      - UTF-16 surrogates (codepoints U+D800 to U+DFFF) [**\***]. Rendered as **S**.
+      - UTF-16 surrogates (codepoints **U+D800** to **U+DFFF**\) [**\***]. Rendered as **S**.

-      - Codepoints above U+10FFFF, which are disallowed by the Unicode standard [**\***].
+      - Codepoints above **U+10FFFF**\, which are disallowed by the Unicode standard [**\***].
         Rendered as **>**.

-      - Overlong encodings (e.g. codepoints U+0000 to U+007F encoded
+      - Overlong encodings (e.g. codepoints **U+0000** to **U+007F** encoded
         as 2 or more bytes) [**\***]. Rendered as **O**.

     Each error-highlighted sequence will increment the "Bad