diff options
Diffstat (limited to 'uxd.rst')
-rw-r--r-- | uxd.rst | 16 |
1 files changed, 8 insertions, 8 deletions
@@ -77,7 +77,7 @@ Also, a decimal point can be used: **1.5K** is 1500 bytes, **1.5k** is -a Don't dump lines that consist entirely of ASCII characters (codepoints - U+00 to U+7f). + **U+00** to **U+7F**\). .. don't dump ASCII-only lines. @@ -134,7 +134,7 @@ Also, a decimal point can be used: **1.5K** is 1500 bytes, **1.5k** is -j Java mode (aka MUTF-8). Identical to UTF-8 except the - overlong **0xc0 0x80** encoding for codepoint U+0000 (aka NUL), + overlong **0xc0 0x80** encoding for codepoint **U+0000** (aka NUL), is highlighted in purple and not counted as an error. This may be useful for looking at serialized data created by Java programs. @@ -294,7 +294,7 @@ The colors in this description are the default ones. They can be changed with the **-c** option (see above). **green**, **yellow** - Printable characters (except the space, U+0020) alternate between green and yellow. + Printable characters (except the space, **U+0020**\) alternate between green and yellow. **purple**, **cyan** Spaces and unprintable characters ("control" characters, newlines, @@ -302,10 +302,10 @@ changed with the **-c** option (see above). "visible" characters, e.g. ␣ for the space, ↵ for a newline. Hopefully this is an improvement over the usual practice of printing these as periods, like standard hex dumpers do. The Unicode BOM - (byte order marker, U+FEFF) is printed as a purple letter B. + (byte order marker, **U+FEFF**\) is printed as a purple letter B. **red** - Invalid UTF-8 sequences. These are rendered as � (U+0FFD) with + Invalid UTF-8 sequences. These are rendered as � (**U+0FFD**\) with a red background, to make them stand out. Invalid sequences are: @@ -319,12 +319,12 @@ changed with the **-c** option (see above). Also, there are sequences that are valid UTF-8 encodings, but not valid Unicode. These are normally rendered with a red background. - - UTF-16 surrogates (codepoints U+D800 to U+DFFF) [**\***]. Rendered as **S**. + - UTF-16 surrogates (codepoints **U+D800** to **U+DFFF**\) [**\***]. Rendered as **S**. - - Codepoints above U+10FFFF, which are disallowed by the Unicode standard [**\***]. + - Codepoints above **U+10FFFF**\, which are disallowed by the Unicode standard [**\***]. Rendered as **>**. - - Overlong encodings (e.g. codepoints U+0000 to U+007F encoded + - Overlong encodings (e.g. codepoints **U+0000** to **U+007F** encoded as 2 or more bytes) [**\***]. Rendered as **O**. Each error-highlighted sequence will increment the "Bad |