From 343fd43b95960f59a3bf901f59503757b81a5592 Mon Sep 17 00:00:00 2001
From: "B. Watson"
Date: Thu, 12 Dec 2024 16:46:35 -0500
Subject: fix Makefile, dashes in hex dump, red for codepoints > U+10FFFF, fix spacing.

---
 uxd.1 | 7 ++-----
 1 file changed, 2 insertions(+), 5 deletions(-)

(limited to 'uxd.1')

diff --git a/uxd.1 b/uxd.1
index cb69b28..fe1bb34 100644
--- a/uxd.1
+++ b/uxd.1
@@ -121,6 +121,8 @@ bytes (with their high 2 bits set to \fB10\fP).
 Continuation bytes that aren\(aqt preceded by a valid prefix byte.
 .IP \(bu 2
 Truncated UTF\-8 sequence at EOF.
+.IP \(bu 2
+Codepoints above U+10FFFF, which are disallowed by RFC 3629.
 .UNINDENT
 .UNINDENT
 .UNINDENT
@@ -172,11 +174,6 @@ that could be a 1\-byte sequence, but is encoded as 2 or more).
 Sequences like this really should be colorized in red. Technically,
 this means \fBuxd\fP supports WTF\-8, not UTF\-8.
 .sp
-RFC 3629 doesn\(aqt allow UTF\-8 to use codepoints above U+10FFFF. 4\-byte
-sequences can support codepoints U+110000 to U+1FFFFF, which are not
-valid Unicode. If these occur in the input, \fBuxd\fP should colorize
-them in red, but it doesn\(aqt (yet).
-.sp
 There should be options and/or a config file to change the colors,
 rather than baking them into the binary.
 .sp
-- 
cgit v1.2.3
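
Note on the range check this patch documents: the removed paragraph points out that a 4-byte UTF-8 sequence carries 21 payload bits, so it can encode values up to U+1FFFFF even though RFC 3629 stops at U+10FFFF. The following is a minimal sketch of how such a check could look, not the code uxd actually uses. The names decode_4byte and is_beyond_unicode are hypothetical, and the sketch assumes the caller has already verified that the first byte has the form 11110xxx and that the next three bytes are continuation bytes (10xxxxxx).

```c
#include <stdint.h>

/* Highest codepoint RFC 3629 allows in UTF-8. */
#define MAX_CODEPOINT 0x10FFFFu

/* Hypothetical helper, not taken from uxd.
 * Decodes a 4-byte UTF-8 sequence into a codepoint.
 * Assumes b[0] is 11110xxx and b[1..3] are 10xxxxxx;
 * no validation of the byte patterns is done here. */
static uint32_t decode_4byte(const unsigned char *b)
{
    return ((uint32_t)(b[0] & 0x07) << 18) |  /* 3 payload bits */
           ((uint32_t)(b[1] & 0x3F) << 12) |  /* 6 payload bits */
           ((uint32_t)(b[2] & 0x3F) << 6)  |  /* 6 payload bits */
            (uint32_t)(b[3] & 0x3F);          /* 6 payload bits */
}

/* Hypothetical helper, not taken from uxd.
 * Returns 1 if the sequence decodes to a value above U+10FFFF,
 * i.e. one that would be flagged as invalid, and 0 otherwise. */
static int is_beyond_unicode(const unsigned char *b)
{
    return decode_4byte(b) > MAX_CODEPOINT;
}
```

With these helpers, the bytes F4 8F BF BF decode to U+10FFFF and pass, while F4 90 80 80 decode to U+110000 and F7 BF BF BF decode to U+1FFFFF, both of which fall in the range the man page now lists as invalid.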