diff options
author | David Robillard <d@drobilla.net> | 2023-02-05 18:39:49 -0500 |
---|---|---|
committer | David Robillard <d@drobilla.net> | 2023-12-02 18:49:08 -0500 |
commit | 343124df71010055c2c1e6cdcadd13d23b2c013a (patch) | |
tree | 7c2de6a72021adaac89e9c4fa97e7cc5503e0657 /doc/man/serd-pipe.1 | |
parent | 530edb265fbbed20e6d3a6fd7a36461ff83d9b46 (diff) | |
download | serd-343124df71010055c2c1e6cdcadd13d23b2c013a.tar.gz serd-343124df71010055c2c1e6cdcadd13d23b2c013a.tar.bz2 serd-343124df71010055c2c1e6cdcadd13d23b2c013a.zip |
[WIP] Add support for URI hex escape decoding
Diffstat (limited to 'doc/man/serd-pipe.1')
-rw-r--r-- | doc/man/serd-pipe.1 | 34 |
1 files changed, 29 insertions, 5 deletions
diff --git a/doc/man/serd-pipe.1 b/doc/man/serd-pipe.1 index 54e3a3b1..9ee3ee54 100644 --- a/doc/man/serd-pipe.1 +++ b/doc/man/serd-pipe.1 @@ -152,6 +152,11 @@ Note that this may corrupt the output by merging distinct blank nodes. Generate blank node labels with suffixes left-padded with zeros. This generates IDs like "_:b0000000123" that sort in numerical order, which can be useful to preserve statement ordering. +.It Cm decoded +Read URIs with percent-encoded UTF-8 characters decoded. +Normally, percent-encoded octets in URIs are preserved as plain text. +This flag enables interpreting them as UTF-8, +decoding escapes like "%7E" to characters like "~" where possible. .El .It Fl O Ar syntax Set an output syntax or option. @@ -168,13 +173,32 @@ or an option: .Bl -tag -width 3n .It Cm ascii Escape all non-ASCII characters. +Normally, text is written in UTF-8. +This flag will escape additional non-printable-ASCII characters in string literals like +.Li \eU00B7 +or +.Li \eU0001F600 , +and in URIs like +.Li %B7 +or +.Li %F0%9F%98%80 . +.It Cm escapes +Escape all non-ASCII characters with +.Dq U +escapes. +This works like +.Cm ascii , +except percent-encoding will not be used in URIs +(matching the format used in the Turtle test suite). .It Cm contextual Suppress writing directives that describe the context. -Normally when writing Turtle or Trig, -a document will have a header that defines all the prefixes used in the input. -This flag will disable writing those directives, -so the output is document fragment with an implicit context. -This can be useful for writing output intended for humans. +This can be used to suppress the header of +.Li prefix +and +.Li base +directives, +making the output depend on an implied context. +Note that this option may produce incomprehensible output if prefixes change while writing! .It Cm expanded Write expanded URIs instead of prefixed names. .It Cm lax |