aboutsummaryrefslogtreecommitdiffstats
path: root/doc/man/serd-pipe.1
diff options
context:
space:
mode:
authorDavid Robillard <d@drobilla.net>2023-02-05 18:39:49 -0500
committerDavid Robillard <d@drobilla.net>2023-12-02 18:49:08 -0500
commit343124df71010055c2c1e6cdcadd13d23b2c013a (patch)
tree7c2de6a72021adaac89e9c4fa97e7cc5503e0657 /doc/man/serd-pipe.1
parent530edb265fbbed20e6d3a6fd7a36461ff83d9b46 (diff)
downloadserd-343124df71010055c2c1e6cdcadd13d23b2c013a.tar.gz
serd-343124df71010055c2c1e6cdcadd13d23b2c013a.tar.bz2
serd-343124df71010055c2c1e6cdcadd13d23b2c013a.zip
[WIP] Add support for URI hex escape decoding
Diffstat (limited to 'doc/man/serd-pipe.1')
-rw-r--r--doc/man/serd-pipe.134
1 files changed, 29 insertions, 5 deletions
diff --git a/doc/man/serd-pipe.1 b/doc/man/serd-pipe.1
index 54e3a3b1..9ee3ee54 100644
--- a/doc/man/serd-pipe.1
+++ b/doc/man/serd-pipe.1
@@ -152,6 +152,11 @@ Note that this may corrupt the output by merging distinct blank nodes.
Generate blank node labels with suffixes left-padded with zeros.
This generates IDs like "_:b0000000123" that sort in numerical order,
which can be useful to preserve statement ordering.
+.It Cm decoded
+Read URIs with percent-encoded UTF-8 characters decoded.
+Normally, percent-encoded octets in URIs are preserved as plain text.
+This flag enables interpreting them as UTF-8,
+decoding escapes like "%7E" to characters like "~" where possible.
.El
.It Fl O Ar syntax
Set an output syntax or option.
@@ -168,13 +173,32 @@ or an option:
.Bl -tag -width 3n
.It Cm ascii
Escape all non-ASCII characters.
+Normally, text is written in UTF-8.
+This flag will escape additional non-printable-ASCII characters in string literals like
+.Li \eU00B7
+or
+.Li \eU0001F600 ,
+and in URIs like
+.Li %B7
+or
+.Li %F0%9F%98%80 .
+.It Cm escapes
+Escape all non-ASCII characters with
+.Dq U
+escapes.
+This works like
+.Cm ascii ,
+except percent-encoding will not be used in URIs
+(matching the format used in the Turtle test suite).
.It Cm contextual
Suppress writing directives that describe the context.
-Normally when writing Turtle or Trig,
-a document will have a header that defines all the prefixes used in the input.
-This flag will disable writing those directives,
-so the output is document fragment with an implicit context.
-This can be useful for writing output intended for humans.
+This can be used to suppress the header of
+.Li prefix
+and
+.Li base
+directives,
+making the output depend on an implied context.
+Note that this option may produce incomprehensible output if prefixes change while writing!
.It Cm expanded
Write expanded URIs instead of prefixed names.
.It Cm lax