serd.git - Serd RDF serialisation library

Age	Commit message (Collapse)	Author	Files	Lines
2023-12-02	[WIP] Partially resurrect CURIE nodes1.x	David Robillard	3	-12/+50

2023-12-02	[WIP] Simplify block writing interface and improve error handling	David Robillard	3	-90/+95

2023-12-02	[WIP] Improve read_IRIREF_suffix() performance	David Robillard	1	-38/+23

2023-12-02	Move read_IRIREF to read_ntriples.c	David Robillard	3	-20/+31
	Bit of a weird situation this one, since the rule is from NTriples but serd doesn't actually use it in the NTriples parser (since it uses a stricter one instead). Still, I think it makes sense here, and in practice, makes for a more sensible inlining situation.
2023-12-02	[WIP] Tighten up reader node management	David Robillard	15	-290/+361
	[WIP] Broken on 32-bit This makes the reader stack manipulations stricter, to make the code more regular and avoid redundant work and bad cache activity. Now, functions that push node headers and their bodies are responsible for (more or less) immediately pushing any trailing null bytes required for termination and alignment. This makes the writes to the node in the stack more local, ensures nodes are terminated as early as possible (to reduce the risk of using non-terminated strings), and avoids the need to calculate aligned stack allocations.
2023-12-02	Factor out serd_byte_source_skip_bom()	David Robillard	3	-18/+22

2023-12-02	[WIP] Add support for URI hex escape decoding	David Robillard	2	-127/+290

2023-12-02	Add serd_env_unset_prefix()	David Robillard	1	-0/+37

2023-12-02	Use ZixAllocator directly	David Robillard	29	-305/+189

2023-12-02	Use ZixStringView directly	David Robillard	13	-135/+135

2023-12-02	Use SerdNodes instead of manual memory management in tests	David Robillard	1	-0/+42

2023-12-02	Add "contextual" output option	David Robillard	1	-0/+4
	This is mainly for developer or power-user cases, where one wants to look at some data for investigation or debugging. In such cases, it's common for the set of prefixes to be implicitly known (because they are baked in to the application, for example), so printing them just produces a large amount of redundant noise. That said, it can also be useful programmatically, because it allows several snippets to be written independently and ultimately concatenated (with a header to define the prefixes) without redundancy.
2023-12-02	Add SERD_READ_ORDERED to preserve blank node ordering in models	David Robillard	1	-5/+11

2023-12-02	Add model and serd-sort utility	David Robillard	10	-1/+1793
	With all the new functionality, the complexity of the serd-pipe command-line interface is starting to push the limits of available flags. So, instead of grafting on further options to control a model, this commit adds a new tool, serd-sort, which acts somewhat like a stripped-down serd-pipe that stores statements in a model in memory. This keeps the complexity (including the user-facing complexity) of any one tool down, since other more focused tools can be used for streaming tasks in a pipeline. In other words, abandon Swissarmyknifeism, take a page from the Unix philosophy, and try to expose the model functionality to the command-line in a dedicated focused tool. The model implementation is tested by using this tool to run a subset of the usual test suites, and a special suite to test statement sorting.
2023-12-02	Add statement filter sink and serd-filter tool	David Robillard	2	-0/+129

2023-12-02	Add "longhand" mode to write rdf:type predicates normally	David Robillard	1	-1/+1

2023-12-02	Add serd_node_from_syntax() and serd_node_to_syntax()	David Robillard	3	-0/+206

2023-12-02	Use SerdNodes in Env implementation	David Robillard	1	-75/+80

2023-12-02	Avoid dynamic allocation of world blank node	David Robillard	2	-12/+15

2023-12-02	Reuse the same rdf:type node for all "a" abbreviations	David Robillard	3	-3/+5
	This reduces the stack space requirements for every type statement, at the cost of adding a bit of constant bloat for the node in documents that do not contain it.
2023-12-02	Cache commonly used nodes in the world	David Robillard	3	-22/+61

2023-12-02	Add SerdNodes for storing a cache of interned nodes	David Robillard	4	-1/+603

2023-12-02	Make blank node prefixing automatic	David Robillard	5	-62/+23
	Though potentially useful, I don't think the complexity cost of the old interface (both to the implementation and to the user) is worth it. A special tool to transform blank node labels (for example with regular expressions) would be a better approach to this if it's ever needed in the future.
2023-12-02	Split SERD_READ_VERBATIM into two more precise flags	David Robillard	1	-1/+1
	Although the "verbatim" idea is nice and simple, more fine-grained control is necessary since these features (relative URI preservation and blank node label clash avoidance) are useful in different situations.
2023-12-02	Add a reader flag to disable generated blank label avoidance	David Robillard	1	-10/+5

2023-12-02	Expose serd_strncasecmp in public API	David Robillard	5	-17/+21

2023-12-02	[WIP] Add support for converting literals to canonical form	David Robillard	2	-4/+224

2023-12-02	[WIP] Generalize node construction API	David Robillard	10	-461/+788

2023-12-02	Add support for custom allocators	David Robillard	24	-320/+611
	This makes it explicit in the API where memory is allocated, and allows the user to provide a custom allocator to avoid the use of the default system allocator for whatever reason.
2023-12-02	[WIP] Simplify reader and writer flags	David Robillard	3	-6/+7
	[WIP] Testing?
2023-12-02	[WIP] Remove serd_new_resolved_uri()	David Robillard	3	-63/+12

2023-12-02	[WIP] Remove SERD_CURIE node datatype entirely	David Robillard	6	-127/+58

2023-12-02	[WIP] Expand URIs in reader	David Robillard	6	-24/+147
	This expands relative and prefixed URIs in the reader on the stack, rather than passing them to the caller to be dealt with. This pushes these context-full forms to the edge of the system as much as possible to minimise the headaches they can cause. Towards having stricter guarantees about nodes and eliminating the CURIE node type altogether.
2023-12-02	[WIP] Move SerdEnv mutation from writer to reader	David Robillard	4	-13/+20
	Writing having side-effects seems questionable in general, and this prepares things for expanding URIs in the reader.
2023-12-02	[WIP] Preserve long or short quoting from input documents	David Robillard	7	-154/+87

2023-12-02	Leave statement caret at the start of literals	David Robillard	3	-5/+24
	This allows a precise location to be reported for errors within literals, by adding the offset of the error in the literal to the caret. This will be used to report nice errors for things like regular expressions and supported XSD datatypes.
2023-12-02	[WIP] Add extensible logging API	David Robillard	9	-96/+392

2023-12-02	[WIP] Preserve anonymous graph syntax in TriG	David Robillard	2	-1/+3
	[WIP] Untested
2023-12-02	[WIP] Add support for reading and writing variables	David Robillard	7	-4/+86
	[WIP] Command line option, move later? This adds a reader flag and serdi option for extending a syntax with support for SPARQL-like variables, for storing things like patterns or simple queries.
2023-12-02	[TESTED] Replace -b and -e options with a block size option	David Robillard	1	-17/+0
	This is more powerful, and reduces the number of command line options that almost nobody needs to care about.
2023-12-02	Move serdi to tools subdirectory	David Robillard	4	-343/+0
	This separates the command-line tool code from the library implementation.
2023-12-02	Factor out command line version printing	David Robillard	3	-19/+24

2023-12-02	Factor out Windows UTF-8 console setup code	David Robillard	3	-12/+33

2023-12-02	Remove datatype arguments from integer and base64 constructors	David Robillard	1	-9/+11

2023-12-02	Simplify output stream API	David Robillard	7	-107/+241

2023-12-02	Simplify input stream API	David Robillard	12	-229/+272

2023-12-02	Rename SerdErrorFunc to SerdLogFunc	David Robillard	3	-9/+9

2023-12-02	Reduce complexity of URI parsing code	David Robillard	2	-101/+97

2023-12-02	Make serd_writer_set_root_uri() take a string view	David Robillard	2	-6/+4
	This is generally more convenient, and the node was just being copied anyway.
2023-12-02	Make SerdBuffer an output stream	David Robillard	4	-34/+47