aboutsummaryrefslogtreecommitdiffstats
AgeCommit message (Collapse)AuthorFilesLines
2023-12-02[WIP] Add C++ bindingsDavid Robillard32-14/+5411
2023-12-02Move read_IRIREF to read_ntriples.cDavid Robillard3-20/+31
Bit of a weird situation this one, since the rule is from NTriples but serd doesn't actually use it in the NTriples parser (since it uses a stricter one instead). Still, I think it makes sense here, and in practice, makes for a more sensible inlining situation.
2023-12-02[WIP] Tighten up reader node managementDavid Robillard15-290/+361
[WIP] Broken on 32-bit This makes the reader stack manipulations stricter, to make the code more regular and avoid redundant work and bad cache activity. Now, functions that push node headers and their bodies are responsible for (more or less) immediately pushing any trailing null bytes required for termination and alignment. This makes the writes to the node in the stack more local, ensures nodes are terminated as early as possible (to reduce the risk of using non-terminated strings), and avoids the need to calculate aligned stack allocations.
2023-12-02Add high-level documentationDavid Robillard31-38/+2765
2023-12-02Factor out serd_byte_source_skip_bom()David Robillard3-18/+22
2023-12-02[WIP] Add support for URI hex escape decodingDavid Robillard46-149/+555
2023-12-02Add serd_env_unset_prefix()David Robillard3-0/+110
2023-12-02Use ZixAllocator directlyDavid Robillard62-619/+373
2023-12-02Use ZixStringView directlyDavid Robillard41-517/+451
2023-12-02Use SerdNodes instead of manual memory management in testsDavid Robillard11-123/+204
2023-12-02Add "contextual" output optionDavid Robillard11-3/+88
This is mainly for developer or power-user cases, where one wants to look at some data for investigation or debugging. In such cases, it's common for the set of prefixes to be implicitly known (because they are baked in to the application, for example), so printing them just produces a large amount of redundant noise. That said, it can also be useful programmatically, because it allows several snippets to be written independently and ultimately concatenated (with a header to define the prefixes) without redundancy.
2023-12-02Add SERD_READ_ORDERED to preserve blank node ordering in modelsDavid Robillard9-19/+55
2023-12-02Add model and serd-sort utilityDavid Robillard53-43/+4886
With all the new functionality, the complexity of the serd-pipe command-line interface is starting to push the limits of available flags. So, instead of grafting on further options to control a model, this commit adds a new tool, serd-sort, which acts somewhat like a stripped-down serd-pipe that stores statements in a model in memory. This keeps the complexity (including the user-facing complexity) of any one tool down, since other more focused tools can be used for streaming tasks in a pipeline. In other words, abandon Swissarmyknifeism, take a page from the Unix philosophy, and try to expose the model functionality to the command-line in a dedicated focused tool. The model implementation is tested by using this tool to run a subset of the usual test suites, and a special suite to test statement sorting.
2023-12-02Add statement filter sink and serd-filter toolDavid Robillard29-18/+1148
2023-12-02Add "longhand" mode to write rdf:type predicates normallyDavid Robillard12-1/+75
2023-12-02Factor out and rewrite command-line interfaceDavid Robillard12-372/+679
2023-12-02Add serd_node_from_syntax() and serd_node_to_syntax()David Robillard9-2/+518
2023-12-02Use SerdNodes in Env implementationDavid Robillard2-77/+82
2023-12-02Avoid dynamic allocation of world blank nodeDavid Robillard2-12/+15
2023-12-02Use SerdNodes to simplify some testsDavid Robillard6-178/+286
2023-12-02Reuse the same rdf:type node for all "a" abbreviationsDavid Robillard4-4/+6
This reduces the stack space requirements for every type statement, at the cost of adding a bit of constant bloat for the node in documents that do not contain it.
2023-12-02Cache commonly used nodes in the worldDavid Robillard5-24/+73
2023-12-02Add SerdNodes for storing a cache of interned nodesDavid Robillard16-70/+1395
2023-12-02Make blank node prefixing automaticDavid Robillard17-227/+30
Though potentially useful, I don't think the complexity cost of the old interface (both to the implementation and to the user) is worth it. A special tool to transform blank node labels (for example with regular expressions) would be a better approach to this if it's ever needed in the future.
2023-12-02Split SERD_READ_VERBATIM into two more precise flagsDavid Robillard5-21/+40
Although the "verbatim" idea is nice and simple, more fine-grained control is necessary since these features (relative URI preservation and blank node label clash avoidance) are useful in different situations.
2023-12-02Add a reader flag to disable generated blank label avoidanceDavid Robillard5-10/+30
2023-12-02Collapse input and output options into single flagsDavid Robillard10-113/+215
2023-12-02Expose serd_strncasecmp in public APIDavid Robillard6-24/+27
2023-12-02[WIP] Add support for converting literals to canonical formDavid Robillard20-13/+657
2023-12-02[WIP] Generalize node construction APIDavid Robillard31-924/+1741
2023-12-02Add support for custom allocatorsDavid Robillard55-819/+1823
This makes it explicit in the API where memory is allocated, and allows the user to provide a custom allocator to avoid the use of the default system allocator for whatever reason.
2023-12-02Test handling of deferred write errors that happen when closingDavid Robillard2-5/+18
2023-12-02Add command-line option to write output to a fileDavid Robillard5-22/+75
2023-12-02[WIP] Simplify reader and writer flagsDavid Robillard6-16/+89
[WIP] Testing?
2023-12-02[WIP] Remove serd_new_resolved_uri()David Robillard3-63/+12
2023-12-02[WIP] Remove SERD_CURIE node datatype entirelyDavid Robillard13-220/+119
2023-12-02[WIP] Expand URIs in readerDavid Robillard18-35/+282
This expands relative and prefixed URIs in the reader on the stack, rather than passing them to the caller to be dealt with. This pushes these context-full forms to the edge of the system as much as possible to minimise the headaches they can cause. Towards having stricter guarantees about nodes and eliminating the CURIE node type altogether.
2023-12-02[WIP] Move SerdEnv mutation from writer to readerDavid Robillard11-66/+76
Writing having side-effects seems questionable in general, and this prepares things for expanding URIs in the reader.
2023-12-02[WIP] Preserve long or short quoting from input documentsDavid Robillard34-290/+239
2023-12-02Leave statement caret at the start of literalsDavid Robillard4-5/+80
This allows a precise location to be reported for errors within literals, by adding the offset of the error in the literal to the caret. This will be used to report nice errors for things like regular expressions and supported XSD datatypes.
2023-12-02[WIP] Add extensible logging APIDavid Robillard22-173/+798
2023-12-02[WIP] Preserve anonymous graph syntax in TriGDavid Robillard4-7/+11
[WIP] Untested
2023-12-02[WIP] Add support for reading and writing variablesDavid Robillard30-23/+271
[WIP] Command line option, move later? This adds a reader flag and serdi option for extending a syntax with support for SPARQL-like variables, for storing things like patterns or simple queries.
2023-12-02[TESTED] Replace -b and -e options with a block size optionDavid Robillard5-39/+34
This is more powerful, and reduces the number of command line options that almost nobody needs to care about.
2023-12-02Factor out serd_choose_syntax()David Robillard3-5/+25
2023-12-02Factor out opening output filesDavid Robillard3-7/+23
2023-12-02Factor out setting base URI from a pathDavid Robillard3-11/+27
2023-12-02Add support for reading multiple files at onceDavid Robillard11-44/+222
2023-12-02Simplify handling of input string argumentsDavid Robillard3-13/+18
2023-12-02Add base URI command-line optionDavid Robillard7-20/+71