Age | Commit message (Collapse) | Author | Files | Lines | |
---|---|---|---|---|---|
2023-12-02 | Add model and serd-sort utility | David Robillard | 10 | -1/+1793 | |
With all the new functionality, the complexity of the serd-pipe command-line interface is starting to push the limits of available flags. So, instead of grafting on further options to control a model, this commit adds a new tool, serd-sort, which acts somewhat like a stripped-down serd-pipe that stores statements in a model in memory. This keeps the complexity (including the user-facing complexity) of any one tool down, since other more focused tools can be used for streaming tasks in a pipeline. In other words, abandon Swissarmyknifeism, take a page from the Unix philosophy, and try to expose the model functionality to the command-line in a dedicated focused tool. The model implementation is tested by using this tool to run a subset of the usual test suites, and a special suite to test statement sorting. | |||||
2023-12-02 | Add statement filter sink and serd-filter tool | David Robillard | 2 | -0/+129 | |
2023-12-02 | Add "longhand" mode to write rdf:type predicates normally | David Robillard | 1 | -1/+1 | |
2023-12-02 | Add serd_node_from_syntax() and serd_node_to_syntax() | David Robillard | 3 | -0/+206 | |
2023-12-02 | Use SerdNodes in Env implementation | David Robillard | 1 | -75/+80 | |
2023-12-02 | Avoid dynamic allocation of world blank node | David Robillard | 2 | -12/+15 | |
2023-12-02 | Reuse the same rdf:type node for all "a" abbreviations | David Robillard | 3 | -3/+5 | |
This reduces the stack space requirements for every type statement, at the cost of adding a bit of constant bloat for the node in documents that do not contain it. | |||||
2023-12-02 | Cache commonly used nodes in the world | David Robillard | 3 | -22/+61 | |
2023-12-02 | Add SerdNodes for storing a cache of interned nodes | David Robillard | 4 | -1/+603 | |
2023-12-02 | Make blank node prefixing automatic | David Robillard | 5 | -62/+23 | |
Though potentially useful, I don't think the complexity cost of the old interface (both to the implementation and to the user) is worth it. A special tool to transform blank node labels (for example with regular expressions) would be a better approach to this if it's ever needed in the future. | |||||
2023-12-02 | Split SERD_READ_VERBATIM into two more precise flags | David Robillard | 1 | -1/+1 | |
Although the "verbatim" idea is nice and simple, more fine-grained control is necessary since these features (relative URI preservation and blank node label clash avoidance) are useful in different situations. | |||||
2023-12-02 | Add a reader flag to disable generated blank label avoidance | David Robillard | 1 | -10/+5 | |
2023-12-02 | Expose serd_strncasecmp in public API | David Robillard | 5 | -17/+21 | |
2023-12-02 | [WIP] Add support for converting literals to canonical form | David Robillard | 2 | -4/+224 | |
2023-12-02 | [WIP] Generalize node construction API | David Robillard | 10 | -461/+788 | |
2023-12-02 | Add support for custom allocators | David Robillard | 24 | -320/+611 | |
This makes it explicit in the API where memory is allocated, and allows the user to provide a custom allocator to avoid the use of the default system allocator for whatever reason. | |||||
2023-12-02 | [WIP] Simplify reader and writer flags | David Robillard | 3 | -6/+7 | |
[WIP] Testing? | |||||
2023-12-02 | [WIP] Remove serd_new_resolved_uri() | David Robillard | 3 | -63/+12 | |
2023-12-02 | [WIP] Remove SERD_CURIE node datatype entirely | David Robillard | 6 | -127/+58 | |
2023-12-02 | [WIP] Expand URIs in reader | David Robillard | 6 | -24/+147 | |
This expands relative and prefixed URIs in the reader on the stack, rather than passing them to the caller to be dealt with. This pushes these context-full forms to the edge of the system as much as possible to minimise the headaches they can cause. Towards having stricter guarantees about nodes and eliminating the CURIE node type altogether. | |||||
2023-12-02 | [WIP] Move SerdEnv mutation from writer to reader | David Robillard | 4 | -13/+20 | |
Writing having side-effects seems questionable in general, and this prepares things for expanding URIs in the reader. | |||||
2023-12-02 | [WIP] Preserve long or short quoting from input documents | David Robillard | 7 | -154/+87 | |
2023-12-02 | Leave statement caret at the start of literals | David Robillard | 3 | -5/+24 | |
This allows a precise location to be reported for errors within literals, by adding the offset of the error in the literal to the caret. This will be used to report nice errors for things like regular expressions and supported XSD datatypes. | |||||
2023-12-02 | [WIP] Add extensible logging API | David Robillard | 9 | -96/+392 | |
2023-12-02 | [WIP] Preserve anonymous graph syntax in TriG | David Robillard | 2 | -1/+3 | |
[WIP] Untested | |||||
2023-12-02 | [WIP] Add support for reading and writing variables | David Robillard | 7 | -4/+86 | |
[WIP] Command line option, move later? This adds a reader flag and serdi option for extending a syntax with support for SPARQL-like variables, for storing things like patterns or simple queries. | |||||
2023-12-02 | [TESTED] Replace -b and -e options with a block size option | David Robillard | 1 | -17/+0 | |
This is more powerful, and reduces the number of command line options that almost nobody needs to care about. | |||||
2023-12-02 | Move serdi to tools subdirectory | David Robillard | 4 | -343/+0 | |
This separates the command-line tool code from the library implementation. | |||||
2023-12-02 | Factor out command line version printing | David Robillard | 3 | -19/+24 | |
2023-12-02 | Factor out Windows UTF-8 console setup code | David Robillard | 3 | -12/+33 | |
2023-12-02 | Remove datatype arguments from integer and base64 constructors | David Robillard | 1 | -9/+11 | |
2023-12-02 | Simplify output stream API | David Robillard | 7 | -107/+241 | |
2023-12-02 | Simplify input stream API | David Robillard | 12 | -229/+272 | |
2023-12-02 | Rename SerdErrorFunc to SerdLogFunc | David Robillard | 3 | -9/+9 | |
2023-12-02 | Reduce complexity of URI parsing code | David Robillard | 2 | -101/+97 | |
2023-12-02 | Make serd_writer_set_root_uri() take a string view | David Robillard | 2 | -6/+4 | |
This is generally more convenient, and the node was just being copied anyway. | |||||
2023-12-02 | Make SerdBuffer an output stream | David Robillard | 4 | -34/+47 | |
2023-12-02 | Factor out and expose serd_write_file_uri() | David Robillard | 2 | -82/+89 | |
2023-12-02 | Strengthen handling of corrupt UTF-8 input | David Robillard | 3 | -15/+24 | |
2023-12-02 | Improve URI read performance | David Robillard | 1 | -15/+12 | |
2023-12-02 | Use tighter types for UTF-8 | David Robillard | 3 | -17/+19 | |
2023-12-02 | Support writing all escapes in Turtle and TriG prefixed names | David Robillard | 3 | -74/+74 | |
2023-12-02 | Simplify reading code with TRY_LAX macro | David Robillard | 1 | -14/+14 | |
2023-12-02 | Separate Turtle and TriG reading code | David Robillard | 6 | -150/+400 | |
2023-12-02 | Reduce complexity of Turtle and TriG block reading functions | David Robillard | 1 | -87/+108 | |
2023-12-02 | Reduce complexity of Turtle object reading code | David Robillard | 1 | -41/+66 | |
2023-12-02 | Factor out and expose prefixed name predicates | David Robillard | 5 | -63/+63 | |
Towards using these in the writer to escape names more precisely. | |||||
2023-12-02 | Factor out NQuads reader | David Robillard | 7 | -113/+167 | |
2023-12-02 | Factor out NTriples reader | David Robillard | 6 | -521/+1009 | |
2023-12-02 | Factor out UTF-8 reading utilities | David Robillard | 3 | -85/+133 | |