Age | Commit message (Collapse) | Author | Files | Lines | |
---|---|---|---|---|---|
2023-12-02 | [WIP] Partially resurrect CURIE nodes1.x | David Robillard | 7 | -20/+93 | |
2023-12-02 | [WIP] Simplify block writing interface and improve error handling | David Robillard | 3 | -90/+95 | |
2023-12-02 | [WIP] Improve read_IRIREF_suffix() performance | David Robillard | 1 | -38/+23 | |
2023-12-02 | [WIP] Add C++ bindings | David Robillard | 32 | -14/+5411 | |
2023-12-02 | Move read_IRIREF to read_ntriples.c | David Robillard | 3 | -20/+31 | |
Bit of a weird situation this one, since the rule is from NTriples but serd doesn't actually use it in the NTriples parser (since it uses a stricter one instead). Still, I think it makes sense here, and in practice, makes for a more sensible inlining situation. | |||||
2023-12-02 | [WIP] Tighten up reader node management | David Robillard | 15 | -290/+361 | |
[WIP] Broken on 32-bit This makes the reader stack manipulations stricter, to make the code more regular and avoid redundant work and bad cache activity. Now, functions that push node headers and their bodies are responsible for (more or less) immediately pushing any trailing null bytes required for termination and alignment. This makes the writes to the node in the stack more local, ensures nodes are terminated as early as possible (to reduce the risk of using non-terminated strings), and avoids the need to calculate aligned stack allocations. | |||||
2023-12-02 | Add high-level documentation | David Robillard | 31 | -38/+2765 | |
2023-12-02 | Factor out serd_byte_source_skip_bom() | David Robillard | 3 | -18/+22 | |
2023-12-02 | [WIP] Add support for URI hex escape decoding | David Robillard | 46 | -149/+555 | |
2023-12-02 | Add serd_env_unset_prefix() | David Robillard | 3 | -0/+110 | |
2023-12-02 | Use ZixAllocator directly | David Robillard | 62 | -619/+373 | |
2023-12-02 | Use ZixStringView directly | David Robillard | 41 | -517/+451 | |
2023-12-02 | Use SerdNodes instead of manual memory management in tests | David Robillard | 11 | -123/+204 | |
2023-12-02 | Add "contextual" output option | David Robillard | 11 | -3/+88 | |
This is mainly for developer or power-user cases, where one wants to look at some data for investigation or debugging. In such cases, it's common for the set of prefixes to be implicitly known (because they are baked in to the application, for example), so printing them just produces a large amount of redundant noise. That said, it can also be useful programmatically, because it allows several snippets to be written independently and ultimately concatenated (with a header to define the prefixes) without redundancy. | |||||
2023-12-02 | Add SERD_READ_ORDERED to preserve blank node ordering in models | David Robillard | 9 | -19/+55 | |
2023-12-02 | Add model and serd-sort utility | David Robillard | 53 | -43/+4886 | |
With all the new functionality, the complexity of the serd-pipe command-line interface is starting to push the limits of available flags. So, instead of grafting on further options to control a model, this commit adds a new tool, serd-sort, which acts somewhat like a stripped-down serd-pipe that stores statements in a model in memory. This keeps the complexity (including the user-facing complexity) of any one tool down, since other more focused tools can be used for streaming tasks in a pipeline. In other words, abandon Swissarmyknifeism, take a page from the Unix philosophy, and try to expose the model functionality to the command-line in a dedicated focused tool. The model implementation is tested by using this tool to run a subset of the usual test suites, and a special suite to test statement sorting. | |||||
2023-12-02 | Add statement filter sink and serd-filter tool | David Robillard | 29 | -18/+1148 | |
2023-12-02 | Add "longhand" mode to write rdf:type predicates normally | David Robillard | 12 | -1/+75 | |
2023-12-02 | Factor out and rewrite command-line interface | David Robillard | 12 | -372/+679 | |
2023-12-02 | Add serd_node_from_syntax() and serd_node_to_syntax() | David Robillard | 9 | -2/+518 | |
2023-12-02 | Use SerdNodes in Env implementation | David Robillard | 2 | -77/+82 | |
2023-12-02 | Avoid dynamic allocation of world blank node | David Robillard | 2 | -12/+15 | |
2023-12-02 | Use SerdNodes to simplify some tests | David Robillard | 6 | -178/+286 | |
2023-12-02 | Reuse the same rdf:type node for all "a" abbreviations | David Robillard | 4 | -4/+6 | |
This reduces the stack space requirements for every type statement, at the cost of adding a bit of constant bloat for the node in documents that do not contain it. | |||||
2023-12-02 | Cache commonly used nodes in the world | David Robillard | 5 | -24/+73 | |
2023-12-02 | Add SerdNodes for storing a cache of interned nodes | David Robillard | 16 | -70/+1395 | |
2023-12-02 | Make blank node prefixing automatic | David Robillard | 17 | -227/+30 | |
Though potentially useful, I don't think the complexity cost of the old interface (both to the implementation and to the user) is worth it. A special tool to transform blank node labels (for example with regular expressions) would be a better approach to this if it's ever needed in the future. | |||||
2023-12-02 | Split SERD_READ_VERBATIM into two more precise flags | David Robillard | 5 | -21/+40 | |
Although the "verbatim" idea is nice and simple, more fine-grained control is necessary since these features (relative URI preservation and blank node label clash avoidance) are useful in different situations. | |||||
2023-12-02 | Add a reader flag to disable generated blank label avoidance | David Robillard | 5 | -10/+30 | |
2023-12-02 | Collapse input and output options into single flags | David Robillard | 10 | -113/+215 | |
2023-12-02 | Expose serd_strncasecmp in public API | David Robillard | 6 | -24/+27 | |
2023-12-02 | [WIP] Add support for converting literals to canonical form | David Robillard | 20 | -13/+657 | |
2023-12-02 | [WIP] Generalize node construction API | David Robillard | 31 | -924/+1741 | |
2023-12-02 | Add support for custom allocators | David Robillard | 55 | -819/+1823 | |
This makes it explicit in the API where memory is allocated, and allows the user to provide a custom allocator to avoid the use of the default system allocator for whatever reason. | |||||
2023-12-02 | Test handling of deferred write errors that happen when closing | David Robillard | 2 | -5/+18 | |
2023-12-02 | Add command-line option to write output to a file | David Robillard | 5 | -22/+75 | |
2023-12-02 | [WIP] Simplify reader and writer flags | David Robillard | 6 | -16/+89 | |
[WIP] Testing? | |||||
2023-12-02 | [WIP] Remove serd_new_resolved_uri() | David Robillard | 3 | -63/+12 | |
2023-12-02 | [WIP] Remove SERD_CURIE node datatype entirely | David Robillard | 13 | -220/+119 | |
2023-12-02 | [WIP] Expand URIs in reader | David Robillard | 18 | -35/+282 | |
This expands relative and prefixed URIs in the reader on the stack, rather than passing them to the caller to be dealt with. This pushes these context-full forms to the edge of the system as much as possible to minimise the headaches they can cause. Towards having stricter guarantees about nodes and eliminating the CURIE node type altogether. | |||||
2023-12-02 | [WIP] Move SerdEnv mutation from writer to reader | David Robillard | 11 | -66/+76 | |
Writing having side-effects seems questionable in general, and this prepares things for expanding URIs in the reader. | |||||
2023-12-02 | [WIP] Preserve long or short quoting from input documents | David Robillard | 34 | -290/+239 | |
2023-12-02 | Leave statement caret at the start of literals | David Robillard | 4 | -5/+80 | |
This allows a precise location to be reported for errors within literals, by adding the offset of the error in the literal to the caret. This will be used to report nice errors for things like regular expressions and supported XSD datatypes. | |||||
2023-12-02 | [WIP] Add extensible logging API | David Robillard | 22 | -173/+798 | |
2023-12-02 | [WIP] Preserve anonymous graph syntax in TriG | David Robillard | 4 | -7/+11 | |
[WIP] Untested | |||||
2023-12-02 | [WIP] Add support for reading and writing variables | David Robillard | 30 | -23/+271 | |
[WIP] Command line option, move later? This adds a reader flag and serdi option for extending a syntax with support for SPARQL-like variables, for storing things like patterns or simple queries. | |||||
2023-12-02 | [TESTED] Replace -b and -e options with a block size option | David Robillard | 5 | -39/+34 | |
This is more powerful, and reduces the number of command line options that almost nobody needs to care about. | |||||
2023-12-02 | Factor out serd_choose_syntax() | David Robillard | 3 | -5/+25 | |
2023-12-02 | Factor out opening output files | David Robillard | 3 | -7/+23 | |
2023-12-02 | Factor out setting base URI from a path | David Robillard | 3 | -11/+27 | |