RDF
Features
- "has the aesthetics of steampunk, of some technology for some futuristic Semantic Web1 in a toolset that feels about as lightweight and modern as iron dreadnought."
Properties
- Properties are used for specifying information about entities like :Polar_Bear or :Knut.
- Datatype properties like :birthDate or :population relate to literal values like strings, numbers or dates
- Object properties like conservation_status relate to other entities
Objects
- can be either
- literal values
- individuals? entity objects? need a better name here
Blank Nodes
- Blank (or anonymous) nodes are RDF resources with identifiers that exist only locally. In other words, their identifiers are not URLs. They are sometimes also called anonymous nodes. They make life easier for data producers, who can easily create (nested) resources without having to mint all the URLs. In most non-RDF data models, blank nodes are the default. For example, we nest JSON object without thinking twice.
- @chatgpt.4: "While you can avoid using blank nodes, it often results in more verbose RDF graphs and might require additional effort to manage URIs for entities that do not need global identification. If your use case involves a lot of ephemeral, intermediary, or context-specific data, blank nodes can be very useful. However, for datasets that require high interoperability and ease of querying, minimizing the use of blank nodes and favoring URIs might be beneficial."
Namespaces
Namespace Prefixes
-
"The XML Namespaces specification (Namespaces in XML 1.0) states that a prefix must be a valid XML name, which means it must conform to the following constraints:
-
It must start with a letter or underscore (_).
-
It cannot start with a number.
-
Subsequent characters can be letters, digits, hyphens (-), underscores (_), or periods (.)."
Datasets
-
An RDF dataset is a collection of RDF graphs, and comprises:
- Exactly one default graph, being an RDF graph. The default graph does not have a name and MAY be empty.
- Zero or more named graphs. Each named graph is a pair consisting of an IRI or a blank node (the graph name), and an RDF graph. Graph names are unique within an RDF dataset.
-
"We informally use the term RDF source to refer to a persistent yet mutable source or container of RDF graphs."
Dataset Thoughts
- [-][p.summarized]] RDF datasets
- the concept of the "unnamed default graph"/master-graph is interesting;
Typed Literals
- RDF does not place any limits on what datatypes might be associated with a literal. The model recommends using a standard type library, like XML Schema, wherever possible but the only real constraint is that datatypes should be identified by a URI. Creating a new URI for an application or domain specific datatype allows the type system to be easily extended. The datatype URI can be annotated with additional RDF statements, e.g. a label or description, to describe its usage.
- The key limitation to custom datatypes is that SPARQL query engines will not understand how to compare values of that type. Some SPARQL query processors may provide support for understanding the range of types it understands.
- Note that a typed literal with a custom datatype can also be modelled as a sub-property.
- Advantages
- Simpler querying of data by allowing the use of triple patterns, rather than FILTERs for extracting data of a common type.
- More explicit, fine-grained semantics
- Disadvantages
- Increases the number of very similar properties, requiring redundancy of data or reasoning to allow applications to treat them generically
- Doesn't address the core requirement to indicate the lexical form of a structured value
- Advantages
- Recommendation:
- Use a custom datatype to label particular types of structured value that share a common lexical form. These values may be associated with a broad range of different properties. Processing applications may want to implement a common set of type conversions or display options for the values.
- Use a sub-property in all other cases
- Note that these options aren't always mutually exclusive. It might be useful in some scenarios to have an explicit property for associating a general class of code, identifier, or other Literal Key with a resource, but also assign a specific datatype to the identifier as a cue for applications and reasoners
Highlights
- the property "a" is used to indicate the "type" or "class" of a resource. It is a shorthand notation for the "rdf:type" predicate.
Issues
- "RDF is a shitty data model. It doesn’t have native support for lists."
n-ary relations
-
aka Relationship to Entity Conversion
- although you can also express multiple "arguments" as a list, see http://www.ninebynine.org/RDFNotes/RDFFactsAndRules.html#xtocid-75600719
-
sure you can use a class per relation, but...
-
"Considerations when introducing a new class for a relation
-
In our example, we did not give meaningful names to instances of properties or to the classes used to represent instances of n-ary relations, but merely label them _:Temperature_Observation_1, Purchase_1, etc. In most cases, these individuals do not stand on their own but merely function as auxiliaries to group together other objects. Hence a distinguishing name serves no purpose. Note that a similar approach is taken when reifying statements in RDF.
-
Creating a class to represent an n-ary relation limits the use of many OWL constructs and creates a maintenance problem. The problem arises when we want to have local range or cardinality restrictions on some role in the n-ary relation that depend on the class of some other role. For example, we might want to say that we buy only instances of a class Book from companies in the category Bookseller (cf. use case 3). Expressing this constraint requires a special subclass of the n-ary relation class that represents the combination of restrictions. For instance, we will have to create a class Book_Purchase with the corresponding range restrictions for the property seller (allValuesFrom Bookseller) and object (allValuesFrom Book). We end up having to build an explicit lattice of classes to represent all the possible combinations.
- OWL allows definition of inverse properties. Defining inverse properties with n-ary relations, using any of the patterns above, requires more work than with binary relations. In order to specify inverse properties for n-ary relations, we must specify an inverse for each of the properties participating in the n-ary relation (with the proper constraints). Consider the example of John buying the Lenny_The_Lion book. We may want to have an instance of an inverse relation pointing from the Lenny_The_Lion book to the person who bought it. If we had a simple binary relation John buys Lenny_The_Lion, defining an inverse is simple: we simply define a property is_bought_by as an inverse of buys:
:is_bought_by a owl:ObjectProperty ; owl:inverseOf :buys .
With the purchase relation represented as an instance, however, we need to add inverse relations between participants in the relation and the instance relation itself:"
-
-
t.2024.06.10.22 "Moreover, the use of cardinality re- strictions becomes limiting on some roles that depend on the class of some other roles"
Ordering
Other Issues
- contrary to some claims, RDF and OWL are not fluent, at least compared to GraphDown
- calls verbs properties / predicates
- confuses properties and predicates
- can't easily express intransitive relations (properties)
- Resources That Refer Vs Resources That Are
- aka "indicator vs subject"
- punning: also a solution
- @chatgpt.4:
- Class-Individual Punning: This is where the same URI is used to denote a class in one context and an individual in another. For example, you might have a class
:Person
and also use:Person
as an individual to represent the generic concept of a person. In RDF and OWL, this is legal because classes in OWL are also individuals in the sense of RDF.- t.2024.05.24.11 see Class Vs Individual
- Class-Property Punning: Similarly, the same URI can be used to denote a class in one context and a property in another. This can be useful for modeling certain kinds of relationships in a more flexible way.
- Class-Individual Punning: This is where the same URI is used to denote a class in one context and an individual in another. For example, you might have a class
- @chatgpt.4:
- Class vs Individual
temporal problems
- see A Temporal Rdf Model for Multi Grained Time Information Modeling
- https://blog.iandavis.com/2009/08/representing-time-in-rdf-part-1/
- Temporal RDF introduces a fourth time component to the triple. "I chose not to cover this approach in a lot of detail because it extends the RDF model in a way that no current triple store implements"
Ordering
There are no arrays in RDF, and don't use the order in which serialized triples appear.
RDF Containers:
- Come in three forms: rdf:Seq (ordered), rdf:Bag (unordered), rdf:Alt (alternatives with default)
- You can add new items by simply adding RDF triples
- Inserting items is hard: requires rewriting many statements
- Must be stored in a single graph / machine / server (centralized)
- Have a formally unknown ending (open world assumption)
RDF Collections:
- An ordered chain of rdf:List resources
- You have to edit / remove statements before you can add new items
- Inserting items is easy: requires changing just a few statements
- Can span many graphs / machines / servers (decentralized)
- Have a known ending (the rdf:nil)
Pagination:
- Use ActivityStreams collections
Converting to arrays:
- Use a library, such as @rdfdev/collections (Private)
Abstract Model
1.5 RDF and Change over Time
The RDF data model is atemporal: RDF graphs are static snapshots of information.
However, RDF graphs can express information about events and about temporal aspects of other entities, given appropriate vocabulary terms.
Since RDF graphs are defined as mathematical sets, adding or removing triples from an RDF graph yields a different RDF graph.
We informally use the term RDF source to refer to a persistent yet mutable source or container of RDF graphs. An RDF source is a resource that may be said to have a state that can change over time. A snapshot of the state can be expressed as an RDF graph. For example, any web document that has an RDF-bearing representation may be considered an RDF source. Like all resources, RDF sources may be named with IRIs and therefore described in other RDF graphs.
Intuitively speaking, changes in the universe of discourse can be reflected in the following ways:
- An IRI, once minted, should never change its intended referent. (See URI persistence [WEBARCH].)
- Literals, by design, are constants and never change their value.
- A relationship that holds between two resources at one time may not hold at another time.
- RDF sources may change their state over time. That is, they may provide different RDF graphs at different times.
- Some RDF sources may, however, be immutable snapshots of another RDF source, archiving its state at some point in time.
History
can be expressed as
Solutions
Resources
- https://www.w3.org/TR/rdf11-datasets/
- begsQuestion [-] What is the difference betwee the union of the graphs and their merge (Private)? Is the distinction meaningful for Dendron?
- https://www.w3.org/TR/rdf-concepts
- https://www.w3.org/TR/rdf-schema/
- Predefined Namespace Prefixes
- https://ontola.io/blog/rdf-serialization-formats
- mentions JSON-AD
hasLearningResource
hasWishList
References
- https://patterns.dataincubator.org/book/custom-datatype.html
- https://www.w3.org/TR/swbp-n-aryRelations/
- https://cran.r-project.org/web/packages/rdflib/vignettes/rdf_intro.html
Children
Backlinks
- Decentralized Collaborative Knowledge Management Using Git
- Fuzzy Spatiotemporal Data Modeling and Operations in Rdf
- Towards Semantic Identification of Temporal Data in Rdf
- TerminusDB
- API Platform
- Atomic Data
- Dojo Toolkit
- LinkML
- Milleniumdb
- Unrival Protocol
- Class vs. Type vs. Category
- Rdf Statement Vs Assertion
- 2024-09-27: Introducing... Carpe Noctem
- 2024-09-27: Let’s Reboot the Semantic Web with Alternate Realities
- Property Graph
- Dynamic-RDF