Refining RDFTerm-equals #187

afs · 2025-01-24T10:53:58Z

#185 shows that RDFTerm-equals needs some attention.
(#25 is the issue for renaming this. Trying it out here.)

In the definition of RDFTerm-equals says:

The function is defined as follows:

Returns TRUE if term1 and term2 are equal RDF terms, as defined below.

Produces a type error if term1 and term2 are both literals having the same datatype IRI; this datatype IRI is not in the set of recognized datatype IRIs; and the lexical forms of the two literals are different from one another.

Returns FALSE otherwise.

Bullet 2 applies if the terms have the same dataype IRI. What if there are two diffrerent but related datatype IRIs?

"recognized" is about the data generally, and so could apply elsewhere such a comparision (if it matters), not just this function.
Currently, there is a proposal to remove it from RDF Concepts because it is also defined by RDF Semantics.

We would be better having a SPARQL-focused definition we can limit to this function (e.g. a system may be able to determine equality in some, but not all, cases).

Example 1

sameValue("0.13"^^xsd:precisionDecimal, "0.13"^^xsd:decimal)

Not dispatched by the operator mapping table.
Different terms.
Not the same datatype IRI.

Returns false.
But they are the same value - and only error allows extension.

(xsd:precisionDecimal is not derived from xsd:decimal nor the other way round)

Example 2

sameValue("IV"^^my:romanNumeral, "4"^^xsd:decimal)

Not dispatched ("IV"^^my:romanNumeral is not mentioned in Operand Data Types
Different terms.
Different datatypes.

Returns false.

my:romanNumeral can't be made derived because the lexical space is not digits.

Example 3

sameValue("2025-01-01T00:00:00Z"^^xsd:dateTime, "2025-01-01T00:00:00Z"^^xsd:dateTimeStamp)

Returns False.

xsd:dateTimeStamp is a derived datatype of xsd:dateTime but the spec only covers numeric derived types.

Example 4

Systems involving units (different value space to numbers)

sameValue("32"^^x:meters, "3200"^^y:centimeters)

Not error so can't be an extensions. This is an erratum.

Example 5

sameValue("abc"^^xsd:string, "IV"^^my:romanNumeral)

All it takes is to know to return FALSE is that my:romanNumeral is a number or not a legal value, without

Not error so can't be an extension. This is an erratum IMO.

Example 6

sameValue("abc"^^xsd:integer, "123"^^xsd:integer)

Probably, this should be FALSE because the operator mapping does apply to = and a SPARQL processor must understand xsd:integer.
It could be error because it can't be 123. c.f.

sameValue(1 + "abc"^^xsd:integer, "123"^^xsd:integer)

which is an error before sameValue is invoked.

Proposal

Instead of adding one or more cases, I propose defining RDFTerm-equals more in the style of "by contract"
(not exact wording for now): in order:

If one or both arguments are known to be ill-typed, then error.
If the arguments are the sameTerm then return TRUE.
If the SPARQL processor can determine the values of both terms and it can determine the values are equal, then return TRUE.
If the SPARQL processor can determine the values of the terms can not be equal, the return FALSE.
Otherwise error.

The 4th case does not necessarily require values themselves because a processor may know they are different value spaces so can't be value-equal. Case 1 removes know-to-ill-typed.

Because of the order of cases 1 and 2, "sameTerm=true" does not imply "sameValue=true". They could be swapped so that is the case.

I don't think there can be a perfect solution in all cases - we are going from terms to values with incomplete knowledge in some situations.

The text was updated successfully, but these errors were encountered:

rubensworks · 2025-01-24T12:14:39Z

The note in https://www.w3.org/TR/sparql12-query/#func-RDFterm-equal starts with:

An extended implementation may support additional datatypes for literals.

Based on my interpretation, this note indicates that implementations can choose to support the examples you listed above.

Unless the point of this issue is that this note should be removed in favor of a more precise definition?

afs · 2025-01-24T18:49:26Z

Based on my interpretation, this note indicates that implementations can choose to support the examples you listed above.

Unless the point of this issue is that this note should be removed in favor of a more precise definition?

Yes, a more precise definition (better coverage of possible cases) - extensions can added where there is error. That's why some of the examples can be seen as errata - e.g. two different extension datatypes.

Leave the note, even expand it. It is explaining what is going on in more accessible language.

hartig · 2025-02-18T13:03:06Z

I am starting to look into this one now (apologies for the delay).

@afs two questions to start with:

Regarding Examples 4 and 5, I understand that these are not errors (because they are covered by the "Returns FALSE otherwise." case of the current definition of RDFterm-equal) and, thus, they cannot be an extension. But why do you say that they are an erratum? Or did you want to say that you consider it a mistake in the spec that these cases are not treated as errors (which would make it possible to define extensions if they were treated as errors)?
I think there are two aspects to the issue as a whole (ignoring the related issue about renaming RDFterm-equal): One is that the current definition of RDFterm-equal relies on the notion of "recognized datatype IRIs", which has been removed completely from RDF-Concepts now. The second aspect is your question about "what if there are two diffrerent but related datatype IRIs?" I think these two aspects are somewhat orthogonal and I would prefer we address the first aspect first; that is, we first fix the definition to account for the removal of the "recognized datatype IRIs" concept, without also addressing the second aspect at the same time. The question of different but related datatype IRIs should then be tackled afterwards. What do you think of such a separation of the issue into two sub-issues?

afs · 2025-02-18T16:37:56Z

Regarding Examples 4 and 5, I understand that these are not errors (because they are covered by the "Returns FALSE otherwise." case of the current definition of RDFterm-equal) and, thus, they cannot be an extension. But why do you say that they are an erratum? Or did you want to say that you consider it a mistake in the spec that these cases are not treated as errors (which would make it possible to define extensions if they were treated as errors)?

Maybe erratum is too definite. The vague text in query 1.1 does not give reasonable conditions for an extension (e.g. ill-typed literals can be an extension).

The first revision for SPARQL 1.2 addressed the general issues of rdfTerm-equals but required the two literals to be of the same datatype for an extension else false is required.

In ex4, a system can know that two different datatypes give the same value, or conversely are definitely not the same value.

ex5 is similar but an illustration of partial knowledge.

afs · 2025-02-18T16:46:16Z

What do you think of such a separation of the issue into two sub-issues?

#194 deals with these together by having giving the contract for extensions.

I don't see the advantage of trying to split apart the proposed 4 conditions for separate discussions then putting them back together again.

I sketched the changes in the current #194 - I have started to remove the old text and editors notes for rdfTerm-equals (with the section reordering) and I'll push that ASAP. (PS Mostly done and pushed)

Tpt · 2025-02-18T17:01:15Z

+1 to this proposal.

A problem with having clause "2. If the arguments are the sameTerm then return TRUE." at position 2 is that "NaN"^^xsd:double = "NaN"^^xsd:double is true whereas IEEE 754-2008 and XML schema datatype specify that NaN ≠ NaN.

afs · 2025-02-18T17:38:15Z

NaN ≠ NaN.

Good point. And also ! (NaN ≠ NaN) is true.

The operator dispatch table will have sent doubles/floats to op:numeric-equal.
If sameValue becomes callable, we should call out this as an exception, , or note they are "same value" but not =.
Added to the "callable" editors note.

See also : https://www.w3.org/TR/rdf12-concepts/#dfn-literal-term-equality

afs · 2025-02-23T16:57:18Z

(long) discussion point in an editors' note in #194 on "NaN"s.

tl;dr:

I don't think there is a consistent choice and there are arguments for/against any of true, false or error. I prefer the line of argument that "sameTerm implies sameValue" because of term pattern matching.

The spec has a note that = has an operator mapping to op:numeric-equal.

afs added the spec:enhancement Change to enhance the spec without affecting conformance (class 2) –see also spec:editorial label Jan 24, 2025

afs self-assigned this Jan 24, 2025

afs mentioned this issue Jan 24, 2025

Broken links in SPARQL 1.2 Query Language #185

Open

1 task

afs added a commit that referenced this issue Feb 5, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

108e37c

afs mentioned this issue Feb 5, 2025

Revise RDFterm-equal and function sameTerm. Rename RDFterm-equal to sameValue. #194

Merged

afs added a commit that referenced this issue Feb 6, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

3e4eb6c

afs added a commit that referenced this issue Feb 13, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

d07541c

hartig mentioned this issue Feb 18, 2025

Renames 'RDFterm-equal' to 'sameValue' #197

Closed

afs added a commit that referenced this issue Feb 18, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

9787ec4

afs added a commit that referenced this issue Feb 18, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

389cbe6

afs added a commit that referenced this issue Feb 18, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

9e31ba9

afs added a commit that referenced this issue Feb 19, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

544322d

afs added a commit that referenced this issue Feb 19, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

83784f3

afs added a commit that referenced this issue Feb 23, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

a7ec54a

afs added a commit that referenced this issue Feb 23, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

5555e41

afs added a commit that referenced this issue Feb 23, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

49052af

afs mentioned this issue Apr 2, 2025

Should sameValue be callable? #201

Open

afs added a commit that referenced this issue Apr 4, 2025

GH-187: Revise RDFterm-equal; update RDF term equality

829967e

afs closed this as completed in #194 Apr 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refining RDFTerm-equals #187

Refining RDFTerm-equals #187

afs commented Jan 24, 2025 •

edited

Loading

rubensworks commented Jan 24, 2025

afs commented Jan 24, 2025

hartig commented Feb 18, 2025

afs commented Feb 18, 2025

afs commented Feb 18, 2025 •

edited

Loading

Tpt commented Feb 18, 2025

afs commented Feb 18, 2025

afs commented Feb 23, 2025 •

edited

Loading

Refining RDFTerm-equals #187

Refining RDFTerm-equals #187

Comments

afs commented Jan 24, 2025 • edited Loading

Example 1

Example 2

Example 3

Example 4

Example 5

Example 6

Proposal

rubensworks commented Jan 24, 2025

afs commented Jan 24, 2025

hartig commented Feb 18, 2025

afs commented Feb 18, 2025

afs commented Feb 18, 2025 • edited Loading

Tpt commented Feb 18, 2025

afs commented Feb 18, 2025

afs commented Feb 23, 2025 • edited Loading

afs commented Jan 24, 2025 •

edited

Loading

afs commented Feb 18, 2025 •

edited

Loading

afs commented Feb 23, 2025 •

edited

Loading