-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OAI-PMH => ORE? #149
Comments
F-UJI apparently has problems with the DOIs you provided, but the reason seems some re3data entries F-UJI is trying to use.. I'll try to fix this Regarding OAI-ORE, F-UJI is not yet able to handle this, can you please provide a endpoint URI which provides the ORE resource map file? |
On 13 Apr 2021, at 12:39, huberrob ***@***.***> wrote:
F-UJI apparently has problems with the DOIs you provided, but the reason seems some re3data entries F-UJI is trying to use.. I'll try to fix this
Thanks!
Regarding OAI-ORE, F-UJI is not yet able to handle this, can you please provide a endpoint URI which provides the ORE resource map file?
This would be eg https://data.hpc.imperial.ac.uk/resolve/?ore=6880 and is invoked as
<relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata">https://data.hpc.imperial.ac.uk/resolve/?ore=6880</relatedIdentifier>
I would like to see the latter better populated, perhaps with a schemaURI or some such, but its present state is largely historical.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub, or unsubscribe.
By the way
"FsF-R1.3-02D - Data is available in a file format recommended by the target research community"
we use Media types to specify this and these are declared in the ORE metadata. We use a community media type first set out in 1993 and is now widely accepted if not actually formalised. So arguably, we do comply with the below
"Data is available in a file format recommended by the target research community"
FsF-R1.3-01M - Metadata follows a standard recommended by the target research community of the data.
is more of a challenge, give how difficult it is to get any research community to agree standards. I am a member of an IUPAC (the standards body in chemistry) working party currently trying to achieve this.
Meanwhile we use <subject> declarations, including eg subjectscheme to achieve this. Attached is a typical example.
|
Rearding ORE, it would be not a big problem to implement this in F-UJI. Regarding the data standards, I think F-UJI could detect these formats, but only if there would be the possibility to find ORE files from the PID landing page as starting point (see above). I would love to learn more about the IUPAC approach re metadata standardisatio. I think many communities now switch to more generic formats like schema.org or DCAT which are quite flexible. |
On 13 Apr 2021, at 15:32, huberrob ***@***.***> wrote:
Rearding ORE, it would be not a big problem to implement this in F-UJI.
The problem for F-UJI with the given examples is that a link between the PID landing page and the ORE file seems to be missing.
According to the specification (http://www.openarchives.org/ore/1.0/http) content negotiation should be possible (e.g. Accept: application/atom+xml) but this seems not to work on your site. Alternatively, links to ORE could also be expressed in the head section of the landing page like this:
We have chosen to do this in the metadata record. We are focussing on machine processing rather than human processing. I tend to think that metadata records are for machines, whilst parochial landing pages are more for human benefit. Ideally both, but if one had to choose, I would go for machine processing over human processing!
But I could not find it in the example data sets you provided. Could you please check?
Regarding the data standards, I think F-UJI could detect these formats, but only if there would be the possibility to find ORE files from the PID landing page as starting point (see above).
It is trivial to insert and I will ask the programmers if they could do that.
I would love to learn more about the IUPAC approach re metadata standardisation. I think many communities now switch to more generic formats like schema.org or DCAT which are quite flexible.
Certainly, but chemistry in Schema.org is very very underdeveloped.
…
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
Great! can you please let me know once this is implemented? I will try to integrate ORE support immediately once this is ready.. |
Can I ask what the syntax indicated by my ??? would be? I think you meant to give an example here? I need to show it to the programmers so that it is inserted correctly.
On 13 Apr 2021, at 15:32, huberrob ***@***.***> wrote:
Rearding ORE, it would be not a big problem to implement this in F-UJI.
The problem for F-UJI with the given examples is that a link between the PID landing page and the ORE file seems to be missing.
According to the specification (http://www.openarchives.org/ore/1.0/http) content negotiation should be possible (e.g. Accept: application/atom+xml) but this seems not to work on your site.
could you give an example? Something like
curl -LH "Accept: application/atom+xml" https://doi.org/10.14469/hpc/6215 ?
Alternatively, links to ORE could also be expressed in the head section of the landing page like this:
??? Can you confirm eg
<link rel="alternate" href="https://data.hpc.imperial.ac.uk/resolve/?ore=7350" type="application/atom+xml" />
We also link to metadata as per
<relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata">https://data.hpc.imperial.ac.uk/resolve/?ore=7350</relatedIdentifier>
Should it be as below or do you ignore any schemeURI declarations?
<relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata" schemeURI="http://www.openarchives.org/ore/1.0/" >https://data.hpc.imperial.ac.uk/resolve/?ore=7350</relatedIdentifier>
Thanks!
|
could you give an example? Something like
curl -LH "Accept: application/atom+xml" https://doi.org/10.14469/hpc/6215
?
Yes this is exactly what
> Alternatively, links to ORE could also be expressed in the head section
of the landing page like this:
>
??? Can you confirm eg
<link rel="alternate" href="
https://data.hpc.imperial.ac.uk/resolve/?ore=7350"
type="application/atom+xml" />
You can do it this way, but this might be confused with standard feed
links, I would recommend to follow the ORE specs and use
<link rel="resourcemap" href="https://data.hpc.imperial.ac.uk/resolve/?ore=7350"type="application/atom+xml" />
Alternatively you can also follow the signposting conventions (
https://signposting.org/conventions) and use:
<link rel="describedby" href="https://data.hpc.imperial.ac.uk/resolve/?ore=7350"type="application/atom+xml" />
We also link to metadata as per
<relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata">
https://data.hpc.imperial.ac.uk/resolve/?ore=7350</relatedIdentifier>
Do you mean your DataCite metadata record? I will check this...
|
I am trying to isolate the causes of failures for https://doi.org/10.14469/hpc/6216
The metadata record for this is attached and the failures in this test are itemised below.
1. FsF-F3-01M - Metadata includes the identifier of the data it describes. The metadata includes eg
<identifier identifierType="DOI">10.14469/HPC/6216</identifier>
can you help identify why this is failing item 1 above?
2. FsF-A1-01M - Metadata contains access level and access conditions of the data and
FsF-A1-03D - Data is accessible through a standardized communication protocol.
Can I presume that when eg <relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata">https://data.hpc.imperial.ac.uk/resolve/?ore=6216</relatedIdentifier>
is elevated to <link rel="resourcemap" href="https://data.hpc.imperial.ac.uk/resolve/?ore=6216"
type="application/atom+xml" />
this will address items 2 above?
3. Metadata uses semantic resources
Can you give an example of how to specify semantic resources. I presume by an ontology?
4. Metadata specifies the content of the data.
I presume once
<link rel="resourcemap" href="https://data.hpc.imperial.ac.uk/resolve/?ore=6216" type="application/atom+xml" />
is declared, this will retrieve eg the media type of the data, which is declared at
https://data.hpc.imperial.ac.uk/resolve/?ore=6216
5. FsF-R1.3-02D - Data is available in a file format recommended by the target research community.
Will this also be addressed by item 4 (declaration of media types). But how does one declare the target research community itself. We use a subject line to do this
<subject subjectScheme="inchikey">KKOOECZDSKSVIG-UHFFFAOYSA-N</subject>
but this is currently missing a schemeURI declaration. Would that address the problem?
You kindly sent us emails earlier regarding eg <link rel="resourcemap" href="https://data.hpc.imperial.ac.uk/resolve/ ore=6216" type="application/atom+xml" />
Although easy to implement, our ICT services take their time to cascade this through their processes and we are still waiting for this to be implemented.
Thanks!!
… On 15 Apr 2021, at 10:12, huberrob ***@***.***> wrote:
10.14469/hpc/6215
|
I'll try to answer your questions below:
Actually, F-UJI tests the presence of data links here, not the inclusion of the DOI or so in metadata (this is done in FsF-F2-01M). The output for FsF-F3-01M is a bit confusing..
For FsF-A1-01M access levels such as 'closed' 'public' have to be defined such as http://vocabularies.coar-repositories.org/documentation/access_rights/ The resourcemap will not solve this problem since these rights have to be explicitely declared in the metadata.
Yes, F-UJI will identify media types and file sizes in the resourcemap which is sufficient to verify the data content.
The test does not require to specify a community or discipline, F-UJI checks if data formats are community specific (in your case chemistry) or generic scientific file formats (matlab etc..) As far as I can see, the dataset provided will pass the test based on the resourcemap info provided. I understand that your ICT staff needs some time to embed the resource map link in the landing page, no problem, just let me know once this is ready.. best regads, |
Thanks Robert! I do appreciate the time you are taking to answer my questions.
One more!!
For FsF-A1-01M access levels such as 'closed' 'public' have to be defined such as http://vocabularies.coar-repositories.org/documentation/access_rights/ The resourcemap will not solve this problem since these rights have to be explicitely declared in the metadata.
But FsF-A1-03D will be passed using the resourcemap info in case F-UJI finds links to data files there.
So in our metadata we have eg
<rightsList>
<rights rightsURI="https://creativecommons.org/publicdomain/zero/1.0/legalcode" rightsIdentifier="cc0-1.0" rightsIdentifierScheme="SPDX" schemeURI="https://spdx.org/licenses/">Creative Commons Zero v1.0 Universal</rights>
</rightsList>
But
shows a controlled vocabulary with the term "open access".
But the current DataCite schema does not have this term specified anywhere.
So I would appreciate if you could send an example of the syntax needed for eg DataCite metadata to declare this property.
Or alternatively, any eg <link> syntax for the landing page/root document which could also server the purpose. I would prefer the former.
OR: That you recognise eg
<rightsList>
<rights rightsURI="https://creativecommons.org/publicdomain/zero/1.0/legalcode" rightsIdentifier="cc0-1.0" rightsIdentifierScheme="SPDX" schemeURI="https://spdx.org/licenses/">Creative Commons Zero v1.0 Universal</rights>
</rightsList>
as appropriate? I appreciate that re-use is not the same as access however!
Thanks Again
Henry
|
Dear Henry, You can also use other vocabularies: http://vocabularies.coar-repositories.org/documentation/access_rights/ (some of them unfortunaletly are currently offline) DataCite schema is very focused on licenses regarding their rightsList element. But According to their specs ("Any rights information for this resource"), it can also be used to define access rights and conditions. I could imagine something like:
But maybe @mfenner can explain this in more detail..
Is recognized by F-UJI as licence information which is good since this is needed to pass metric FsF-R1.1-01M (Metadata includes license information under which data can be reused.) Robert |
Robert,
I will be presenting F-UJI to an IUPAC standards body working party today. Do you have a reference example which passes all the validation tests that I could show them?
(Hopefully our site will eventually also fully validate, but not quite there yet!)
Henry
|
Dear Henry,
unfortunately the v.1.0.6 has some problems with your DOIs I'll try to
install v.1.1.0 to f-uji.net tomorrow...
Robert
…On Mon, May 3, 2021 at 3:29 PM hrzepa ***@***.***> wrote:
Robert,
I will be presenting F-UJI to an IUPAC standards body working party today.
Do you have a reference example which passes all the validation tests that
I could show them?
(Hopefully our site will eventually also fully validate, but not quite
there yet!)
Henry
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#149 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AACW5R64XRFRA7ELCS7I4XTTL2QLJANCNFSM423DVZEQ>
.
--
Dr. Robert Huber,
PANGAEA - www.pangaea.de
_____________________________________________
MARUM - Center for Marine Environmental Sciences
University Bremen
Leobener Strasse
POB 330 440
28359 Bremen
Phone ++49 421 218-65593, Fax ++49 421 218-65505
e-mail ***@***.***
|
Robert,
If 1.1.0 does not solve the issue, do let me know which specific DOIs are causing problems. Our repository was fine until our institute (the Imperial library ) decided to put an extra layer of protection into the process of sending metadata to DataCite which would need direct intervention by a librarian! Sometimes it can take a little while.
It would be good to have some reference sites which pass ALL your tests. They would probably all do so in a host of different ways, but would serve as a nice illustration to others of the usefulness of doing this.
Henry
… On 3 May 2021, at 16:41, huberrob ***@***.***> wrote:
Dear Henry,
unfortunately the v.1.0.6 has some problems with your DOIs I'll try to
install v.1.1.0 to f-uji.net tomorrow...
Robert
On Mon, May 3, 2021 at 3:29 PM hrzepa ***@***.***> wrote:
> Robert,
>
> I will be presenting F-UJI to an IUPAC standards body working party today.
> Do you have a reference example which passes all the validation tests that
> I could show them?
>
> (Hopefully our site will eventually also fully validate, but not quite
> there yet!)
>
> Henry
|
I just tested https://doi.org/10.14469/hpc/6215 using v1.1.1 @
https://www.f-uji.net and it works fine!
…On Mon, May 3, 2021 at 6:28 PM hrzepa ***@***.***> wrote:
Robert,
If 1.1.0 does not solve the issue, do let me know which specific DOIs are
causing problems. Our repository was fine until our institute (the Imperial
library ) decided to put an extra layer of protection into the process of
sending metadata to DataCite which would need direct intervention by a
librarian! Sometimes it can take a little while.
It would be good to have some reference sites which pass ALL your tests.
They would probably all do so in a host of different ways, but would serve
as a nice illustration to others of the usefulness of doing this.
Henry
> On 3 May 2021, at 16:41, huberrob ***@***.***> wrote:
>
>
> Dear Henry,
>
> unfortunately the v.1.0.6 has some problems with your DOIs I'll try to
> install v.1.1.0 to f-uji.net tomorrow...
>
>
> Robert
>
> On Mon, May 3, 2021 at 3:29 PM hrzepa ***@***.***> wrote:
>
> > Robert,
> >
> > I will be presenting F-UJI to an IUPAC standards body working party
today.
> > Do you have a reference example which passes all the validation tests
that
> > I could show them?
> >
> > (Hopefully our site will eventually also fully validate, but not quite
> > there yet!)
> >
> > Henry
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#149 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AACW5RZBODJOWBHFJYIF3CTTL3FLBANCNFSM423DVZEQ>
.
--
Dr. Robert Huber,
PANGAEA - www.pangaea.de
_____________________________________________
MARUM - Center for Marine Environmental Sciences
University Bremen
Leobener Strasse
POB 330 440
28359 Bremen
Phone ++49 421 218-65593, Fax ++49 421 218-65505
e-mail ***@***.***
|
Great. Now for the additions at our end to enable all the FAIR tests to be passed!!
Thanks Robert!
Henry
… On 4 May 2021, at 09:28, huberrob ***@***.***> wrote:
I just tested https://doi.org/10.14469/hpc/6215 using v1.1.1 @
https://www.f-uji.net and it works fine!
On Mon, May 3, 2021 at 6:28 PM hrzepa ***@***.***> wrote:
|
I just discovered this issue by means of a link in #65. A few remarks:
|
Thanks Herbert. Robert has indeed suggested
<link rel="resourcemap" href="https://data.hpc.imperial.ac.uk/resolve/?ore=7350" type="application/atom+xml" />
which I hope will be implemented soon.
I also agree that content negotiation is probably not the optimal method.
Finally thanks for "The spec describing the representation of ResourceMaps in Atom was deprecated quite a while ago, see https://www.openarchives.org/ore/1.0/toc. There is a spec describing a https://www.openarchives.org/ore/0.9/jsonld though" Our implementation dates from 2016, but it was probably obsoleted even then!
… On 12 May 2021, at 14:57, Herbert Van de Sompel ***@***.***> wrote:
I just discovered this issue by means of a link in #65. A few remarks:
• Discovery of OAI-ORE Resource Maps is described in the ORE Resource Map Discovery guidelines. Of particular interest, IMO, is the approach that supports discovery of Resource Maps by means of typed links at the level of the landing page of the object described by the Resource Map, using <link> in HTML or a Link header in the HTTP response header, both using the "resourcemap" link relation type, see https://www.openarchives.org/ore/1.0/discovery#ResourceEmbedding. This discovery approach is also illustrated in https://www.openarchives.org/ore/1.0/http#MultiSplash of the ORE HTTP Implementation spec.
• The content negotiation approach, described in the same section, could be implemented by introducing a new URI (A-1 of the Aggregation in the pic) to which the DOI redirects (you can't implement content negotiation at the DOI because you don't control it). At that new URI one could then implement content negotiation for the landing page (description of the object for humans) or a Resource Map (description of the object for machines).
• But you're probably not going to go through the effort of introducing such a new URI, changing the DOI redirections, and implementing content negotiation. But you still need a dedicated URI for the Aggregation according to OAI-ORE and the Linked Data principles it is based on. Which I don't see in the Resource Map e.g. https://data.hpc.imperial.ac.uk/resolve/?ore=6216. A simple approach is to append a #aggregation at the end of the Resource Map URI to obtain a dedicated URI for the Aggregation, see https://www.openarchives.org/ore/1.0/http#Simple.
• Note that the OAI-ORE vocabulary includes the term ore:similarTo to express the relation between the Aggregation and, for example, the DOI (HTTP version) of the object described by the Resource Map associated with the Aggregation, see https://www.openarchives.org/ore/1.0/datamodel#ore:similarTo
• The spec describing the representation of ResourceMaps in Atom was deprecated quite a while ago, see https://www.openarchives.org/ore/1.0/toc. There is a spec describing a https://www.openarchives.org/ore/0.9/jsonld though.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
We use ORE for harvesting rather than OAI-PMH. Can you detect this?
Eg https://doi.org/10.14469/hpc/6215 or https://doi.org/10.14469/hpc/6880
The text was updated successfully, but these errors were encountered: