Skip to content

Latest commit

 

History

History
46 lines (36 loc) · 2.07 KB

README.md

File metadata and controls

46 lines (36 loc) · 2.07 KB

CatalogueOfLife to RDF converter

Goal

To work towards a version of Synospecies being as inclusive as possible by including the taxonomic names available in Checklistbank.

Approach

To include the taxonomic names in ChecklistBank, their API is used to download the names which then are conveted into RDF. The results are accessible in the new version of Synospecies.

Usage: Whenever a tag is pushed or a release is created, the gihub action will run and attach a gzipped col.ttl.gz file to the release.

  • query.sparl is meant to be used with tarql to generate RDF from the Taxon.tsv file contained in the COL DWC archive
wget "https://api.checklistbank.org/dataset/3LR/export.zip?format=DwCA&extended=true" -O dwca.zip
unzip dwca.zip Taxon.tsv
tarql --tabs query.sparql Taxon.tsv > col.ttl

The result looks like this:

@prefix dwc:  <http://rs.tdwg.org/dwc/terms/> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .

<https://www.catalogueoflife.org/data/taxon/47BF5>
        dwc:taxonName  "Neurotheca congolana De Wild. & T. Durand" ;
        dwc:parent     <https://www.catalogueoflife.org/data/taxon/63SP> ;
        dwc:rank       "species" ;
        dwc:species    "congolana" .

<https://www.catalogueoflife.org/data/taxon/5BVQY>
        dwc:taxonName  "Weberbauerocereus cephalomacrostibas (Werderm. & Backeb.) F. Ritter" ;
        dwc:parent     <https://www.catalogueoflife.org/data/taxon/87PB> ;
        dwc:rank       "species" ;
        dwc:species    "cephalomacrostibas" .

<https://www.catalogueoflife.org/data/taxon/3B4PY>
        dwc:taxonName  "Eriogonum hastatum Wiggins" ;
        dwc:parent     <https://www.catalogueoflife.org/data/taxon/62RM7> ;
        dwc:rank       "species" ;
        dwc:species    "hastatum" .

<https://www.catalogueoflife.org/data/taxon/B52Y3>
        dwc:taxonName  "Ericoides virescens (Thunb.) Kuntze" ;
        dwc:rank       "species" ;
        dwc:species    "virescens" .